Gene Namu_3668 details


Gene Information

Locus tag: Namu_3668
Symbol:
ID: 8449287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4021415
End bp: 4023109
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 67%
IMG OID: 645042733
Product: IstA2
Protein accession: YP_003202969
Protein GI: 258653813
COG category: [L] Replication, recombination and repair
COG ID: [COG4584] Transposase and inactivated derivatives
TIGRFAM ID:
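
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4021415
end_bp = 4023109

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be the stop codon.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the script reproduces the listed Gene Length and Protein Length.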


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0182144
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.155285
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTCCA GAGTGGTGCT GTTCGAGCAG ATCCGACGTG ATTCCCGGAT CGAGGGCCTG 
TCGGTGCGGG CGCTGGCCAA ACGTCACCAT GTTCATCGGC GCACGGTGCG CCAAGCGTTG
GCGTCCGCGA CTCCACCGCC GCCGGCCCGC CGGGTGTGGA AGCGAACGAA GATCACCCCG
TTCGCGGCGG CGATCGATGA CATGTTGCGG GCGGATCTGA CGGCCCCACG GAAGCAGCGC
CACACGGTGG TACGGATCCT GGACCGGTTG GTCGACGAAC ACGGCGCGAC GGATCTGACC
TATGGGACGG TCCGGGCGTA TGTGGCGCAG CGGCGCCCGG AGATCAACGC CGAGGCCGGG
CGGCCGGTCG CTGAGGTCTT CATCGCACAA ACGCATCAGC CCGGGGCGGA GGCCGAGGTC
GACTTCGCCG AGTTGTGGGT GGACCTGCCG GCCGGGCGGA CCAAGTGCTA CTTGTTCACG
CTGCGGCTGT GTTTCTCCGG TCGGGCGGTC CACCGGGTGT TCGCGACCCA GTCGCAGGAG
GCGTTCCTGG AAGGTCACAT CGACGCGTTC ACCGAGCTGG GCGGGATCCC GGTCAAGCAC
ATCAAGTACG ACAACTTGAA GTCCGCAGTC ACGACGGTGC TGTTTGGGAA CAACCGACGG
CGGACCGAGA ACGACCGATG GGTGCTGTTC CGGTCCCATT ACCAGTTCGA CTCGTTCTAC
TGCATGCCCG GCGTGGACGG CGCCCACGAG AAGGGTGGAG TCGAGGGTGA GGGTGGCCGA
TTCCGCCGCA ACCACCTCGT CCCGGTCCCG AAAGTGGCCT CGTTGGCCGA GCTGAACGTC
CGGCTCGCGA AGGCGGACCG GGCCGATGAT CACCGCCGAA TCAGCGGACA GGTCCGCACC
GTCGGTGACA TGTTCGCCAT CGAGCAACAG ATGCTGCACC CGTTGCCCGT GGAGGTGTTC
GAACCGGGCC TGACGATGAA CCCGCGGGTC GATCGGCACG CCCGGATCAT GGTCCGCAAC
GTGCAGTACT CGGTCCCGGC CCGGTTCATC AACCGGCGGG TCCGGGTCAT CTTGCGGTCC
AACGAGGTGA TCGTGTTCGA CGGTCGCACC CAGCTCGTCC GGCACGAACG GTCCAGCCGG
AAAGGCTCCC AGGTGCTGGT CCTGGATCAC TACCTGGAGG TGCTGCGGTT CAAGCCCGGC
GCCCTGCCCG GGGCGACGGC ATTGGTCCCG GCCCGGGCGA ACGGGTCGTT CACCACCGTG
CACGAGGCGT TCTGGGCGGC CGCCCGGAAA GCTCACGGCG ACGCGGCGGG CACCCGGGAA
CTGATCGAGG TGTTGTTGTT GCACCGGCAC ATGACCCATG CCGATGTCGT CGCCGGTCTG
CAGGCCGCCC TCACGGTCGG CGCGAGCAAC GCCGACGTCG TCGCTGTCGA GGCCCGCAAA
CACCAGACCG CAGCGGGTGG GGCCGGCCAA CGGCACCATC CCGTCGATCA GAACGCGGGC
GTTGAGCAAC GTGTTGTCAG CCTGACCGAA CGCCGGCTCG CCGATCCGGC TGCGGTCATC
GCCGGGCTAC CGACAGACTC CAGGCCGTTG CCGTCCGTCG CTGAGTACGA CGAGCTGCTG
ATCCGGCGCC GCATCACACC GACCGAGACC ACCACCACCA TTGATGTCGC CGCGCAGAAG
GGGAACGTGT CATGA
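
The GC content field in the Gene Information section can be recomputed from a sequence laid out like the one above. The following is a minimal sketch, not the method used by the source database: the helper name `gc_fraction` is hypothetical, and it assumes the sequence contains only A, C, G and T separated by spaces or line breaks.

```python
def gc_fraction(dna):
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Example on the first three 10-base blocks of the gene sequence only.
first_blocks = "ATGAGGTCCA GAGTGGTGCT GTTCGAGCAG"
print(f"{gc_fraction(first_blocks):.1%}")
```

Passing the full gene sequence instead of the short excerpt gives the gene-wide value for comparison with the listed GC content.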
 
Protein sequence
MRSRVVLFEQ IRRDSRIEGL SVRALAKRHH VHRRTVRQAL ASATPPPPAR RVWKRTKITP 
FAAAIDDMLR ADLTAPRKQR HTVVRILDRL VDEHGATDLT YGTVRAYVAQ RRPEINAEAG
RPVAEVFIAQ THQPGAEAEV DFAELWVDLP AGRTKCYLFT LRLCFSGRAV HRVFATQSQE
AFLEGHIDAF TELGGIPVKH IKYDNLKSAV TTVLFGNNRR RTENDRWVLF RSHYQFDSFY
CMPGVDGAHE KGGVEGEGGR FRRNHLVPVP KVASLAELNV RLAKADRADD HRRISGQVRT
VGDMFAIEQQ MLHPLPVEVF EPGLTMNPRV DRHARIMVRN VQYSVPARFI NRRVRVILRS
NEVIVFDGRT QLVRHERSSR KGSQVLVLDH YLEVLRFKPG ALPGATALVP ARANGSFTTV
HEAFWAAARK AHGDAAGTRE LIEVLLLHRH MTHADVVAGL QAALTVGASN ADVVAVEARK
HQTAAGGAGQ RHHPVDQNAG VEQRVVSLTE RRLADPAAVI AGLPTDSRPL PSVAEYDELL
IRRRITPTET TTTIDVAAQK GNVS
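
The protein sequence above corresponds to the gene sequence read in triplets with translation table 11. The sketch below is one way to reproduce that relationship without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts; because this gene begins with ATG, alternative start codons are not handled. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tooling.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGAGGTCCA GAGTGGTGCT GTTCGAGCAG"))
```

The excerpt translates to the first ten residues of the protein sequence; the same function can be applied to the complete gene sequence to compare against the full 564-residue protein.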