Gene Namu_5208 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_5208 
Symbol 
ID8450839 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5805372 
End bp5806643 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content65% 
IMG OID645044239 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003204463 
Protein GI258655307 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones46 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGGTGA TCGACGACGA CGAGATCGCC TACGGCCGGG TCGCGGGGAT GGACATCTCC 
AAGCGCGATG TCAAGGTCGC GGTCCGGCTG ATCGAGAACG GCCGGGTCAA ACGACTCAAG
GTCCGCACCT TCGCCACGAC GACACCATCG CTGCTGCGGC TACGGGACTG GCTGACCGAA
CTGGACATCG AGCTGGTCGC GATGGAATCG ACCGGGGTGT ACTGGAAACC ATTGTTCCTG
GTGCTGGAAG ACCAGTTCAG ATGCTGGCTA CTCAACCCGC GGGACGTCAA ACGGGTACCG
GGCAACAAGA CCGACGTCAA GGACGCCGAA TGGATCGCTC GGATGGCTCA GCTCGGGCTC
GTCACACCCT CGTTCGTGCC GGACATCCCG GTGCGGGAGC TGCGGGAGTT GACCCGGTAC
CGCAGCAACG TGGTCCGCGA CCGGACCCGC GCCGTGCAAC GGCTGCAGGA TCTGCTCGAA
TCCGCCGGGA TCAAGCTGTC CAGCACGGTC AGCGACATCA CCGGCAAATC GGCCACCGCG
ATGCTGCACG CGATGATCAA CGACCCGGCC GGGGCGATGA GTGATCCCCG CCAGGTCGCG
GACCTGGCGC TGGTCCGGAT GCGCAGCAAG CTCCCGGAAC TGACCGAGGC CCTGACCGGA
CACTTCACCG ACCACCACGC GCGACTGGCC GCCACGATGA TGCGACAGAT CAACGACCTC
GACACGCTCC TCACGGACCT GGACCAACAG ATCGACCAGG AGATCGCCCC TTTCGCCCGA
GCGGTTGACC ACCTCGAGAC GATCCCGGGA GTCAGCCGCC GCGCCGCGAT GATCATCGTC
TCCGAGATCG GGATCGACAT GAGCCGGTTC ACCGGGACCG ACCGGCTGGC CTCGTGGGCC
GGGCTCAGCC CCGGGAACAA CGAATCAGCC GGACGACACC TCTCGACCCG GACCCGCAAG
GGCAACCGGT CACTACGCGC AGTGCTGTTC CAGTGCGCCA AGGCCGCCGC CCGCACCCAC
AGCACGTACC TGGCCGCGAA GTACGCCGAT CTGTGTACCC GGATGAAACC AACCAAAGCC
CTGGTCGCGA TCTCCAGGAT CATCCTGGAG ACCTGTCACC ACCTGATCCG CAAAGACACC
GACTACCACG ACCTCGGACC CACCTACCTG AACGAATACC GCCGCCGCGA CCTGGACAAG
AACCGGATCC GCCGCGCCCG CACCATCCTC GAAACCAACG GCTACACCGT CACCCACAAC
GACGCCGCAT GA
 
Protein sequence
MPVIDDDEIA YGRVAGMDIS KRDVKVAVRL IENGRVKRLK VRTFATTTPS LLRLRDWLTE 
LDIELVAMES TGVYWKPLFL VLEDQFRCWL LNPRDVKRVP GNKTDVKDAE WIARMAQLGL
VTPSFVPDIP VRELRELTRY RSNVVRDRTR AVQRLQDLLE SAGIKLSSTV SDITGKSATA
MLHAMINDPA GAMSDPRQVA DLALVRMRSK LPELTEALTG HFTDHHARLA ATMMRQINDL
DTLLTDLDQQ IDQEIAPFAR AVDHLETIPG VSRRAAMIIV SEIGIDMSRF TGTDRLASWA
GLSPGNNESA GRHLSTRTRK GNRSLRAVLF QCAKAAARTH STYLAAKYAD LCTRMKPTKA
LVAISRIILE TCHHLIRKDT DYHDLGPTYL NEYRRRDLDK NRIRRARTIL ETNGYTVTHN
DAA