Gene Namu_2232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2232 
Symbol 
ID8447843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp2461088 
End bp2462488 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content65% 
IMG OID645041354 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003201598 
Protein GI258652442 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.000585875 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.443218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATGACCA AACAGGTCGA GGAGATCCTC GACGAACCCC ATGAACAGAT CGTGCAGCGG 
GTCTGCGCGA TCGACGTGGG CAAGGACTCG GGCACAGTCT GCGTTCGCGT ACCAGCCGCG
TCCGGGACGG GTCGGCGAGT GAGCAAAGTC TGGGACGTCC CGGCCCGAAC CAGAGCGGTC
CTGGGCCTGG CCGCACAGCT GTCAGACCAG GGCATCGAGA AGGTGACCCT GGAATCGACT
TCGGACTACT GGCGGATCTG GTTCTATCTG CTGGAAGCTC ATGGCCTGGA CGTGCAGTTG
GTCAATGCCC GCGATGTCAA GAACGTCCCC GGTCGCCCCA AAACGGACAA GCTGGACAGT
GTGTGGTTGG CCAAGCTCAC CGAGAAGGGG CTGTTGCGTC CATCGTTCGT GCCATCAGCG
CAGGTCCGGC AGTTGCGCGA CTACACCCGG ATGCGGGCCG ATCTGACCGG CGACCGGACC
AGGTACTGGC AACGGCTGGA GAAGCTGCTG GAGGACGCCC TGATCAAGGT CACCTCCGTG
GCGAGCAGGA TCGACACCCT GTCCGTCCGG GACATGATTG AGGCCCTGAT CGCGGGCCAG
CGGGACCCGC GGGTTCTGGC CGGCATGGCC CGCGGCCGGA TGCGGCTCAA GTACGCCGAC
CTGGTCGAGT CGCTGACCGG TCAGTTCGAC GATCATCACG CCGAGCTGGC CCGGATGCTG
CTGCATCAGA TCGACACGCT GACCGATCAG ATCGACGTCC TGACCGCACG CATCGAGGCA
CTCCTGGCCA GCTTGCCGGC CGGTAACACC CCCGATCCGG ACCGCCCCGC ACCGGATGGC
CAAACTCGGC CCGGTACGAG GGCTAACGCA CCCGCCGACG AGGCGGCCCA GCGCCGGACA
CCGCCGACGG CCGCAGACAT GATCAAGATC CTGGACCAGA TACCCGGGAT CGGCCCAAGC
AACGCACAGG TCATCATCGC CGAGATCGGG CTGGACATGA GCCGGTTCCC GACCGCTGGC
CATCTGGTGT CCTGGACCCG GCTGTGCCCC CGCACGATCC AGTCCGGGAA ACGATCAACA
ACCGGTAAGA CCGGCAAGGG CAACCGTTAC CTGCGCGCCG TGCTTGGTGA AGCGGCCGCG
ACCGGCGGCA AGACCCAAAC CTTCCTGGGA GAACGCTATC GACGCCTGAT CAAACGCCGC
GGCAAACTCA AAACGATCGT CGCCATCGCC CGATCCATCC TTGTCATCAT CTGGCACCTG
CTCGCCAACC CCGGCACGAC CTTCCACGAC CTCGGCGTCG ATTTCAACGA CCACCGCATC
GACATCGGAC GCCGAACCCG TAACCACGTC CGGCAACTCG AAGCCCTCGG CTTCAACGTC
ACCCTGACCG CGGCCGCCTA A
 
Protein sequence
MMTKQVEEIL DEPHEQIVQR VCAIDVGKDS GTVCVRVPAA SGTGRRVSKV WDVPARTRAV 
LGLAAQLSDQ GIEKVTLEST SDYWRIWFYL LEAHGLDVQL VNARDVKNVP GRPKTDKLDS
VWLAKLTEKG LLRPSFVPSA QVRQLRDYTR MRADLTGDRT RYWQRLEKLL EDALIKVTSV
ASRIDTLSVR DMIEALIAGQ RDPRVLAGMA RGRMRLKYAD LVESLTGQFD DHHAELARML
LHQIDTLTDQ IDVLTARIEA LLASLPAGNT PDPDRPAPDG QTRPGTRANA PADEAAQRRT
PPTAADMIKI LDQIPGIGPS NAQVIIAEIG LDMSRFPTAG HLVSWTRLCP RTIQSGKRST
TGKTGKGNRY LRAVLGEAAA TGGKTQTFLG ERYRRLIKRR GKLKTIVAIA RSILVIIWHL
LANPGTTFHD LGVDFNDHRI DIGRRTRNHV RQLEALGFNV TLTAAA