Gene Namu_2973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_2973 
Symbol 
ID8448586 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp3258826 
End bp3260226 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content65% 
IMG OID645042058 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_003202300 
Protein GI258653144 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.0717724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0172565 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATGACCA AACAGGTCGA GGAGATCCTC GACGAACCCC ATGAACAGAT CGTGCAGCGG 
GTCTGCGCGA TCGACGTGGG CAAGGACTCG GGCACAGTCT GCGTTCGCGT ACCAGCCGCG
TCCGGGACGG GTCGGCGAGT GAGCAAAGTC TGGGACGTCC CGGCCCGAAC CAGAGCGGTC
CTGGGCCTGG CCGCACAGCT GTCAGACCAG GGCATCGAGA AGGTGACCCT GGAATCGACC
TCGGACTACT GGCGGATCTG GTTCTATCTG CTGGAAGCTC ATGGCCTGGA CGTGCAGTTG
GTCAATGCCC GCGATGTCAA GAACGTCCCC GGTCGCCCCA AAACGGACAA GCTGGACAGT
GTGTGGTTGG CCAAGCTCAC CGAGAAGGGG CTGTTGCGTC CATCGTTCGT GCCATCAGCG
CAGGTCCGGC AGTTGCGCGA CTACACCCGG ATGCGGGCCG ATCTGACCGG CGACCGGACC
AGGTACTGGC AACGGCTGGA GAAGCTGCTG GAGGACGCCC TGATCAAGGT CACCTCCGTG
GCGAGCAGGA TCGACACCCT GTCCGTCCGG GACATGATTG AGGCCCTGAT CGCGGGCCAG
CGGGACCCGC GGGTTCTGGC CGGCATGGCC CGCGGCCGGA TGCGGCTCAA GCACGCCGAC
CTGGTCGAGT CGCTGACCGG TCAGTTCGAC GATCATCACG CCGAGCTGGC CCGGATGCTG
CTGCATCAGA TCGACACGCT GACCGATCAG ATCGACGTCC TGACCGCACG CATCGAGGCA
CTCCTGGCCA GGTTGCCGGC CGGTAACACC CCCGATCCGG ACCGCCCCGC ACCGGATGGC
CAAACTCGGC CCGGTACGAG GGCGAACGCA CCCGCCGACG AGGCGGCCCA GCGCCGGACA
CCGCCGACGG CCGCAGACAT GATCAAGATC CTGGACCAGA TACCCGGGAT CGGCCCAAGC
AACGCACAGG TCATCATCGC CGAGATCGGG CTGGACATGA GCCGGTTCCC GACCGCTGGC
CATCTGGTGT CCTGGACCCG GCTGTGCCCC CGCACGATCC AGTCCGGGAA ACGATCAACA
ACCGGTAAGA CCGGCAAGGG CAACCGTTAC CTGCGCGCCG TGCTCGGTGA AGCGGCCGCG
ACCGGCGGCA AGACCCAAAC CTTCCTGGGA GAACGCTATC GACGCCTGAT CAAACGCCGC
GGCAAACTCA AAACGATCGT CGCCATCGCC CGATCCATCC TTGTCATCAT CTGGCACCTG
CTCGCCAACC CCGGCACGAC CTTCCACGAC CTCGGCGTCG ACTTCAACGA CCAACGCATC
GACATCGGAC GCCGAACCCG TAACCACGTC CGGCAACTCG AAGCCCTCGG CTTCAACGTC
ACCCTGACCG CGGCCGCCTA A
 
Protein sequence
MMTKQVEEIL DEPHEQIVQR VCAIDVGKDS GTVCVRVPAA SGTGRRVSKV WDVPARTRAV 
LGLAAQLSDQ GIEKVTLEST SDYWRIWFYL LEAHGLDVQL VNARDVKNVP GRPKTDKLDS
VWLAKLTEKG LLRPSFVPSA QVRQLRDYTR MRADLTGDRT RYWQRLEKLL EDALIKVTSV
ASRIDTLSVR DMIEALIAGQ RDPRVLAGMA RGRMRLKHAD LVESLTGQFD DHHAELARML
LHQIDTLTDQ IDVLTARIEA LLARLPAGNT PDPDRPAPDG QTRPGTRANA PADEAAQRRT
PPTAADMIKI LDQIPGIGPS NAQVIIAEIG LDMSRFPTAG HLVSWTRLCP RTIQSGKRST
TGKTGKGNRY LRAVLGEAAA TGGKTQTFLG ERYRRLIKRR GKLKTIVAIA RSILVIIWHL
LANPGTTFHD LGVDFNDQRI DIGRRTRNHV RQLEALGFNV TLTAAA