Gene Namu_3950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_3950 
Symbol 
ID8449569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4358219 
End bp4361089 
Gene Length2871 bp 
Protein Length956 aa 
Translation table11 
GC content71% 
IMG OID645042995 
ProductNADH/Ubiquinone/plastoquinone (complex I) 
Protein accessionYP_003203231 
Protein GI258654075 
COG category[C] Energy production and conversion
[P] Inorganic ion transport and metabolism 
COG ID[COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0588976 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGCC CGACCGGTCT GTTCGCCGTC GCGCCCTGGC TGCTGCTGCT GTACCTGGCG 
CTGGCGATCC TCGCTCCGGC GCTGTGCCGC CGGCTCGGCC GCGGCGCGCT GATCGTGCTT
GCCCTGCTGC CGGCCGCCAC CACCGTCTGG GCCGCGGCCA TGCTGCCGTC GGTGATGAAC
GGCGGGGCGC TGACGTCGTC CACCCGGTGG GTGCCGGCCC TCGACCTGGC CATCGACCTG
CGGATGGACG CACTGGCGAT GGCGTTGACG TTGATCATCG CGTTCATCGG CATGCTGGTG
TTGCTGTACT CGGCCCGGTA CTTCCCGGCC GACGACGACG GCCTGAGCAA GTACAGCGGC
TCCCTGGTCG CGTTCGCCGG GGCCATGCTC GGCCTGGTCT GGGCGAACAA CCTGATCCTG
CTGGTGGTCT GCTGGGAGCT CACCGGCATC CTGTCCTACC TGCTCATCGC CCACCGCGCG
GGCAAGAAGT CGTCCCGGAC GGCCGCGTCG CAGGCGCTGA TGGTGACCAC CGCGGGCGGG
CTGGTCATGC TGATCGGCGC GGTGATGCTG GGCACACTGG CCGGCACCTT CACCATCTCG
GAGATCCTGG CCGCCCCGCC GTCCGGCCCG CTGGTCACCG TCTCGGTGCT GCTGCTGATC
GTCGGCGGCA TCAGCAAGTC GGCAATCGTG CCGTTCCAGT TCTGGCTGCC CGGGGCCATG
GCCGCACCCA CCCCGGCCAG CGCGTACCTG CACGCCGCGA CCATGGTCAA GGCCGGCATC
TACCTGTTCC TGCGGCTCGC CCCGGCGTTC GCCACGCTGC CGGTGTGGCA GCCGGTGCTG
ATCGCGCTGG GCGGCGGCAC CATGCTCTTC GGCGCGGTGC TGGCCCTGCG CCAGCGTGAC
CTCAAGCTGC TGCTGGCCTA CGGCACCGTC TCCCAGCTGG GGTTGATGAC CCTGGTCATC
GGCACCGGCA ACCCGGATGC GCTGCTGGGC GGGCTGGCCA TGCTGATCGC CCACGCCACG
TTCAAGGCGC CCCTGTTCTT CACCGTCGGC ATCATCGACA CCACCACCGG CACCCGCGAC
CTGACCAAGC TGTCCGGACT GGGCCGCCGG ATGCCTCTGC TGGCCACGTT GGCCGTGCTG
TCCGCGTTGA GCATGGCCGG CATCCCCCCG ATGCTCGGCT TCGCGGCCAA GGAAGCCGAC
TACGCCGGCC TGCTCGAGGG CGGTACCGGC GGGTACGTCG CGGTCGTGGT GATGGCGCTG
GGCTCGGCGA TCACCACCGC CTACAGCGCC CGGTTCATCT GGGGCGCGTT CGCGGCCAAG
AAGGACGTGC CGGCCGCCGA CCCGGCCCCG ACCAGCGCCT TCATGGTCGG ACCGACCATG
GCGATGGTGG CCATCGGGCT GGTCCTGGGC ATCGCGCCGG GGCTGCTGGA ACCGCTGCTG
CAGTCGTACG CGCAGACCGC CGGGCCGACC CATCCGGGGG CCATGGCCCT GTGGCACGGC
TGGTCGATCC CGCTGGCCCT GTCCGCGGCC GGCTGGCTGA TCGGCGCCGC GGTCTTCCTC
GGCCAGCGCG CCTGGGAACG CCGCCGGGAC CTGATCGCCC GGCCGGGGGC CGAGCCCGGC
ATCGCCTACC GGGTCGGCAT CAAGGGTGTC GACGTCGTCG CCGACGGCGT CACCAACGCC
ACCCAGCGCG GGTCGCTGCC GGTCTCGCTG GGCGCGATCC TGCTGGTGCT GGTGCTGTTC
CCGGGCACCA TGCTCCTCAC CTCCGGGGTC GGCCCGAAGG ACGTCGAGGT GATCGGGCAG
CCGGGCACGC TGGTGATCTG CGTGGTCATC CTGGCCATCG CAGCGGCCAC CCTGCGGGCC
CGCCGGGCCC TGCCGGCGGT GATGATGGTC GGCGGCATCG GCTATGCGAT GGCCGTGCTG
TTCATCCTGC GCGGGGCTCC CGACCTGGCC CTGACCCAGA TCCTGGTGGA GACCATCACC
CTGGTCGCGG CGCTGCTGGT GCTGACCCGG CTGCCCGACG ACCTGCTGTT CGCCAAGCAC
CGCGGCAACG CCTTCCGGGC GATCATCGCG GTGGCCGCCG GCGCGCTGAT GACCGGCCTG
GCGCTGATCA TCCCCGGTAC CCGGGTGGCC ACCCCGGTCT CGGCCGATCT GGCCGGGCCG
GCCGTCGAAT TCGGCGGCGG CTACAACATC GTCAACGTGA TCCTGGTCGA CGTCCGGGCC
TGGGACACCT TCGGTGAGCT GACCGTGCTG ATCGCCGCGG CCACCGGTGT CGCGTCGATG
ATCTTCCTGG TCCGCCGCAC CGGCCGGACC CCGCGACGGC CCACCCAGGA ATCGAGCACC
AGCCGGCCGC ACGAGCCGTC GCCCTGGCTG GCGACCAACT GGATGCCCCG GCGTTCGCTG
CTGCTCGAGG TCGTCACCCG GATGATCTTC CACGTCATCG TGGTGTTCTC CGTCTACGTG
CTGTTCGTCG GCCACGACGC CCCCGGCGGC GGGTTCGCCG CCGGGCTGAT CGTCGGGCTG
GCCCTGGCCC TGCGCTACAT CGCCGGCGGC GCCTACGAAC TCGGCGAGGC CGCGCCCTGG
GACCCGGGCA TCCTGATGGG CACCGGGCTG TTCATCTCGG CGGCCACCGC GATCTACGGC
GTCATCGCCG GCGGGGCCGC GCTGCAGTCC ACCATCCTCA AGGCCACCGT GCCGCTGCTG
GGTGACCTGA AGTTCGTCAC CTCGTCGATC TTCGACGTCG GGGTGTACCT GATCGTGATC
GGTCTGGTGC TGGACGTGCT GCGGGCGATG GGCGCCGAAC TGGACCGGCA GGGCATGCTC
GAACGCAGCG AGCGCACCTC CTCGACCCGG GATCGGCAGG GTGCCCGATG A
 
Protein sequence
MTSPTGLFAV APWLLLLYLA LAILAPALCR RLGRGALIVL ALLPAATTVW AAAMLPSVMN 
GGALTSSTRW VPALDLAIDL RMDALAMALT LIIAFIGMLV LLYSARYFPA DDDGLSKYSG
SLVAFAGAML GLVWANNLIL LVVCWELTGI LSYLLIAHRA GKKSSRTAAS QALMVTTAGG
LVMLIGAVML GTLAGTFTIS EILAAPPSGP LVTVSVLLLI VGGISKSAIV PFQFWLPGAM
AAPTPASAYL HAATMVKAGI YLFLRLAPAF ATLPVWQPVL IALGGGTMLF GAVLALRQRD
LKLLLAYGTV SQLGLMTLVI GTGNPDALLG GLAMLIAHAT FKAPLFFTVG IIDTTTGTRD
LTKLSGLGRR MPLLATLAVL SALSMAGIPP MLGFAAKEAD YAGLLEGGTG GYVAVVVMAL
GSAITTAYSA RFIWGAFAAK KDVPAADPAP TSAFMVGPTM AMVAIGLVLG IAPGLLEPLL
QSYAQTAGPT HPGAMALWHG WSIPLALSAA GWLIGAAVFL GQRAWERRRD LIARPGAEPG
IAYRVGIKGV DVVADGVTNA TQRGSLPVSL GAILLVLVLF PGTMLLTSGV GPKDVEVIGQ
PGTLVICVVI LAIAAATLRA RRALPAVMMV GGIGYAMAVL FILRGAPDLA LTQILVETIT
LVAALLVLTR LPDDLLFAKH RGNAFRAIIA VAAGALMTGL ALIIPGTRVA TPVSADLAGP
AVEFGGGYNI VNVILVDVRA WDTFGELTVL IAAATGVASM IFLVRRTGRT PRRPTQESST
SRPHEPSPWL ATNWMPRRSL LLEVVTRMIF HVIVVFSVYV LFVGHDAPGG GFAAGLIVGL
ALALRYIAGG AYELGEAAPW DPGILMGTGL FISAATAIYG VIAGGAALQS TILKATVPLL
GDLKFVTSSI FDVGVYLIVI GLVLDVLRAM GAELDRQGML ERSERTSSTR DRQGAR