Gene GSU3430 details

Gene Information

Locus tag: GSU3430
Symbol: nuoM-2
ID: 2686865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3775526
End bp: 3777016
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 65%
IMG OID: 637128125
Product: NADH dehydrogenase I, M subunit
Protein accession: NP_954470
Protein GI: 39998519
COG category: [C] Energy production and conversion
COG ID: [COG1008] NADH:ubiquinone oxidoreductase subunit 4 (chain M)
TIGRFAM ID: [TIGR01972] proton-translocating NADH-quinone oxidoreductase, chain M
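
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the function name `lengths_from_coordinates` is hypothetical and is not part of the database.

```python
def lengths_from_coordinates(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1      # inclusive interval
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3                # includes the stop codon
    protein_length = codons - 1              # stop codon is not translated
    return gene_length, protein_length


gene_length, protein_length = lengths_from_coordinates(3775526, 3777016)
print(gene_length, protein_length)  # 1491 496
```

Under these assumptions the coordinates give 1491 bp and 497 codons, of which 496 encode amino acids, matching the Gene Length and Protein Length fields.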


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACTTCCT ACTCCATCCT CACCATCTTG ATCCTCCTCC CCCTGGCGGG GTGCTTCTGC 
CTGGCGCCGG TCTGGAACCG TCCGGAGTGG GCCCGCCCCC TCGCCCTGGG GATTGCCGTG
GCGGAACTGG CCCTGGCCGG CTGGGTCCTC GTCGCCGCGC CGGGAATGCC GCCGGCGCCG
GGTGCCGCAG CAGGGTATTT TCTCTGGGAA GATGCCGCCT GGATCGAGCG GTTCGGCATC
CGGTACCTGC TGGGGATGGA CGGGATCAGC CTCCTCATGG TGGCCCTCAC CGCCTTCACC
ACGGTGGTGG CGATGCTCGT TTCGTGGCGC GGCATCACCG AGCGGGCCAC GCTCCACTAC
TTCCTGATCC TTCTCATGGA GAGCGGGATC ATGGGAGTGT TCCTTTCCCT TGACCTGGTC
CTCTTCTACC TCTTCTGGGA AGTGATGCTG ATCCCCATGT TCTTCTTGAT CGGCATCTGG
GGCCACGGCC GACGCATCTA CTCGGCAGTT AAGTTCTTCC TTTATACCCT GGTGGGCTCG
CTCCTGATGC TGCTGGCCAT CATCGGGGTC TACCTCATTC ATGGCGATGC CACCGGCACC
TTCACCTTTG CCCTGCCGCT CCTGGCCAAG TCACCCATTG CCCATGCCGC GGCTCCGTGG
CTTTTCGGGG CGTTTCTGCT GGCCTTCGCC ATCAAGTTTC CGCTGTTTCC GGTCCACACC
TGGCTCCCGG ACGCCCACAC CGACGCCCCC ACGGCCGGCA GCGTGATCCT GGCGGCGCTG
CTCCTGAAGA CCGGGGCCTA CGGTCTGGTC CGCTTCGGCT ACCCGCTCTT TCCCGAGGCG
GCCAAAGGAT TCACGCCGCT CCTCTACGTG CTGGCAATCA TCGGCATCCT CTATGCCTCG
TGGATCGCCT ACGCCCAGGA GGACATGAAA CGGATGGTGG CCTACTCCAG CGTTGGGCAC
ATGGGGTTCG TCGCCCTCGG GATCGCCTCC TGGGGGCCGG TGGCCCTGTC GGGCTCCATT
CTCCAGATGG TGAACCACGG CTTCACCACC GCCGCTCTCT TCGCCCTGGT GGGGATGCTG
GACGAGCGCG CCCACACCCG GGAGGTGTCG GCGTTCGGCG GACTCTGGGG AACCATGCCG
GCCTTCTCCT TTTTCTTTCT CTTCTTCGCC ATGGCTTCGG CGGGGCTGCC GGGGCTCAAC
AACTTCGTGG GAGAGTTCCT GATCCTGGTG GGGGTCTTCC GGATCACGCC AGCAGCCGGG
GCAATCGCCT TCCTCGGTAT CGTGCTGCCG CTTATCTACA CCGTGCGACT CGTGCAGGAG
GTTCTCTTCC AGACGGAACG GCGGCCACTG CGCCTGCCCG ACCTGACCCT GCGTGAGGGG
GCGGTGCTGG CCGTACTGGC CGTGATCGAT CTCTACATCG GGGTCCATCC GAAACCGCTC
CTGGATATCC TCAAGGTGCC GGTGGCGCTG CTGACGGGGG TGGCACCGTG A
 
Protein sequence
MTSYSILTIL ILLPLAGCFC LAPVWNRPEW ARPLALGIAV AELALAGWVL VAAPGMPPAP 
GAAAGYFLWE DAAWIERFGI RYLLGMDGIS LLMVALTAFT TVVAMLVSWR GITERATLHY
FLILLMESGI MGVFLSLDLV LFYLFWEVML IPMFFLIGIW GHGRRIYSAV KFFLYTLVGS
LLMLLAIIGV YLIHGDATGT FTFALPLLAK SPIAHAAAPW LFGAFLLAFA IKFPLFPVHT
WLPDAHTDAP TAGSVILAAL LLKTGAYGLV RFGYPLFPEA AKGFTPLLYV LAIIGILYAS
WIAYAQEDMK RMVAYSSVGH MGFVALGIAS WGPVALSGSI LQMVNHGFTT AALFALVGML
DERAHTREVS AFGGLWGTMP AFSFFFLFFA MASAGLPGLN NFVGEFLILV GVFRITPAAG
AIAFLGIVLP LIYTVRLVQE VLFQTERRPL RLPDLTLREG AVLAVLAVID LYIGVHPKPL
LDILKVPVAL LTGVAP
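
The relationship between the gene sequence and the protein sequence can be reproduced with a short script. This is a sketch under stated assumptions, not the database's own pipeline: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table used is the standard one, which has the same codon assignments as translation table 11 (table 11 differs only in which codons may act as start codons; this gene begins with ATG, so that difference does not matter here). Only the first line of the gene sequence is used as input to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the 10-base grouping spaces and line breaks, and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as printed in this record.
first_line = "ATGACTTCCT ACTCCATCCT CACCATCTTG ATCCTCCTCC CCCTGGCGGG GTGCTTCTGC"
print(translate(first_line))  # MTSYSILTILILLPLAGCFC
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` is a way to compare against the Protein Length and GC content fields.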