Gene GM21_4003 details


Gene Information

Locus tag: GM21_4003
Symbol:
ID: 8139377
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4587047
End bp: 4588096
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 61%
IMG OID: 644871619
Product: NADH dehydrogenase (quinone)
Protein accession: YP_003023777
Protein GI: 253702588
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
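
The start, end, gene length, and protein length fields above are consistent with one another. The sketch below checks that arithmetic, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4587047
END_BP = 4588096


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1050
print(protein_length(length_bp))  # 349
```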


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 5.90654e-20
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGATACGC TAATCTTAGG ACTGCCTGTC GCGTACTACA TAGCCATGGT TGCCAAAGTG 
CTCGTAGCCT TTGTCTTCGT GCTCCTGACC GTGGCCTACG CCACCTACGC GGAGCGCAAG
ATAATCGGGC ATATGCAGGT GCGCCTGGGT CCCATGAGGA CCGGCTGGCA CGGACTGCTG
CAGCCGATCG CGGACGGCGT CAAGCTGTTC TTCAAGGAGG AGATCGTCCC GACTCAGGCA
AGCAAGTTCG CCTTCCTGAT CGCTCCCTTG GTCGCCTTGA TCCCGGCGTT CATATCCTTC
GCCGTCATCC CCTTTGGCGC CCCGGTGACC ATCGCCGGTT ACACGGTCCC GCTGCAGATC
GCGGCCTACT ACGACCAGGC AGGTCAGCAG GTCTTCGACG TCAACGTCGG CGTCCTCTAC
ATCCTCGCCA TGGCGAGCCT CGGGGTCTAC GGCGTGGTCC TCGCGGGTTG GGCCTCCAAC
TCCAAGTACT CGCTTCTGGG CGGCCTTCGT TCCGCGGCGC AGATGATCTC CTACGAACTC
GCCGCCGGCC TCGCGATCAT CGCCGTCTTC ATGCTCTCCG AGTCGCTGTC GCTGCACAAG
ATCGTCGCCG ACCAGGCCAA CGGTGCCTGG TACGTATTCA AACAGCCGCT CGCCTTCGTG
ATCTTCTTCA TCTGCTCGCT GGCTGAGATA AACAGGACCC CGTTCGACCT TCCCGAGGCG
GAGACGGAGC TCGTGTCCGG CTTCATCACC GAGTACTCCT CCATGAAATA CGCCATGTTC
TTCATGGCCG AGTACGCCAA CATGATCACC GTCTGCGCGG TCACCACCAC CCTGTTCCTG
GGCGGCTGGC ACGGCCCGGC GTTTCTCCCC GGCTGGTTCT GGTTCGTCGC CAAGGTGTAC
TTCCTGATCT TCTGCTGCAT GTGGATCAGG GCAACCTACC CGCGTTACCG CTACGACCAG
CTCATGCGTC TGGGGTGGAA GGTGTTCCTG CCGCTGACCC TGGTCAACGT CATGGCGACC
GGAATCTGGG TCATGGTCTT CAACAAGTAG
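
The GC content reported in the gene information can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste all lines for the full gene.
first_line = "ATGGATACGC TAATCTTAGG ACTGCCTGTC GCGTACTACA TAGCCATGGT TGCCAAAGTG"
seq = clean_sequence(first_line)
print(len(seq))           # 60
print(gc_percent("ATGC")) # 50.0
```

Running `gc_percent` on the full cleaned gene sequence gives the value to compare against the GC content field.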
 
Protein sequence
MDTLILGLPV AYYIAMVAKV LVAFVFVLLT VAYATYAERK IIGHMQVRLG PMRTGWHGLL 
QPIADGVKLF FKEEIVPTQA SKFAFLIAPL VALIPAFISF AVIPFGAPVT IAGYTVPLQI
AAYYDQAGQQ VFDVNVGVLY ILAMASLGVY GVVLAGWASN SKYSLLGGLR SAAQMISYEL
AAGLAIIAVF MLSESLSLHK IVADQANGAW YVFKQPLAFV IFFICSLAEI NRTPFDLPEA
ETELVSGFIT EYSSMKYAMF FMAEYANMIT VCAVTTTLFL GGWHGPAFLP GWFWFVAKVY
FLIFCCMWIR ATYPRYRYDQ LMRLGWKVFL PLTLVNVMAT GIWVMVFNK
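
The protein sequence corresponds to the gene sequence translated codon by codon. The sketch below illustrates this on the first 60 bases of the gene. It uses the standard codon assignments, which translation table 11 shares (the two tables differ in which codons may act as start codons); the `translate` function is an illustrative helper, not the database's own method.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino acids are listed
# for TTT, TTC, TTA, TTG, TCT, ... with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGGATACGC TAATCTTAGG ACTGCCTGTC GCGTACTACA TAGCCATGGT TGCCAAAGTG"
print(translate(first_line))  # MDTLILGLPVAYYIAMVAKV
```

The output matches the first 20 residues of the protein sequence above.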