Gene GSU3436 details

Gene Information

Locus tag: GSU3436
Symbol: nuoH-2
ID: 2686874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sulfurreducens PCA
Kingdom: Bacteria
Replicon accession: NC_002939
Strand:
Start bp: 3780842
End bp: 3781831
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 64%
IMG OID: 637128131
Product: NADH dehydrogenase I, H subunit
Protein accession: NP_954476
Protein GI: 39998525
COG category: [C] Energy production and conversion
COG ID: [COG1005] NADH:ubiquinone oxidoreductase subunit 1 (chain H)
TIGRFAM ID:
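
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record.
start_bp = 3780842
end_bp = 3781831
gene_length_bp = 990
protein_length_aa = 329

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in
# one stop codon, which does not contribute an amino acid.
codons, remainder = divmod(computed_length, 3)
computed_protein_length = codons - 1

print(computed_length, remainder, computed_protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.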


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGGAG TTGCTCTCGA CATAGCCATC CACGGGGCCA AGATCGCCCT GATCTTCTTC 
GTGGTGCTCA CCCTGGCGGC CTACCTGGTC TTCGCCGAGC GGCGGCTCCT GGCCTGGATC
CAGGACCGCA AGGGGCCGAA CCGGGTGGGC CCCTTCGGCC TGCTGCAGCC CCTGGCGGAT
CTCATCAAAC TCCTGACCAA GGAGGATTTC CGCCCCGCCG GCGCCGACAA GTGGCTCTTC
TATCTGGCCC CGGCCATGGC TGCCGTCCCG GCGATCCTCA CCTTTGCGGT GATCCCCTTC
GGGGCGCCGG TCACGATTCT CGGCCGTGAG ATCCCGCTTC AGGTGGCGGA TCTGAACGTG
GGGCTCCTCT TCTTCTTGGC ACTCTCCTCC ATTGCGGTGT ATGGTGTGGC CCTGGGGGGA
TGGGCGTCCA ACTCCAAGTA CGCGCTCCTG GGCTCCATCC GGGGGCTGGC CCAGCTCATC
TCCTACGAGC TCTCCATGGG GCTGTCCCTG GTGCCCACGG TGATGCTGGC CGGGTCGCTC
CGGCTGTCGG ACATCGTGGC GGCCCAGGAG GGGGTCTGGT TCATCGCCTA CCAGCCCGTG
GCCTTTCTGA TCTTTCTCAT CAGCATCGCG GCCGAATGCA AGCGGATACC CTTTGACATC
CCCGAGGCGG AGGGAGAGCT GGTGGCGGGG TTCCACACCG AGTACTCGGG GATGCGTTTC
GGCCTCTTTT TCGTGGGCGA GTACATCAAC ATCATCGTCC TCGGCGGTTT GGCAACCACC
TTCTTTCTGG GCGGCTGGCA GGGGCCGCTG CTGCCTCCCT TCGTCTGGTT TTCGGTGAAG
ACTCTCGCCT TCGCCTTCTT TTTCATCTGG ATGCGGGGAA CCCTGCCGCG GTTGCGCTAC
GATCAGCTCA TGCACCTGGG ATGGAAGGTG CTGACGCCGC TGGCACTGCT CAACATCCTG
ATCACCGGAT GGGTACTGAT GTTCGTGTAA
 
Protein sequence
MNGVALDIAI HGAKIALIFF VVLTLAAYLV FAERRLLAWI QDRKGPNRVG PFGLLQPLAD 
LIKLLTKEDF RPAGADKWLF YLAPAMAAVP AILTFAVIPF GAPVTILGRE IPLQVADLNV
GLLFFLALSS IAVYGVALGG WASNSKYALL GSIRGLAQLI SYELSMGLSL VPTVMLAGSL
RLSDIVAAQE GVWFIAYQPV AFLIFLISIA AECKRIPFDI PEAEGELVAG FHTEYSGMRF
GLFFVGEYIN IIVLGGLATT FFLGGWQGPL LPPFVWFSVK TLAFAFFFIW MRGTLPRLRY
DQLMHLGWKV LTPLALLNIL ITGWVLMFV