Gene GSU3441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3441 
SymbolnuoF 
ID2688153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3786573 
End bp3787844 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content67% 
IMG OID637128136 
ProductNADH dehydrogenase I, F subunit 
Protein accessionNP_954481 
Protein GI39998530 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGG TTCTCTTCCG ACATAACCGT CCCGGCCGGT GCGTAACCTT TGCCGAGTAC 
CGGGCGGAGG GAGGCTTCGC GGCCCTGGAG AAGGCCCTTT CCGGCATGTC GCCCAACGAT
GTCCAGCAGG TGGTGATCGA CGCCAATCTG CGGGGCCGGG GCGGGGCCGG GTTCCCCACC
GGGAAGAAGT GGTCCTTCGT GCCGCGGGAC ATCCCAGGCC CCCGTTATCT CATCTGTAAC
TGCGACGAGA TGGAGCCGGG CACCTACAAG GACCGGATAC TCCTGGAGGC GAACCCCTAT
TCTCTGGTGG AGGGGATGAC CCTGGCCGCT TATGCCATCG GTGTGGCCCA TGCCTTCATC
TTCATCCGCC GCGGCTACGA AGAGGCGGCG GAGAATTGCC GGCGCGCCAT TGCCGAGGCA
AAGGAAGCGG GCCTGCTGGG GAAGAACATC CTCGGCTCCG GTTTCTCCCT GGACCTGGAC
GTCCACCAGT CCGCCGGGCG TTACATCTGT GGCGAGGAAA CAGCCCTCAT GAACGCGCTG
GAGGGGAGGC GGGCCAACCC GCGGAGCAAA CCCCCCTTCC CGGCAGTGAA AGGGCTCTGG
GGGCGCCCCA CGGTGGTGAA CAACGTGGAG ACCCTGGCCA ACATCCCGGC TATCGTGGCC
AACGGCGCCG CCTGGTTCAA GGGGCTGGCG AGAATCTCCG AAGCGGCCGG TACCAAGCTC
TTCTGCGTGA GCGGACATGT GAACAACGCC GCCTGCTTCG AGCTTCCCCT GGGGATGAGC
CTGGGCGAGA TCATCGACGG CCCCTGCGGC GGCATGCTGC CGGGGCGGGA GTTCAAGGCG
TGCATACCCG GCGGGGCGTC CACGCCGTTC TTTACCCGGG AGCACTGGAA CGTCCCCATG
GACTTCGATG CCGTGGCAAG GGCCGGCTCC CGGCTCGGCA CCGGCGGCAT CGTGGTCTTC
GACCGGAACA CCTGCATGGT GGCGGCAACC CTGAACCTGG TGTCCTTTTA CGCTCGGGAG
TCGTGCGGCT GGTGTACCCC CTGCCGGGAA GGGCTCCCCT TTGTGAAGGA CGTCCTGGCC
CGGATCGAGG CGGGCGCCGG GCGGGAGGAG CACATCGCCA TCCTGCGGGA GCATGTCCAG
TACCTGAACT ACGCCTTCTG TCCCCTGGCC CCCGGTGCCA TGGGGCCGGT GGAGGGGCTC
CTGCGGCTCT TCGAGGACGA GATCCGGGAG CACATCGTGA TGGGGCGCTG CCCCCTCGGA
GGAAAGGGAT GA
 
Protein sequence
MEQVLFRHNR PGRCVTFAEY RAEGGFAALE KALSGMSPND VQQVVIDANL RGRGGAGFPT 
GKKWSFVPRD IPGPRYLICN CDEMEPGTYK DRILLEANPY SLVEGMTLAA YAIGVAHAFI
FIRRGYEEAA ENCRRAIAEA KEAGLLGKNI LGSGFSLDLD VHQSAGRYIC GEETALMNAL
EGRRANPRSK PPFPAVKGLW GRPTVVNNVE TLANIPAIVA NGAAWFKGLA RISEAAGTKL
FCVSGHVNNA ACFELPLGMS LGEIIDGPCG GMLPGREFKA CIPGGASTPF FTREHWNVPM
DFDAVARAGS RLGTGGIVVF DRNTCMVAAT LNLVSFYARE SCGWCTPCRE GLPFVKDVLA
RIEAGAGREE HIAILREHVQ YLNYAFCPLA PGAMGPVEGL LRLFEDEIRE HIVMGRCPLG
GKG