Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2721 |
Symbol | hoxF |
ID | 2686035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2997688 |
End bp | 2999400 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637127412 |
Product | NAD-reducing hydrogenase, alpha subunit |
Protein accession | NP_953766 |
Protein GI | 39997815 |
COG category | [C] Energy production and conversion |
COG ID | [COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCCCG CCGATCTCCA TGCCATGGCC CGGGAAGAAC AGGCCCGCCA GGAGGCGCTG CGGTGCCGTA TCATGGTCTG TGCCGGTACC CCCTGTCTTT CCGCCGGCGC ACTGGCGGTC CTGGACGCGC TGCGCCAGGC GGTGGAGGAG AGCCGGCTCG ATGCGGAGAT CGAGGCCGTC AGCACCGGCT GCATGGGGCC GTGCAGCCGC GGTCCCCTCG TGAAAGTGGC AGTTCAGGGG AAACCCGAGA TCGTCTACGA GCGCGTCACC CCCGAACTGG CCCGCCAGAT TCTCTACTCG GTGGTGAAGG GGCGCCGGCC GCCGACGGCC TCGCCGCTGC CGCCGGATCA CCCCTATTTC ACCCGCCAGA TGAAGATCGT CTTGGCCAAC TGCGGTTCCA TCGACCCCGA GCGGATCGAG GAGTATGTGG CCACGGGGGG CTACGATGCC CTTGCCCACG CCCTGCACGA GATGACCCCC GAGGATGTCT GTCGCGAGAT CTCCGCTTCG GGACTGCGGG GGCGGGGGGG GGCCGGCTAT CCGGCCGGCG TCAAGTGGAA CCTGGCCCGC AAGGCGCCGG GCGAGCGGAA ATACGTGGTG GCCAACGGCG ACGAGGGGGA CCCGGGCGCC TACATGGACC GCTCCGTCAT GGAGTCGGAC CCCCACCGCA TCCTGGAAGG AATGGCCATT GCCGGCTACG CCATCGGCGC GGATCAGGGC TACATCTACG TCCGGGGCGA GTACTACCTG GCCGGAAGGC GGCTCGAGGC CGCCATCCGC GACGCCGAGC GCAAGGGGCT CCTGGGAAGC CGGGTGCTCG GCAGCAACTT CAGTTTCCGC ATCGACATCC GCACCGGCGC CGGCGCCTTT GTCTGCGGCG AGGAAACCTC GCTCATGGCT TCCATCATGG GGCGGCGCGG ACAGCCCTGG CACCGGCCGC CCTATCCGGC CCAGCGGGGG CTCTGGGGGT GTCCGACCCT CATCAATAAC GTGGAGACCC TTGCCACCGT CCCGGCTATC ATCGGACGCG GGGCCACCTG GTACGCCGGC ATCGGCTCGA CCCGGAGCCC GGGAACCAAG GTCTATGCCC TGGCCGGCCA GGTGGAGACC GCCGGCCTGA TCGAAGTCCC CATGGGTACC ACGCTGCGGG AGGTGGTGTT CGACATCGGC GGCGGAATCC CCGGCGGCAA GCGCTTCAAG GCTGCCCAGT CGGGCGGCCC TTCGGGCGGC TGCATTCCGG CCGAACACCT GGACACCCCC CTCGATTACG AAAGTATGGA GCGGATCGGC ACCATCATGG GCTCCGGCGG ACTTATCATC ATGGACGAGA CGAGCTGCAT GCCCGACGTC GCCTCGTTTT TCCTCGATTT CTGCCGGGAC GAGAGCTGCG GCAAGTGCAT CCCGTGCCGC GTCGGCAGCA TGGAGATGCA CCGGCTGCTG CGCCGCATAA CCTCCGGCAC CGCTTGGCCG GACGACCTGC GCCGGCTGGA GGAGCTCTGC GATCTGATGG GCGCCACGAG CCTCTGTGGT CTCGGCCAGA CCGCGCCCAA CCCGGTGGTG AGCACCCTGC GCTACTTCCG GCACGAATAC GAGGCCCACA TCCGCGAGCG CCGCTGCCCG GCGGGCCGCT GCACCATGGC TGCCGAGCAG GAGCCCGGAC CCGGCGGCAA GGCACTCCTG CACGCCGTCG GCGTCCCTGG GGAGGTAGGC TGA
|
Protein sequence | MNPADLHAMA REEQARQEAL RCRIMVCAGT PCLSAGALAV LDALRQAVEE SRLDAEIEAV STGCMGPCSR GPLVKVAVQG KPEIVYERVT PELARQILYS VVKGRRPPTA SPLPPDHPYF TRQMKIVLAN CGSIDPERIE EYVATGGYDA LAHALHEMTP EDVCREISAS GLRGRGGAGY PAGVKWNLAR KAPGERKYVV ANGDEGDPGA YMDRSVMESD PHRILEGMAI AGYAIGADQG YIYVRGEYYL AGRRLEAAIR DAERKGLLGS RVLGSNFSFR IDIRTGAGAF VCGEETSLMA SIMGRRGQPW HRPPYPAQRG LWGCPTLINN VETLATVPAI IGRGATWYAG IGSTRSPGTK VYALAGQVET AGLIEVPMGT TLREVVFDIG GGIPGGKRFK AAQSGGPSGG CIPAEHLDTP LDYESMERIG TIMGSGGLII MDETSCMPDV ASFFLDFCRD ESCGKCIPCR VGSMEMHRLL RRITSGTAWP DDLRRLEELC DLMGATSLCG LGQTAPNPVV STLRYFRHEY EAHIRERRCP AGRCTMAAEQ EPGPGGKALL HAVGVPGEVG
|
| |