Gene Glov_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGlov_2056 
Symbol 
ID6368026 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter lovleyi SZ 
KingdomBacteria 
Replicon accessionNC_010814 
Strand
Start bp2192013 
End bp2193134 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content57% 
IMG OID642677469 
Producthydrogenase (NiFe) small subunit HydA 
Protein accessionYP_001952292 
Protein GI189425115 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA)
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACAGAG ATGAATGTGC TGGAAAGAAA CAGGAAGGTT TCTCGGTTGC CAGGATGCTT 
GAAGAACGGG GAGTCTCGCG GCGTGATTTC CTGAAATTCT GTTCTACTGT CACCGCTGCC
ATGGCGTTAC CTGCCACCAT GGCACCCAAG GTGGCTCAAG CCCTGGACAA GGTGCAGCGT
CCTCCGCTGG TCTGGCTGGA GTTTCAGGAT TGCTGCGGCG ACACGGAGGC TCTGTTACGT
TCAGCCAACC CCACCGTGGG AGAGCTGGTA CTGGATATCC TCTCGGTTGA TTACCATGAA
ACCATCATGG CTGCTGCCGG TCATCAGGCT GAGGCCAACC TCGAAAAGAC CATCAAAGAG
TTCCAGGGCA AATACCTCTG CGTGGTTGAG GGTTCCATCC CGATGAAGGA AGGAGGAGCC
TATGGCTGTG TTGGCGGCAA GTCCCATCTG GCCCGGGCCA AACAGGTCTG TGGATCTGCA
GCAGCCACCA TTGCTGTGGG CACCTGTGCC AGTTTCGGCG GTATTCCCGC TGCTGCTCCC
AATCCCACCG GCGCAGTTGG GGTCAAAGAG GCGGTGCCCG GTGCTACGGT GATCAACCTG
CCCGGCTGCC CCTGCAATGC CGATAACCTG ACCGCTGTAG TGGTTCACTT CCTTACCTTT
GGTAAACTTC CCAGTCTTGA CAGCCATGGC CGTCCCCTGT TTGCCTACGG CAAGCGGATT
CATGACAACT GTGAACGTCG TCCCCACTTT GATGCCGGTC AGTATGTTGA GCATTGGGGG
GATGATGCCC ACCGCAAGGG GCACTGCCTC TACAAGATGG GCTGTAAGGG TCCGGCAACC
TTCCATAACT GTCCCACCCA GCGTTTTAAC GAGAGAATCA GCTGGCCGGT TGCTGCCGGT
CATGGCTGTG TCGGCTGTTC CGAACCCCAG TTCTGGGATA CTTCGCCACT CTATCGCCGT
CTGCCCAACG TGCCTGGCTT TGGTATTGAG CAGAGTGCCG ACAAGATCGG GCTTGCCTTT
ACTGCCGGTG TGGGTGGTGC CTTTGCTATC CATGGTGCCA TGAATGCCCT GCGCAAGGAT
AAAGATACGG CTGACGAGAA CACAAAAGAC GGGGAGGAAT AG
 
Protein sequence
MDRDECAGKK QEGFSVARML EERGVSRRDF LKFCSTVTAA MALPATMAPK VAQALDKVQR 
PPLVWLEFQD CCGDTEALLR SANPTVGELV LDILSVDYHE TIMAAAGHQA EANLEKTIKE
FQGKYLCVVE GSIPMKEGGA YGCVGGKSHL ARAKQVCGSA AATIAVGTCA SFGGIPAAAP
NPTGAVGVKE AVPGATVINL PGCPCNADNL TAVVVHFLTF GKLPSLDSHG RPLFAYGKRI
HDNCERRPHF DAGQYVEHWG DDAHRKGHCL YKMGCKGPAT FHNCPTQRFN ERISWPVAAG
HGCVGCSEPQ FWDTSPLYRR LPNVPGFGIE QSADKIGLAF TAGVGGAFAI HGAMNALRKD
KDTADENTKD GEE