Gene Cpha266_0972 details

Gene Information

Locus tag: Cpha266_0972
Symbol:
ID: 4570875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 1114304
End bp: 1115386
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 48%
IMG OID: 639765575
Product: hydrogenase (NiFe) small subunit HydA
Protein accession: YP_911444
Protein GI: 119356800
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
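
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, includes_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the last codon is a stop codon that encodes no residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length(1114304, 1115386)
print(length)                  # 1083
print(protein_length(length))  # 360
```

Both results agree with the Gene Length and Protein Length fields reported in the record.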


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00108314
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAAA ACCAGTCTTT TGCCGATATT TTCAGGGCCA GTGGCGTGAG TCGAAGAGAT 
TTTTTAAAGT TCTGCTCACT TACATCGGTT TATCTTGGTC TTTCCCCTTC GATGGTACCG
AGTATTGTTC AGGCCATGGA AACAAAGCCA AGAACTCCGG TTATCTGGCT GCATGGTCTT
GAGTGTACCT GCTGTTCCGA ATCGTTTATC CGCTCCTCTC ATCCTACTAT CGAAGACATC
ATTTTCAATA TGATTTCACT CGATTATGAT GATGTTCTCA GCGCAGCGGC AGGTCATCAG
CTTGAGGATG TCCGAAAGAA GACGATGACC GATTATAAAG GGAAATATAT TCTTGCCGTT
GAGGGAAATG TGTCAACGAA AGACGATGGC GTGTACTGTA TGGTGGGTGG CGATTCTTTT
CTCAATACGC TCAGGGAGAC AGCCGCTGAT GCTGCCGCAA TTATTGCATG GGGCGCTTGT
GCTTCATTCG GTTGTGTGCA GAATGCCGAT CCGAACCCTA CAGGGGCCGC ACCGATTTCA
GAGATTATAA AGGATAAACC TATCGTCAAT GTCCCCGGCT GCCCTCCGAT TGCCGAGGTT
ATGACCGGAG TTATTACGCA TTTTCACACA TTCGGTAAAC TGCCCGACCT TGACCGCTTC
AATCGTCCCA AGGCTTTTTA TAAAACGAGG ATTCATGATA AATGCTATCG TCGTGCATTT
TTTGATGCCG GCATGTTTGT CAGAAGCTTC GATGATGAAT CGACCAGAAA AGGGTGGTGC
CTCTATAAGA TGGGTTGCAA GGGACCGACA ACCTATAACT CATGTTCGAC GATTCAGTGG
AATGACGGGA CAAGTTTTCC GATCGGTTCG GGCCACCCCT GTATCGGCTG CTCAGAACCG
CATTTCTGGG ACAATGGGCC TTTCTACAAG AGACTTGCCG ATGTATCGTT CCTTGGCTCT
GATAGCAATG CGGACAGAAT CGGAGCTGTG GCGCTTGGAG CTGCGGCGGC CGGAGCTGCG
GCACATGCGA CGATTACGGC AATTAAAAAG GGAAAATCAG GTAAAGGTAA TGACAAAGCT
TAA
 
Protein sequence
MQKNQSFADI FRASGVSRRD FLKFCSLTSV YLGLSPSMVP SIVQAMETKP RTPVIWLHGL 
ECTCCSESFI RSSHPTIEDI IFNMISLDYD DVLSAAAGHQ LEDVRKKTMT DYKGKYILAV
EGNVSTKDDG VYCMVGGDSF LNTLRETAAD AAAIIAWGAC ASFGCVQNAD PNPTGAAPIS
EIIKDKPIVN VPGCPPIAEV MTGVITHFHT FGKLPDLDRF NRPKAFYKTR IHDKCYRRAF
FDAGMFVRSF DDESTRKGWC LYKMGCKGPT TYNSCSTIQW NDGTSFPIGS GHPCIGCSEP
HFWDNGPFYK RLADVSFLGS DSNADRIGAV ALGAAAAGAA AHATITAIKK GKSGKGNDKA
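
The gene and protein listings above can be cross-checked with a short script. The Python sketch below strips the spaces and line breaks used in the listing, translates the nucleotides codon by codon, and computes the GC fraction. It rests on stated assumptions: translation table 11 assigns the same amino acid to each codon as the standard code (the two differ in permitted start codons), so the standard mapping is used here, and alternative start codons are not handled, which does not matter for this gene because it begins with ATG. The helper names are illustrative and not part of any database tool.

```python
# Codon-to-residue mapping in the usual T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate an in-frame coding sequence, dropping one trailing stop codon if present."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    return protein[:-1] if protein.endswith("*") else protein


def gc_content(seq):
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
first_line = "ATGCAAAAAA ACCAGTCTTT TGCCGATATT TTCAGGGCCA GTGGCGTGAG TCGAAGAGAT"
print(translate(first_line))  # MQKNQSFADIFRASGVSRRD
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the whole protein listing and the reported GC content.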