Gene Clim_0855 details


Gene Information

Locus tag: Clim_0855
Symbol:
ID: 6353927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 937308
End bp: 938393
Gene Length: 1086 bp
Protein Length: 361 aa
Translation table: 11
GC content: 53%
IMG OID: 642668480
Product: hydrogenase (NiFe) small subunit HydA
Protein accession: YP_001942913
Protein GI: 189346384
COG category: [C] Energy production and conversion
COG ID: [COG1740] Ni,Fe-hydrogenase I small subunit
TIGRFAM ID: [TIGR00391] hydrogenase (NiFe) small subunit (hydA)
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
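
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 937308
end_bp = 938393
reported_gene_length = 1086      # bp
reported_protein_length = 361    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, leftover_bases = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, leftover_bases, protein_length)  # prints: 1086 0 361
print(gene_length == reported_gene_length)          # prints: True
print(protein_length == reported_protein_length)    # prints: True
```

Under these assumptions the span covers 362 codons with no leftover bases, which agrees with the reported 1086 bp and 361 aa.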


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACAACA ATCAGACCTT CGCCGAAATA TTCAGGGCCA GCGGCATAAG CCGACGGGAT 
TTTTTGAAAT TCTGTTCACT GACCTCAGTC TATCTCGGTC TCTCACCTTC GATTGTACCA
CAGATCGTTC AGGCTATGGA AACAAAGCCG AGAACTCCGG TTATCTGGCT TCACGGTCTT
GAATGTACCT GTTGTTCGGA ATCATTCATC CGTTCTTCAC ATCCCACCAT CGAGGATATC
ATCTTCAACA TGATCTCTCT CGACTATGAT GATGTTCTCA GCGCTGCGGC CGGCCATCAG
CTTGAGGATG TGCGTAAAAA AATCATGCAG GAGTACAAGG GTAAATACAT TCTTGCCGTT
GAAGGCAACG CGTCAACGAA GGATGACGGG GTCTATTGCA TGGTGGGAGG CGATTCATTC
CTGAACACGC TGAAGGAGAC CGCGGCAGAT GCTGCGGCAA TCATCGCCTG GGGGGCTTGT
GCATCTTACG GATGTGTTCA GAACGCCGAT CCGAACCCTA CCGGTGCAGC GCCTGTTTCG
GAAATCATCA AGGATAAACC CATCGTCAAC GTTCCGGGGT GTCCACCTAT CTCCGAAGTG
ATGACCGGGG TTGTGGCACA TTTCCACACC TTCGGCACCC TGCCCGAGCT CGACCGCATG
GGCAGGCCGA AAGCCTTCTA CAACACCAGG ATTCACGACA AGTGTTATCG GCGCGCATTT
TACGATGCCG GCATGTTTGT CAGAAGCTTC GACGACGAGG CGACAAGAAA AGGGTGGTGT
CTCTACAAAA TGGGGTGCAA GGGCCCGACA ACCTATAATT CCTGTTCGAA AATTCAGTGG
AACGGCGGGG TCAGCTTTCC GATCGGGTCC GGCCATCCGT GCATCGGCTG TTCCGAACCG
AACTTCTGGG ACAAGGGGCC TTTCTATGAG CGTCTTGCCG ATGTTTCGTT CCTCGGTACG
GACAGCAATG CCGACAGGAT CGGCGTGATA GCCGTAGGAG CAGCTGCAGC CGGAGCTGCG
GCACATGCTG CCGTAACGGC AGTCAAAAAG GCCAAGGCAG GAAAGGATTC AGAGGATAAA
GCTTAA
 
Protein sequence
MHNNQTFAEI FRASGISRRD FLKFCSLTSV YLGLSPSIVP QIVQAMETKP RTPVIWLHGL 
ECTCCSESFI RSSHPTIEDI IFNMISLDYD DVLSAAAGHQ LEDVRKKIMQ EYKGKYILAV
EGNASTKDDG VYCMVGGDSF LNTLKETAAD AAAIIAWGAC ASYGCVQNAD PNPTGAAPVS
EIIKDKPIVN VPGCPPISEV MTGVVAHFHT FGTLPELDRM GRPKAFYNTR IHDKCYRRAF
YDAGMFVRSF DDEATRKGWC LYKMGCKGPT TYNSCSKIQW NGGVSFPIGS GHPCIGCSEP
NFWDKGPFYE RLADVSFLGT DSNADRIGVI AVGAAAAGAA AHAAVTAVKK AKAGKDSEDK
A
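
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any calculation such as the GC content reported in the Gene Information section. The following is a minimal sketch of that cleanup and of a GC fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and the example uses only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(raw.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Return the fraction of bases in the sequence that are G or C."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First two ten-base blocks of the gene sequence, as printed in the record.
first_blocks = "ATGCACAACA ATCAGACCTT"
cleaned = clean_sequence(first_blocks)

print(len(cleaned))          # prints: 20
print(gc_fraction(cleaned))  # prints: 0.4
```

Passing the complete gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the rounded percentage in the record.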