Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0972 |
Symbol | |
ID | 4570875 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1114304 |
End bp | 1115386 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 639765575 |
Product | hydrogenase (NiFe) small subunit HydA |
Protein accession | YP_911444 |
Protein GI | 119356800 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00108314 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAAAAA ACCAGTCTTT TGCCGATATT TTCAGGGCCA GTGGCGTGAG TCGAAGAGAT TTTTTAAAGT TCTGCTCACT TACATCGGTT TATCTTGGTC TTTCCCCTTC GATGGTACCG AGTATTGTTC AGGCCATGGA AACAAAGCCA AGAACTCCGG TTATCTGGCT GCATGGTCTT GAGTGTACCT GCTGTTCCGA ATCGTTTATC CGCTCCTCTC ATCCTACTAT CGAAGACATC ATTTTCAATA TGATTTCACT CGATTATGAT GATGTTCTCA GCGCAGCGGC AGGTCATCAG CTTGAGGATG TCCGAAAGAA GACGATGACC GATTATAAAG GGAAATATAT TCTTGCCGTT GAGGGAAATG TGTCAACGAA AGACGATGGC GTGTACTGTA TGGTGGGTGG CGATTCTTTT CTCAATACGC TCAGGGAGAC AGCCGCTGAT GCTGCCGCAA TTATTGCATG GGGCGCTTGT GCTTCATTCG GTTGTGTGCA GAATGCCGAT CCGAACCCTA CAGGGGCCGC ACCGATTTCA GAGATTATAA AGGATAAACC TATCGTCAAT GTCCCCGGCT GCCCTCCGAT TGCCGAGGTT ATGACCGGAG TTATTACGCA TTTTCACACA TTCGGTAAAC TGCCCGACCT TGACCGCTTC AATCGTCCCA AGGCTTTTTA TAAAACGAGG ATTCATGATA AATGCTATCG TCGTGCATTT TTTGATGCCG GCATGTTTGT CAGAAGCTTC GATGATGAAT CGACCAGAAA AGGGTGGTGC CTCTATAAGA TGGGTTGCAA GGGACCGACA ACCTATAACT CATGTTCGAC GATTCAGTGG AATGACGGGA CAAGTTTTCC GATCGGTTCG GGCCACCCCT GTATCGGCTG CTCAGAACCG CATTTCTGGG ACAATGGGCC TTTCTACAAG AGACTTGCCG ATGTATCGTT CCTTGGCTCT GATAGCAATG CGGACAGAAT CGGAGCTGTG GCGCTTGGAG CTGCGGCGGC CGGAGCTGCG GCACATGCGA CGATTACGGC AATTAAAAAG GGAAAATCAG GTAAAGGTAA TGACAAAGCT TAA
|
Protein sequence | MQKNQSFADI FRASGVSRRD FLKFCSLTSV YLGLSPSMVP SIVQAMETKP RTPVIWLHGL ECTCCSESFI RSSHPTIEDI IFNMISLDYD DVLSAAAGHQ LEDVRKKTMT DYKGKYILAV EGNVSTKDDG VYCMVGGDSF LNTLRETAAD AAAIIAWGAC ASFGCVQNAD PNPTGAAPIS EIIKDKPIVN VPGCPPIAEV MTGVITHFHT FGKLPDLDRF NRPKAFYKTR IHDKCYRRAF FDAGMFVRSF DDESTRKGWC LYKMGCKGPT TYNSCSTIQW NDGTSFPIGS GHPCIGCSEP HFWDNGPFYK RLADVSFLGS DSNADRIGAV ALGAAAAGAA AHATITAIKK GKSGKGNDKA
|
| |