Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_0973 |
Symbol | |
ID | 4570876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 1115404 |
End bp | 1117122 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 639765576 |
Product | nickel-dependent hydrogenase, large subunit |
Protein accession | YP_911445 |
Protein GI | 119356801 |
COG category | [C] Energy production and conversion |
COG ID | [COG0374] Ni,Fe-hydrogenase I large subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000120136 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAGAA AAATTGTTGT TGATCCTATT CCCCGTATAG AGGGACACCT GAGAATTGAG GCAAAGCTGA ATGAGAGCAA TCAGATTGAA GAGGCGTTCA GCAGCGGTAC CATGTGGCGG GGTATCGAGG TTATACTCAA AGGTCGAGAT CCCCGAGATG CGTGGGCTTT TGCTGAACGT ATCTGTGGTG TCTGTACCAG TGTTCATGCG CTTGCATCGG TCCGTTGTGT CGAGGATGCT CTGGGTATTG ATGTGCCTCT TAATGCCCGC ATTATCAGAA ACCTGATGAA TGCCACGCAG CAGACACAGG ATCATCTGGT TCACTTTTAT CATCTTCACG CACTTGACTG GGTCGATGTT GTCAGTGCGT TGAAGGCGGA TCCGAAAATG ACATCCGATA TTGCTCAGAG CATCTCTCCG TGGCCGAAAT CATCAGTGGG GTATTTCAAG GATCTTCAGC AGCGTCTTAC AGGTTTTGTC GAGAGCGGTC AACTCGGCAT ATTTTCAAAT GGTTACTGGG GACATCCTGC CTACAGGCTG CCTCCTGAAG TCAACCTTCT TGCGGTTGCG CACTACCTTG AAGCTCTTGA TTTTCAGAAG GAGATCATCA AGATCCATAC CATTTTCGGA GGCAAGAATC CGCATCCGAA CTATCTGGTT GGAGGAATGG CGTGCGCTAT CGATCCAAAC AGTGATACGG CAATCAATAT CGAGCGTTTT GCCTTGATCA AGAAGATCAT CGACGATACC AATACCTTTA TTGATCAGGT CTATATTCCG GACCTGATCG CTATTGCCGG TTATTACAAG GATTGGCTCT ATGGCGGCGG TCTGGGCAAC TATCTCAGTT ATGGCGATTT TCCGGAAAAT TCACTTGATG ACTACAAGAC CCTGCTTTGG CCGAGAGGAG CCATTCTCAA CAGGGATCTT TCAAACGTTA TCGATGTTGA TCCACGCGAT TCCGCCCAGG TTACCGAAGA GGTCAGCCAT AGCTGGTATA CCTATTCAAA AGGGGACAGT AAAGGCCTCC ATCCCTGGGA GGGTGAGACA AAACCGGCTT ATACCGGCCC TAAACCACCG TTTGAATTTC TCGATACTGA CAAGAAGTAC AGTTGGTTAA AAACTCCCCG GTGGAAAAAC CACGCAATGG AGGTCGGACC GCTTGCCCGG GTTCTTGTGG CATATGCAAA AGGGGATCCC ATGATCAAGG ATACCGTCGG TATGGTTCTC GGCAAGCTTC AGGTTGGTCC GGAAGCACTG TTTTCGACTC TTGGCAGAAC TGCTGCACGG GGTATCGAGT GCAAGCAGAC TGCCGGATTC ATGCGTCATT TTTATGATCA GCTTGTTGAT AATGTCAAAA AAGGTGATTA CAGGACGTTC AACAGCGATA AATGGGATCC GTCAACATGG CCTTCTGAGT CGAAGGGCTT CGGATATACT GAAGCTCCGC GTGGTTCGCT TGGTCACTGG ATACATATCA AAGACCAGAA GATCAAGGAG TACCAGATTG TTGTCCCTTC AACATGGAAC GCTTCGCCCA GAGATGCCGC AGGTAATTCA GGCGCTTATG AAGCTGCATT GAAAGGAACT CCCATGGCTA ATCCTGAACA GCCGCTTGAG ATTATCAGAA CTGTACACTC ATTCGATCCC TGTCTTGCAT GCGCATCCCA TGTGGTTGAT ATGCATGGCA AAGAGATCAC CAAGGTCAAA ATCGTTTAA
|
Protein sequence | MSRKIVVDPI PRIEGHLRIE AKLNESNQIE EAFSSGTMWR GIEVILKGRD PRDAWAFAER ICGVCTSVHA LASVRCVEDA LGIDVPLNAR IIRNLMNATQ QTQDHLVHFY HLHALDWVDV VSALKADPKM TSDIAQSISP WPKSSVGYFK DLQQRLTGFV ESGQLGIFSN GYWGHPAYRL PPEVNLLAVA HYLEALDFQK EIIKIHTIFG GKNPHPNYLV GGMACAIDPN SDTAINIERF ALIKKIIDDT NTFIDQVYIP DLIAIAGYYK DWLYGGGLGN YLSYGDFPEN SLDDYKTLLW PRGAILNRDL SNVIDVDPRD SAQVTEEVSH SWYTYSKGDS KGLHPWEGET KPAYTGPKPP FEFLDTDKKY SWLKTPRWKN HAMEVGPLAR VLVAYAKGDP MIKDTVGMVL GKLQVGPEAL FSTLGRTAAR GIECKQTAGF MRHFYDQLVD NVKKGDYRTF NSDKWDPSTW PSESKGFGYT EAPRGSLGHW IHIKDQKIKE YQIVVPSTWN ASPRDAAGNS GAYEAALKGT PMANPEQPLE IIRTVHSFDP CLACASHVVD MHGKEITKVK IV
|
| |