Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cagg_0470 |
Symbol | |
ID | 7266638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 580984 |
End bp | 581955 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 643565333 |
Product | NADH ubiquinone oxidoreductase 20 kDa subunit |
Protein accession | YP_002461847 |
Protein GI | 219847414 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.139203 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAT TGCTCTGGCT ACAGGGTGGC GCCTGTAGCG GCAACACGAT GTCGTTCCTC AACGCCGAGG AACCCAGTGC CTGCGATCTC GTGACCGATT TCGGCATCGA GATTCTCTGG CACCCTTCAT TGGGCATGGA GTTGGGCGAA AACGCGAAGC GGATTTTCTA CGACTGCGCC AGCGGTGCTC GCCCACTCGA CATCTTTGTC TTTGAAGGAA CGGTCATTCA AGCGCCGAAT GGCACCGGAC GCTTCAACAT GTTTGCCGAC CGCCCAATGC AAGAATGGGT GCGCGAGTTG GCTGCGCAGG CCAGCATTGT GGTCGCAATC GGCGATTGCG CCTGCTGGGG CGGCATCCCG GCAATGGCTC CCAACCCCAG CCAATCGACC GGCCTGCAAT TCCACAAACG CGAACTTGGT GGTTTTCTCG GACCAAACTG GCGCTCGAAG GCCGGCTTGC CGGTGATCAA TATTCCCGGC TGCCCGGCCC ACCCCGACTG GGTGACGCAG ATTCTGGTCG CCGTCGCCGG TGGCCGCGCC GGTGACATTG CGCTCGATGA CTTGCAGCGT CCGCAGACCT TCTTCACCAG CTTCACCCAA ACCGGCTGCA CCCGCGTGCA GTTCTTTGAG TACAAGCAGT CCACCCTTGA ATTTGGGCAG GGGACCCGCA CCGGCTGCCT GTTCTACGAG TTTGGCTGCC GCGGCCCGAT GACCCGCTCG CCCTGCAACC GCATCCTCTG GAATCGGCAG TCGTCGAAGA CCCGCGCCGG TATGCCCTGC ACCGGCTGCA CCGAGCCAGA GTTTCCGTTC TTTGATCTCG CTCCCGGTAC CGTCTTCAAG ACGCAGAAGA TTAGCGGCGC GATCCCGAAA GAAGTGCCAA CCGGGACCGA TCCGATCAGT TATATGGCGT TGGCGGCAGC GGCAAGAGTG GCAGCTCCGA AGTGGGCAAA AGAGGATATG TTTGTTGTAT AA
|
Protein sequence | MATLLWLQGG ACSGNTMSFL NAEEPSACDL VTDFGIEILW HPSLGMELGE NAKRIFYDCA SGARPLDIFV FEGTVIQAPN GTGRFNMFAD RPMQEWVREL AAQASIVVAI GDCACWGGIP AMAPNPSQST GLQFHKRELG GFLGPNWRSK AGLPVINIPG CPAHPDWVTQ ILVAVAGGRA GDIALDDLQR PQTFFTSFTQ TGCTRVQFFE YKQSTLEFGQ GTRTGCLFYE FGCRGPMTRS PCNRILWNRQ SSKTRAGMPC TGCTEPEFPF FDLAPGTVFK TQKISGAIPK EVPTGTDPIS YMALAAAARV AAPKWAKEDM FVV
|
| |