Gene Information |
Locus tag | Cwoe_2540 |
Symbol | |
ID | 8732983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2704584 |
End bp | 2705645 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646503155 |
Product | NADH ubiquinone oxidoreductase 20 kDa subunit |
Protein accession | YP_003394337 |
Protein GI | 284043997 |
COG category | [C] Energy production and conversion |
COG ID | [COG1740] Ni,Fe-hydrogenase I small subunit |
TIGRFAM ID | [TIGR00391] hydrogenase (NiFe) small subunit (hydA) |
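The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2704584
end_bp = 2705645
gene_length_bp = 1062
protein_length_aa = 353

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS is read in whole codons of three bases.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and encodes no residue.
coded_residues = codons - 1

print(span == gene_length_bp)               # True
print(remainder)                            # 0
print(coded_residues == protein_length_aa)  # True
```

Under these assumptions the reported gene length (1062 bp) and protein length (353 aa) agree with the listed coordinates.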
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.733354 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGACCG TCGAGTTCAC CCCGAGACAG CGCGAGACGG AGGCGATCAC CGCTCACGTG CTGTGGATGA CGACCGGGCT GGGGTGCGAC GGCGACTCGG TCGCGATGAC CTCGGCGACG AACCCGAGCC TCGAAGACAT CATCACCGGC GCGATCCCGG GGATGCCGAG AGTGGTGGTC CACAACCCGG TGATCGCCTA CGAGCAGGGC GAGGACTTCA TGAGAGCGTG GTTCGCCGCC GAGCGCGGCG AGCTGGACCC GTTCGTGCTG ATCCTCGAAG GGTCGCTCGG CAACGAGAAG ATCAACGGCG CCGGGCACTG GTCCGGCCTC GGGACCGACC CGTCGACCGG CCAGCCGATC ACGACGAACG CGTGGATCGA CCGGCTTGCG CCGAAGGCGG CGGCGGTCGT CGCGGTCGGC ACCTGCGCGA CGTACGGCGG CATCCCGGCG ATGGCTGGCA ACGCGACCGG CGCGATGGGG CTGCGCGACT ACCTCGGCTG GAGATGGACG TCGAAGGCGG GGATCCCGAT CGTCAACATA CCCGGCTGCC CGGCGCAGCC GGACAACATG ACCGAGATGC TCGTCCACCT CGTCTTCGCG CTCGCGGGGA TGGCGCCGGT GCCGGAGCTG GACGACGCCG GCCGCCCGAC CTCGCTGTTC GGGCGCACCG CGCACGAGAG CTGCAACCGC GCCGCGTTCT ACGAGTCGGG CAACTTCGCG ACCGAGTACG GCTCCGACCA CCGCTGCCTC GTCAAGCTCG GGTGCAAGGG ACCGGTCGTC AAGTGCAACG TCCCGTTGCG CGGCTGGCAG AGCGGGATGG GCGGCTGCCC CAACGTCGGC GGCATCTGCA TGGCGTGCAC GATGCCCGGC TTCCCCGACA AGTACATGCC GTTCATGGAG GAGGCGGGCA ACGCGAGAAT CTCCTCGGCG ATCGCGAGAT TCACCTACGG GCCGATCCTG CGGTGGGGCC GCAGCATCGA GATGAGACGC AGATACGACA AGGAGCCGGA GTGGCGCCAC AACCGCGCCG AACTCACCAC CGGCTACTCC AAGCGCTGGT AG
Protein sequence | MSTVEFTPRQ RETEAITAHV LWMTTGLGCD GDSVAMTSAT NPSLEDIITG AIPGMPRVVV HNPVIAYEQG EDFMRAWFAA ERGELDPFVL ILEGSLGNEK INGAGHWSGL GTDPSTGQPI TTNAWIDRLA PKAAAVVAVG TCATYGGIPA MAGNATGAMG LRDYLGWRWT SKAGIPIVNI PGCPAQPDNM TEMLVHLVFA LAGMAPVPEL DDAGRPTSLF GRTAHESCNR AAFYESGNFA TEYGSDHRCL VKLGCKGPVV KCNVPLRGWQ SGMGGCPNVG GICMACTMPG FPDKYMPFME EAGNARISSA IARFTYGPIL RWGRSIEMRR RYDKEPEWRH NRAELTTGYS KRW
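The relationship between the gene and protein sequences can be spot-checked by translating the first and last few codons. This is a minimal sketch, not the portal's own method: the `translate` helper and the codon table construction are illustrative. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so a standard table is used here. The two fragments are copied from the gene sequence above with the spacing removed, and the last fragment is assumed to be in frame because the stated gene length (1062 bp) is a multiple of three.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 12 bases of the gene sequence in the record.
first_bases = "ATGTCGACCG TCGAGTTCAC CCCGAGACAG"
last_bases = "AAGCGCTGGT AG"

print(translate(first_bases))  # MSTVEFTPRQ
print(translate(last_bases))   # KRW*
```

The translated fragments match the start (`MSTVEFTPRQ`) and end (`KRW`) of the protein sequence, with the final `TAG` codon read as a stop.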