Gene Cwoe_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2540 
Symbol 
ID8732983 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2704584 
End bp2705645 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content69% 
IMG OID646503155 
ProductNADH ubiquinone oxidoreductase 20 kDa subunit 
Protein accessionYP_003394337 
Protein GI284043997 
COG category[C] Energy production and conversion 
COG ID[COG1740] Ni,Fe-hydrogenase I small subunit 
TIGRFAM ID[TIGR00391] hydrogenase (NiFe) small subunit (hydA) 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.733354 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACCG TCGAGTTCAC CCCGAGACAG CGCGAGACGG AGGCGATCAC CGCTCACGTG 
CTGTGGATGA CGACCGGGCT GGGGTGCGAC GGCGACTCGG TCGCGATGAC CTCGGCGACG
AACCCGAGCC TCGAAGACAT CATCACCGGC GCGATCCCGG GGATGCCGAG AGTGGTGGTC
CACAACCCGG TGATCGCCTA CGAGCAGGGC GAGGACTTCA TGAGAGCGTG GTTCGCCGCC
GAGCGCGGCG AGCTGGACCC GTTCGTGCTG ATCCTCGAAG GGTCGCTCGG CAACGAGAAG
ATCAACGGCG CCGGGCACTG GTCCGGCCTC GGGACCGACC CGTCGACCGG CCAGCCGATC
ACGACGAACG CGTGGATCGA CCGGCTTGCG CCGAAGGCGG CGGCGGTCGT CGCGGTCGGC
ACCTGCGCGA CGTACGGCGG CATCCCGGCG ATGGCTGGCA ACGCGACCGG CGCGATGGGG
CTGCGCGACT ACCTCGGCTG GAGATGGACG TCGAAGGCGG GGATCCCGAT CGTCAACATA
CCCGGCTGCC CGGCGCAGCC GGACAACATG ACCGAGATGC TCGTCCACCT CGTCTTCGCG
CTCGCGGGGA TGGCGCCGGT GCCGGAGCTG GACGACGCCG GCCGCCCGAC CTCGCTGTTC
GGGCGCACCG CGCACGAGAG CTGCAACCGC GCCGCGTTCT ACGAGTCGGG CAACTTCGCG
ACCGAGTACG GCTCCGACCA CCGCTGCCTC GTCAAGCTCG GGTGCAAGGG ACCGGTCGTC
AAGTGCAACG TCCCGTTGCG CGGCTGGCAG AGCGGGATGG GCGGCTGCCC CAACGTCGGC
GGCATCTGCA TGGCGTGCAC GATGCCCGGC TTCCCCGACA AGTACATGCC GTTCATGGAG
GAGGCGGGCA ACGCGAGAAT CTCCTCGGCG ATCGCGAGAT TCACCTACGG GCCGATCCTG
CGGTGGGGCC GCAGCATCGA GATGAGACGC AGATACGACA AGGAGCCGGA GTGGCGCCAC
AACCGCGCCG AACTCACCAC CGGCTACTCC AAGCGCTGGT AG
 
Protein sequence
MSTVEFTPRQ RETEAITAHV LWMTTGLGCD GDSVAMTSAT NPSLEDIITG AIPGMPRVVV 
HNPVIAYEQG EDFMRAWFAA ERGELDPFVL ILEGSLGNEK INGAGHWSGL GTDPSTGQPI
TTNAWIDRLA PKAAAVVAVG TCATYGGIPA MAGNATGAMG LRDYLGWRWT SKAGIPIVNI
PGCPAQPDNM TEMLVHLVFA LAGMAPVPEL DDAGRPTSLF GRTAHESCNR AAFYESGNFA
TEYGSDHRCL VKLGCKGPVV KCNVPLRGWQ SGMGGCPNVG GICMACTMPG FPDKYMPFME
EAGNARISSA IARFTYGPIL RWGRSIEMRR RYDKEPEWRH NRAELTTGYS KRW