Gene Cwoe_2028 details


Gene Information

Locus tag: Cwoe_2028
Symbol:
ID: 8732471
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2129977
End bp: 2131014
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 66%
IMG OID: 646502647
Product: periplasmic binding protein/LacI transcriptional regulator
Protein accession: YP_003393829
Protein GI: 284043489
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
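
The coordinates and lengths listed above can be cross-checked with a short calculation. The sketch below assumes the start and end positions are 1-based and inclusive (a common convention, but not stated in this record) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2129977
end_bp = 2131014

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 1038
print(protein_length_aa)  # 345
```

Both results agree with the Gene Length and Protein Length fields of the record.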


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCGG CCCTGATCGC TGCTGGCTGC GGAAGCAGCG GCGACGACAC GACGTCGACG 
ACATCGACGG GCGTGTCGAC GTCCGCCGGG ATGTCCCCCA CCGCGTCGGG CGGGATCACC
GTCGGATACT CGGACCCGGT CGCATCGAAC CCTGCTCAGC AGGCTGTGGC CCGCGGACAG
GAGGAAGCGG CGAAGGAGTT CGGCTGGGAC CTCGTGCACT TGGACGCGAA CCTGTCCGCG
TCCAAGCAGG TCTCGGACAT CGACACGCTC ATCTCGAAGA AGGTCGATGC GATCAACTCG
TTCACCATCG ATCAGGGTGC CGCCGATGCC GTCTATCAGC GCGCGAGCCA GGCGGGCATC
CCCGTGATTG GACAGTCATC CAGGTCCAAG TACATCCAGT CGTCGGTCTG GAACCAGCAG
AACTTCGACT GCAGCGTGGC CGCGAAGGCG GCGGCGTACA TCAACGCGCG CACGCCCGGG
GCGAAGACGC TGGTCATCGG CGGGCCGCCC GTCGGCGCGA TCACGCAGTA CGTGAATTGC
TTCCAGGACG AGGCGGAGAA GGCCGGCCTG GACGTGCTCG AGAAGAAGGA CAACACGACC
GACACCGCCT CGGGCGGGCA GCCGATCGCG GCCGCGCTGA TCAACAAGCA CCCCGACGTG
CAGGCGATCT GGTGTTACAA CGACCCGAGC TGCCTCGGTG CCGGCAACGC CCTCAAGGCG
GCGGGCAAGA AGATCTGGAA GCAGGGCGAG TCGGACTCCG GCGTCATCGT GATCGGCTCG
AACGGGTCGA CCGACGGGAT CAGCGCCATC AAGAGCGGTT TGATGACGGT CTCCTACGAC
ATCAACCCCG ACAAGGTCGG CGCGTCGGTG ATCGCGCTGC TGGCCAAGCA CTTCGAGGAT
GGCGTGCCCG TGAAGGACCT GCCCAAGGAC GTCGTGGTCC CGACCACGGA ATGGGACGCT
TCCAACGTCG GTGACTACGT CGATCCGATC AAGCGCTCGA TCGACACCAA GACCGTCGAC
GTCGACGGCC AGGGCTGA
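
The GC content reported in the Gene Information section can in principle be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as plain text with the 10-base spacing shown above. `gc_fraction` is a hypothetical helper name, not part of any database tooling.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence above, used here only as example input.
# Paste all rows of the sequence to recompute the value for the whole gene.
first_row = "ATGAGCGCGG CCCTGATCGC TGCTGGCTGC GGAAGCAGCG GCGACGACAC GACGTCGACG"
print(f"{gc_fraction(first_row):.0%}")
```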
 
Protein sequence
MSAALIAAGC GSSGDDTTST TSTGVSTSAG MSPTASGGIT VGYSDPVASN PAQQAVARGQ 
EEAAKEFGWD LVHLDANLSA SKQVSDIDTL ISKKVDAINS FTIDQGAADA VYQRASQAGI
PVIGQSSRSK YIQSSVWNQQ NFDCSVAAKA AAYINARTPG AKTLVIGGPP VGAITQYVNC
FQDEAEKAGL DVLEKKDNTT DTASGGQPIA AALINKHPDV QAIWCYNDPS CLGAGNALKA
AGKKIWKQGE SDSGVIVIGS NGSTDGISAI KSGLMTVSYD INPDKVGASV IALLAKHFED
GVPVKDLPKD VVVPTTEWDA SNVGDYVDPI KRSIDTKTVD VDGQG
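
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this with a hand-built codon table. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons are permitted, so the sketch does not handle alternative start codons. The names `CODON_TABLE` and `translate` are illustrative, not an existing API.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order.
# Each letter of AMINO_ACIDS lines up with one codon: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored. Characters other than A, C, G, T raise KeyError.
    """
    bases = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGAGCGCGG CCCTGATCGC TGCTGGCTGC"))  # MSAALIAAGC
```

The printed residues match the first ten residues of the protein sequence above. Passing the full gene sequence to `translate` would perform the same check for the whole protein.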