Gene Cwoe_2037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2037 
Symbol 
ID8732480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2138305 
End bp2139480 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content68% 
IMG OID646502656 
Productperiplasmic binding protein/LacI transcriptional regulator 
Protein accessionYP_003393838 
Protein GI284043498 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.476065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAGAA CGCTGTGGCG AGCCGGAGCG CTCGCCGAAG ACGCCGCCGA GCGCGCGGAT 
CGTGAGGGAG CCAGCAGCGC GCGCAGGCTC GTCTCGGCTC GTCGGGCCGG ACTGGCAGCG
CTCGCGCTGG CGGGGCTGGT CGTGGCCGGG TGCGGCTCGG GCGACAAGGG CAGCGGCGGT
GGCTCGACCG CGGCGTCGAC GTCGCAGCAC GACAGGAAGA TCGTCGTGGG CTACTCCGAC
CCTGTCGGAT CGAATCAGGC CCAGCAGGCG GTCTATCGCG CGCAGAAGGA GGCGGCCAAG
CAGCTGGGCT GGGAGATCGT GCACCTGGAC GCGAACCTCT CGCCGTCCAA GCAGCTGTCG
GACATCGATT CGATGATCTC GCGCAAGGTC GACGCGATCA ACTCGTGGAC GTTGGAAGAG
GGCGCCGCCG ACGCCGCCTA CCGGCGGGCC GTGGAGGCGG GGATCGTGAT CGTTGGGCAG
TCGACCGCCT CGCCGTACAT GAGCTCGACG GTGTGGCTCC AGCAGAATTA CGGGTGCAGC
CTCGCCAAGA TGGGCGCAAA GTACATCGCC GACCGCCGCC CTGGTGCCAA GACGCTCGTC
ATCGGCGGCC CACCGGTGAA GGCGATCACC CACTACGCCC AGTGCTTTCT GGACGCGGCC
AAGGCGGCCG GCCTCACCGT CCTCGACAAG AAGGACAACA TGGCCGACAC GGCGGCTGGA
TCGCAGCCCA TCGCGGCGGC GATGGTCAAC CAGCATCCTG ACGTGGAGGC CGTCTGGACC
TACAACGATC CGACGGCGCT CGGTGCCGGC AACGCGCTGA AGGCGGCGGG CAAGCAGGTC
TGGCAGGAGG GCAAGAGCGA CGACGGGGTG ATCGTCATCG GGTCCAACGG CACCGAGGAA
GGCATCCAGG GGATCAAGAG CGGGCTGATG ACCGTCACCT ACGACATGCA CCCCGACGTG
ATCGGCACCC AGATCATCGC GGTGCTCGCC AAGCATTTCC GCGACGGTGT GCCCGCCAAG
GACCTCCCGA AGAACGTCGT CGTCCCGACG ACCAAATGGG ACCTCTCGAA CGTCGCTGAC
TACGTCGACC CCATGAAGCG CCCCATCAAG TTGGGCGCCG TGCTGGGCAC TGGCGAGAAC
TCGGCCGGCC AGGGCGACCA CGAGATCACG AGATGA
 
Protein sequence
MTRTLWRAGA LAEDAAERAD REGASSARRL VSARRAGLAA LALAGLVVAG CGSGDKGSGG 
GSTAASTSQH DRKIVVGYSD PVGSNQAQQA VYRAQKEAAK QLGWEIVHLD ANLSPSKQLS
DIDSMISRKV DAINSWTLEE GAADAAYRRA VEAGIVIVGQ STASPYMSST VWLQQNYGCS
LAKMGAKYIA DRRPGAKTLV IGGPPVKAIT HYAQCFLDAA KAAGLTVLDK KDNMADTAAG
SQPIAAAMVN QHPDVEAVWT YNDPTALGAG NALKAAGKQV WQEGKSDDGV IVIGSNGTEE
GIQGIKSGLM TVTYDMHPDV IGTQIIAVLA KHFRDGVPAK DLPKNVVVPT TKWDLSNVAD
YVDPMKRPIK LGAVLGTGEN SAGQGDHEIT R