Gene Cwoe_3939 details


Gene Information

Locus tag: Cwoe_3939
Symbol:
ID: 8734396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4181279
End bp: 4182436
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 69%
IMG OID: 646504563
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003395731
Protein GI: 284045391
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
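
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the CDS span includes the stop codon (both assumptions agree with the reported 1158 bp and 385 aa, but the record does not state them explicitly).

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4181279
END_BP = 4182436
STOP_CODON_NT = 3  # assumption: the CDS coordinates include the stop codon


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len_bp - STOP_CODON_NT) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1158
print(protein_length(length_bp))  # 385
```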


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00132412
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCAAGA CCCGTGCTGT GACGGCGGCC GTCGCCGCGC TGCTGATCGC CGTGACGGCG 
GGCTGCGGCT CGTCGACCGG CGGCTCCGGC GACGACTCCG CCGGTTCCAC GACCGGCGGC
GGCGACGCGG CGACGTCGGC GGTCGGCAAG CAGGGCACCG TCGACCAGCT GAGACCGATG
TCCGACTACT GCGGCGACAG ACCGATCACG GTCGGGATCG CCTTCGGCTT CGCCGGAAAC
ACCTGGTACA ACGTCGCCAG AGCCGAGTTC GAGGCCGCCG CGAAGGAGTG CCCGAACATC
GAGAAGGTGC TCTACGCCGA CGGGCAGAAC AACGCTCAGA AGGCGATCTC CGACATCCAG
AGCCTCGTCG CCCAGGGCGC CGACGCGCTC GTCGTCTTCC CGAACGTCAA CAGCAAGGCG
ATGCTGCCCG CGATCCGCGC GGCGACGCAG CGCGGCGTGA AGGTCGTCCC GGCCGTCGCG
AGCCCGGGCG GTGAGCCCGG CAAGGACTAC GTCGACTTCG TCGGCCAGAA CTCCGTCAAC
GACGGCAGAC AGATGGCCGA GTTCGCCGTC AGAGCGCTGA GAGGCAGAGG CAACGTCGTC
TTCCTCGGCG GCACGCCCGG CAACACGCAG AGCGCGGAGG AGTTCGAGGG CGCGAAGGAG
GTCTTCGACG CCAATCCGGA CATCAGAATC CTCGGCGGCC GGATCGTCGA CACCAACTGG
GACGTCGCGC AGTACCCGAG AATCGTCGGC GGCCTGCTGA CGAAGTACGG CGACATCGAC
GCCGTCCTGT CCGACTACGG CTCCGGCGCC GCCGCCGGCA TGCGCGCGTT CGTCAGCGCC
GGCAAGCCGC TGCCCGTCTT CACCGGCAGC GACGGCAACG AGTTCTCGTG CATGTACGAG
AGAAACAAGA GAACGAGCCC GGACATGCAG ATAGCGACGA TGTCGTCGCG CCCGTGGATC
ACGCGCGTCG CGCTGGCGAA GGCGGTCGCG GCGGCCGAGG GCGTGCCGAA CGAGGAGCCG
TCGCTGCTCG ACATCCCGCT CGTCGAGGAC TCGGTCGCCG GCGGCGAGCA GGCCCCGAGA
TGCGACGAGT CGCTGCCGCC GGACTTCTTC TGGTCCTCGA GACTGACGAA GGACGAGCAG
CTGAAGGCGT TCGGCTGA
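
The GC content field of this record is a simple ratio over the gene sequence: the number of G and C bases divided by the total number of bases. A minimal sketch follows, assuming the sequence is pasted as shown here with spaces between 10-base blocks; `gc_fraction` is a hypothetical helper name, and the example runs on the first block only rather than the full gene.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence: 4 G + 1 C out of 10 bases.
first_block = "GTGAGCAAGA"
print(gc_fraction(first_block))  # 0.5
```

Passing the whole gene sequence to the same function gives the value to compare against the reported percentage.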
 
Protein sequence
MSKTRAVTAA VAALLIAVTA GCGSSTGGSG DDSAGSTTGG GDAATSAVGK QGTVDQLRPM 
SDYCGDRPIT VGIAFGFAGN TWYNVARAEF EAAAKECPNI EKVLYADGQN NAQKAISDIQ
SLVAQGADAL VVFPNVNSKA MLPAIRAATQ RGVKVVPAVA SPGGEPGKDY VDFVGQNSVN
DGRQMAEFAV RALRGRGNVV FLGGTPGNTQ SAEEFEGAKE VFDANPDIRI LGGRIVDTNW
DVAQYPRIVG GLLTKYGDID AVLSDYGSGA AAGMRAFVSA GKPLPVFTGS DGNEFSCMYE
RNKRTSPDMQ IATMSSRPWI TRVALAKAVA AAEGVPNEEP SLLDIPLVED SVAGGEQAPR
CDESLPPDFF WSSRLTKDEQ LKAFG
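
The protein sequence follows from the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). In that table the amino acid assignments match the standard code, but GTG is among the permitted start codons, which is why the gene opens with GTG while the protein opens with M. The sketch below is a hand-rolled illustration, not the database's own pipeline; `translate_cds` is a hypothetical name, and it assumes an in-frame CDS whose first codon is the start codon.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate an in-frame CDS; an alternative start codon is read as M."""
    bases = "".join(seq.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(bases), 3):
        codon = bases[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon always yields methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAGCAAGA CCCGTGCTGT GACGGCGGCC"))  # MSKTRAVTAA
```

The output matches the first ten residues of the protein sequence listed in this record. Feeding in the full gene sequence should reproduce all 385 residues, with translation stopping at the terminal TGA.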