Gene Cwoe_4991 details

Gene Information

Locus tag: Cwoe_4991
Symbol:
ID: 8735457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 5322042
End bp: 5323199
Gene Length: 1158 bp
Protein Length: 385 aa
Translation table: 11
GC content: 71%
IMG OID: 646505618
Product: ABC-type sugar transport system periplasmic component-like protein
Protein accession: YP_003396777
Protein GI: 284046437
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
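
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming one terminal stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5322042, 5323199)
print(length)                  # 1158
print(protein_length(length))  # 385
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.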


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.575468
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGCGAG CAAGCGAAGC GCCCGACGGG CGCGAGGACC GCCGTACGTC GCGCCGCAGG 
TTCCTGGAGG CGGCCGGCCT CGTCGGCATC GGCGGGCTGG CGCTGTCCGC CTGCGGCGGC
GGTGACGACG ACACCGGCGT CTCGACCGCG AACGCGAGCG GAAGCGGCAC GTCGACCGCG
GCCGCGAGCG GCGCGGCGGC CGGCCTGCTG GGCGAGCCGA AGAAGGTGAT CTGGGCGCTC
GCGGCGATCG CGCCGTGGAA CCTCCAGCTC GACGTCGGCT TCACCGAGGC GACGCGGACG
CTCGGCTGGG AGTACCAGAA GGTCGGCGTG CCGATCGCGC AGTACTCGGC CGAGTCGGTC
GTCAACGTCG TCAGCCGCGC CGCCGCGGCG CGGCCCGACG TGCTCGTCAC GCCGGCGTGG
GTCCCGGGGC TGGAGAAGGT CGTGCGCAAG GCGCAGGACG ACGGGATCCT CGTGATGTTC
AACAACGCCA ACAACCTGCC GGAGCTGTCG AACGAGCTGG GCATCGCGTT CATCGGCGCC
GACGAGTACG AGGGCGGGAA GGCGCTCGCG CCGGTGCTGT TCGAGGCGAT GAGAAGAGCC
GGCAAGAGCG ACGGCGTCGT GCTCGGCGGC CTCCCGTTCC CCGGCAACGA CAACGTCGAG
ATGCGGCTGG AGGGCGCGCG CGACGGGCTG ATGGAGCTGA ACAGACTGCA CGGCACGAAC
TTCGAGTACG AGCGCTTCAT CGACGGCTCG GCGAGAGGCA ACGCGCCCGC GCAGACGGCC
TACAGAGCGA AGCTGCGCCA GCTCGGTGAC GACGGCGTCG CCGGGATCCT GTCGATCTCC
GACGTCAACG GCGTGATCCC CGCGCTGCGC TCCTCCGGCG TCGAGCCCGG CGAGATCCCG
ATCGGCACGT GGGACCTGCT GGAGTCGACG ATCACCGGCA TCGAGCAGGG CTGGGTCGCC
GGTACCGTCA ACGCGCAGCC CTACCAGCAG GGCTACATCC CGGTGATGCT CGCGTGGCAG
GAGTTCGAGC GCGGCCAGAC GCCGCGCGAC TACGACAGCG GCGGCGCGAT CGTCACGCAG
GCGACGATCG CCGCGACGGC GAGAGCCGAG GCGATCATCC GCGACAAGGC GAAGGAGTAC
GACATCAAGC TGACGTGA
 
Protein sequence
MERASEAPDG REDRRTSRRR FLEAAGLVGI GGLALSACGG GDDDTGVSTA NASGSGTSTA 
AASGAAAGLL GEPKKVIWAL AAIAPWNLQL DVGFTEATRT LGWEYQKVGV PIAQYSAESV
VNVVSRAAAA RPDVLVTPAW VPGLEKVVRK AQDDGILVMF NNANNLPELS NELGIAFIGA
DEYEGGKALA PVLFEAMRRA GKSDGVVLGG LPFPGNDNVE MRLEGARDGL MELNRLHGTN
FEYERFIDGS ARGNAPAQTA YRAKLRQLGD DGVAGILSIS DVNGVIPALR SSGVEPGEIP
IGTWDLLEST ITGIEQGWVA GTVNAQPYQQ GYIPVMLAWQ EFERGQTPRD YDSGGAIVTQ
ATIAATARAE AIIRDKAKEY DIKLT
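
The sequences in this section are printed in blocks of ten residues separated by spaces, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-fraction calculation for a nucleotide sequence. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the input contains only sequence letters and whitespace.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGAGCGAG CAAGCGAAGC GCCCGACGGG CGCGAGGACC GCCGTACGTC GCGCCGCAGG"
print(len(clean_sequence(first_line)))  # 60
print(gc_fraction(first_line))
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.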