Gene Cwoe_0304 details


Gene Information

Locus tag: Cwoe_0304
Symbol:
ID: 8730732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 310434
End bp: 312344
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 72%
IMG OID: 646500918
Product: glycoside hydrolase 15-related protein
Protein accession: YP_003392115
Protein GI: 284041775
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
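
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based start/end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
gene_len = gene_length_bp(310434, 312344)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1911 636
```

Under these assumptions the result agrees with the reported gene length of 1911 bp and protein length of 636 aa.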


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.692086
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAATTCG AAGTCGCCAC CGCCGAGCCC GCCGCGCAGG AGATCCCGCT CGGCGGGAGC 
CCCTACCCGC CGATCGCGGA CTACGGCTTC CTGTCCGACT GCGAGGTCTG CGCGCTCGTC
GCCAAGAGCG GCAACGTCGA GTGGATGTGC CTGCCGCGGA TGGACGGGCC GAGCGTCTTC
GCCGGGATCC TCGACCGCCA CGCCGGCCGC TTCCGCGTCG GCCCGCGCGA CCAGGTCGTC
CCGGCCGACC GCCGCTACCT GCCCGGGACG ATGATCCTGG AGACGACGTG GGGCACTCCG
ACCGGCTGGT TGGTCGTGCG CGACGTGCTC GTCGTCGGTC CCTGGCACCA CGGCAGCGAA
CGCTCCGGCC GCTACGCGCG CTCGCCGCGC GACCACGAGG CCGCGCACGT GCTGCTGCGC
ACGATGAGGT GCCTGAACGG CTTCGTCGAC GTCGAGCTCG ACTGCGAGCC TGCATTCGAC
TACGCCCGCA AGCGCGGCAG CTGGGACTAC GCCGGCGACG GCTACGGCGA CGGCATCTGC
AGCGCCGAGG GCGTCCCGAC GCAGCTGCGC CTGCGGACCG ACATGCGGCT CGGCTTCGAG
GGCCCGCGCG CCCGCGCCGA GACGCGCATG AGAGCGGGTG ACACGCTCTT CTGCGCGCTC
GGCTGGTCGA GCCATCTGCC GCCGGAGACC TACGACGAGG CGCACTCGCA GCTGTCGGTG
ACCGGCAACT TCTGGCACGA GTGGATCAGC CGCGGCCGCT TCCCCGACCA TCCCTGGCGC
GTCTACCTGC AGCGCTCGGC GCTGACGCTG AAGGGCCTCA CGTACGCGCC GACCGGCGCG
ATGATGGCGG CGGCGACGAC GTCGCTGCCC GAGACGCCGG GCGGTGAGCG CAACTGGGAC
TACCGCTACA CGTGGCTGCG CGACTCGACG TTCATGCTGT GGGGCCTCTC GACGCTCGGC
TTCGACCGCG AGGCGCACGA CTTCCTCTAC TTCATCACCG ACCGGCTGGA GGCCGGAGGC
CGGCTCGGGA TCATGTACGG GATCGACGGG CGAGAACGGC TCGACGAGGA GATCCTCGAC
CACCTCGCCG GGTACGAGGG CGCCAAGCCG GTGCGGATCG GCAACGGCGC GTGGGACCAG
CGCCAGCACG ACGTCTGGGG TGTCCTGCTC GACTCGATCC GCCTCCACAT CCGCTCCGGC
GACCGCCTCG ACGACCGCCT CTGGCCGCTC GTCGTGCGGC AGGTCGACAC CGCGGTCGCG
GAGTGGCGCG AGCCCGACCG CGGCATCTGG GAGGTGCGCG GCGAGCCGCA GCACTTCACC
TCCTCGAAGA TCTTCTGCTG GGTCGCGGCC GATCGCGGCG CGCGCCTGGC GCGGCTGCGC
GGCGACCGCG ACGCGGCCAA GCGCTGGCGC GAGGCTGCGG ACGAGATGCA CGCCGAGATC
TGCGAGCGCG GGCTCGACGA CCGCGGCGTC TTCGTGCAGC ACTACGACAC CGATGCGCTC
GACGCCTCGC TGCTGCTGAT CCCGATGCTC GGCTTCCTGC CCGCGAGCGA CGAGCGCGTC
CGCAAGACCG TGCTCGCGAT CGCCGACGAG CTGACCGTCA ACGAGCTGGT GCTGCGCTAC
AAGGTCGCCG AGACCGACGA CGGCCTCACC GGCGAGGAGG GCTCGTTCGC GATCTGCTCG
TTCTGGCTCG TCTCGGCGCT GGTCGAGATC GGCGAGGTCC AGCGCGCCCG CGACCTCTGC
GACAAGCTGC TCTCCTACGC CAGCCCGCTC GCGCTCTACG CGGAGGAGAT CGACCCGCAC
TCGGGCCGCC ACCTCGGCAA CTTCCCGCAG GCGTTCACCC ACCTCGCCCT CATCAACGCG
GTGATGCACA TCATCCGCGC GGACGAGGCG CTCGTCGACG ACACGCACTA G
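
A GC content figure such as the one in the record is the fraction of G and C bases in the nucleotide sequence. The sketch below shows one way to compute it from a sequence laid out in space-separated blocks of ten, as above. The helper name `gc_fraction` is hypothetical, and the page does not say how its own value was computed or rounded.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence above; paste the full sequence
# to compare against the reported GC content for the whole gene.
first_line = "GTGGAATTCG AAGTCGCCAC CGCCGAGCCC GCCGCGCAGG AGATCCCGCT CGGCGGGAGC"
print(f"{gc_fraction(first_line):.0%}")
```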
 
Protein sequence
MEFEVATAEP AAQEIPLGGS PYPPIADYGF LSDCEVCALV AKSGNVEWMC LPRMDGPSVF 
AGILDRHAGR FRVGPRDQVV PADRRYLPGT MILETTWGTP TGWLVVRDVL VVGPWHHGSE
RSGRYARSPR DHEAAHVLLR TMRCLNGFVD VELDCEPAFD YARKRGSWDY AGDGYGDGIC
SAEGVPTQLR LRTDMRLGFE GPRARAETRM RAGDTLFCAL GWSSHLPPET YDEAHSQLSV
TGNFWHEWIS RGRFPDHPWR VYLQRSALTL KGLTYAPTGA MMAAATTSLP ETPGGERNWD
YRYTWLRDST FMLWGLSTLG FDREAHDFLY FITDRLEAGG RLGIMYGIDG RERLDEEILD
HLAGYEGAKP VRIGNGAWDQ RQHDVWGVLL DSIRLHIRSG DRLDDRLWPL VVRQVDTAVA
EWREPDRGIW EVRGEPQHFT SSKIFCWVAA DRGARLARLR GDRDAAKRWR EAADEMHAEI
CERGLDDRGV FVQHYDTDAL DASLLLIPML GFLPASDERV RKTVLAIADE LTVNELVLRY
KVAETDDGLT GEEGSFAICS FWLVSALVEI GEVQRARDLC DKLLSYASPL ALYAEEIDPH
SGRHLGNFPQ AFTHLALINA VMHIIRADEA LVDDTH
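
The gene begins with GTG, yet the protein begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), where an alternative start codon at the first position is read as methionine. The following is a minimal sketch of that translation in plain Python. The codon table is built from the standard code ordering, the set of start codons is an assumption based on the NCBI definition of table 11, and `translate_cds` is a hypothetical helper, not the method the database used.

```python
from itertools import product

# Standard genetic code in NCBI base order (T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized start codon as M and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGAATTCG AAGTCGCCAC CGCCGAGCCC"))  # MEFEVATAEP
```

The output matches the first ten residues of the protein sequence above. The same function can be applied to the full gene sequence for a complete comparison.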