Gene Cwoe_0494 details


Gene Information

Locus tag: Cwoe_0494
Symbol:
ID: 8730922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 512479
End bp: 514089
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 70%
IMG OID: 646501107
Product: extracellular solute-binding protein family 5
Protein accession: YP_003392304
Protein GI: 284041964
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
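
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 512479
END_BP = 514089
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

If the coordinates were 0-based or half-open, the first check would be off by one, so it also works as a quick test of the coordinate convention.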


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.080678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.301708
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAAG TCAAGAAGGT GCTGCCGTCG GTCCGTGGAC GCGGGGTGAG CCGGCGCGCT 
TTCCTGAACG AGGGTGCGGC ATGGGGGCTC TCGGCCTCGA CGGTCGGACG GCTGCTCGCC
GTCGGCGCCG CCCCGGCCGG CCTGCTCGCG GGCTGCGGGA CCGACTCCGA GAGCGCGAGC
GGCGGGGGCG GAGGCGGCGG AGCCGCCGGG ATCATCGCGA TCGGCAACGC CGAGCCGCCG
ACGTCGGCCT ACTGGGACCC CCACGCCCAG TTCGGCATGG CCGACACGCA GCTCTGGTCG
CTGACGTACG ACATGCTGCT CAGCTACGAC AAGTCCGGGC GCGTCGTCGG CGGCCTGGCG
CGGCGCTGGC ACCGTACGAG CCCGCAGCGC ATGCGCTTGG AGCTGCGCGA GGACGCCCGC
TTCCAGGACG GCGCGCCGGT GCTCGCGAAG GACGTCAAGG CGAGCCTCGA CCGGCTCGGC
GATCCGGAGT CGAGACTCGT GCTGTCCGCC TACGCGACCC CGGGCATGAG AGTCGAGGTG
ATCGACGAGC ACACGATCGA GATCGTCACC CCGCGCCCGT TCGGACCGCT GGAGTCGGCG
CTGACGCTGT TCGCGATCGC GCCCGCGAGA GACATCGCGC AGCCGGATGT CTTCAGAGAG
CGGCCGCTCG GCAGCGGCCC GTTCAGATTC GTCCGCTACA AGAACAACGT CGTCGAGCTG
GTCGCGAACG AGAGATACTG GCGCGGCAAG CCGGCCTCCA GAGGCGTCGA GCTGCGCTAC
ATCGCCGACC CCGAGGCGCG TCTCAACGCG CTGCTGACGG GCGCGATCGA CATCTACACG
CGCGGCAGCT CGCTGACGCT CGACGCGACG AAGAAGGACG GCTACCACGT CACCACGACC
GGCCCGGCCA GCCAGCTGAT CTACATCCCG CAGCACAACA CGGAGCTCAG CGACCCGCGT
GTGCGGCAGG CGATCGCCCA CGCGATCGAC CGCCGCGCGA TCGCCAAGAG CCTGATCAGA
ATCGACCCGC CGGCGCGGTC GAGCCTGCCC GCCGGCACCG ACGGCTTCCG CCCGCTGGCG
CCGAGCTTCG AGTACGACCC CGACAAGGCC CGCAGACTGC TCGCCGACGC CGGCCACGCG
AACGGGCTGA AGATCACGAT GGCGTCCTCG AACCTCGTCA CGCACCAGCC CGCGATCGAC
CAGCTCGTCA AGAGCTGGCT GGAGGAGGTC GGCATCGAGG TCGAGCTGAG AACACTCGAG
ACCGGCACGT TCCGCAGCTC CTACAACCAG TACGCGCTGT CGTTCAACGC GCTCGGCACG
ATGAACCCCG ACCCCGACTC GCTGCTGACG TTCTTCCGCC CGGTCGTCGC GCAGGCGGCG
CTGAACCTCG ACGACCCGAA GATCGGGCGG CTGCTGCAGC GCACGCGCGA GACGACCGGT
GCCGCGCGAC GCGCTGCGAT CGACGCGTAC GCGTCGTATC TGTGGCAGAA CCAGATCATG
ATCTACGTCA CCGACGACAT CTGGTTCACG GTCGTGAATC CGAAGCTGCG CAACTACCAC
CGCACCCCGC AGCAGGGAGA GCCGCTCCTG TGGCGCGCGT CGAAGGCGTG A
 
Protein sequence
MDEVKKVLPS VRGRGVSRRA FLNEGAAWGL SASTVGRLLA VGAAPAGLLA GCGTDSESAS 
GGGGGGGAAG IIAIGNAEPP TSAYWDPHAQ FGMADTQLWS LTYDMLLSYD KSGRVVGGLA
RRWHRTSPQR MRLELREDAR FQDGAPVLAK DVKASLDRLG DPESRLVLSA YATPGMRVEV
IDEHTIEIVT PRPFGPLESA LTLFAIAPAR DIAQPDVFRE RPLGSGPFRF VRYKNNVVEL
VANERYWRGK PASRGVELRY IADPEARLNA LLTGAIDIYT RGSSLTLDAT KKDGYHVTTT
GPASQLIYIP QHNTELSDPR VRQAIAHAID RRAIAKSLIR IDPPARSSLP AGTDGFRPLA
PSFEYDPDKA RRLLADAGHA NGLKITMASS NLVTHQPAID QLVKSWLEEV GIEVELRTLE
TGTFRSSYNQ YALSFNALGT MNPDPDSLLT FFRPVVAQAA LNLDDPKIGR LLQRTRETTG
AARRAAIDAY ASYLWQNQIM IYVTDDIWFT VVNPKLRNYH RTPQQGEPLL WRASKA
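
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of stripping that layout, computing GC content, and translating codons with the sense-codon assignments of translation table 11. It assumes an in-frame, uppercase-compatible DNA string; the helper names are hypothetical. Table 11 also permits alternative start codons that are read as M at initiation, which this sketch does not handle (this gene starts with ATG, so it is not affected).

```python
from itertools import product

# Codon table in NCBI ordering (bases T, C, A, G; third position varies fastest).
# The amino acid assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence as displayed in this record.
first_row = "ATGGACGAAG TCAAGAAGGT GCTGCCGTCG GTCCGTGGAC GCGGGGTGAG CCGGCGCGCT"
print(translate(first_row))  # MDEVKKVLPSVRGRGVSRRA
```

The printed residues match the first 20 residues of the protein sequence shown above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.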