Gene Cwoe_3349 details

Gene Information

Locus tag: Cwoe_3349
Symbol:
ID: 8733798
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 3562835
End bp: 3564514
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 70%
IMG OID: 646503966
Product: extracellular solute-binding protein family 5
Protein accession: YP_003395142
Protein GI: 284044802
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
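
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3562835, 3564514)
print(length_bp, protein_length(length_bp))  # prints: 1680 559
```

Both results agree with the Gene Length and Protein Length fields listed for Cwoe_3349.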


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.109598
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0359992
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGAGA TCGTGCGCGC CGAGATGACC CGGCGCGAGG CGCTGCGGCG CCTCGGCGTC 
GGCGGCGCCG TGCTGACGCT GCCCGCGCTG CTGGCCGCGT GCGGCTCCGG CGGCTCCGGC
TCGACGTCGG CGTCGGGCGG GACGAGAGCG AGCGGCAGCG TCGCCGGCGC AGGCACCGAC
GCGGAGATCG ACCACGTCAC GTGGTCGCTC GGTGGCACCC CGCCGACGCT CGACATCGCG
ACCGGCAACC TCACCGTCGG CGAGATGGTG ATGGCGCTCG GGATGGAGAC GCTGATGGGC
CTCGACGACA GACTGCGGCT CAAGCCGGTG CTGGCCGAGT CCTCCGAGGA GCCCGACCCG
CGCACCTACG TCTACAGACT GCGCGAGGGC GTCAGATTCT GGGACGGCTC GCCGCTGACG
GTCGACGACG TCGTCTGGTC GCTGCGGCGG CACATGGACC CGAAGGTCAG CTCGCAGATC
TCGACGTACT TCACCCATGT GCGCTCGATA GAGGCGACCG GGCCGCGCGA GGTGACGGTG
CGGATGAAGC AGCCGGACCC GCTGTTCCCG TACGCCCACG TCCACATCTT CATCATGCCG
AAGGCGTTCG GCGAGAGACT TGGCAAGAAG CTCGGCGCGC CGGCCGCGAC CGTCAGCGTG
ATGGGAACCG GGCCGTACAG AATCACGTCG TTCACCGGTG ACAACGAGAT CGTCGTCGAG
CGCAACGACG ACTACTGGGG TGAGCGCCAG CGCGTGCGCA GAGCGTCACT GAAGTTCATC
GGCGACCCGC GCACGAACCT GCTCGCGATG CGCTCCGGCG AGATCGACGG GATGTTCGAG
TTCGCGATCA GCACGGCGAG AGAGTGGGAC CGGCTGCCGG ATGCGAGAAC CGAGTGGGCG
CCGGGGATGA GCGTCGTGCT GCTCTCGTTC GACCTCTCGC AGGCGCCGTG GAACGACGTC
CACGTGCGCA GAGCGGTCGC TCATGCGGCC GACCGCGCCG GCTACGTGCG CGCGTTCCTC
GGCGGGCACG GCGAGCCGGC GACGACGATC CCCGCGCCGC TCCAGTGGGG CGACGTGGCG
ACGCCGGACG AGGTCAGAGC GATCTACGCG AGGCTGCCCG CCTACGCGTA CGACCTCGAG
GCCGCGAAGG CCGAGCTGGC GAAGTCCCAG CACCCGGACG GCTTCACCGC CGACGTCGTG
TTCCCCAACT CGGCCGCCCC GGCCGGCCGC GCGCTCGTGA GCCTGTCCGA GACGCTCAAG
CAGCTCGGCA TCACGCTCAA CGTCCGCGAG GTGCCGCAGA ACACGTGGCT GGCGAAGCTG
TACGCGCACA AGGACCTTGG GCTCCAGTAC CTGCGGCTGT CGCCCGACTA CGTCGACCCG
TCGAACTTCC CGGGCGCGCT GCTGCCGAGC GCGAACGCGG TCCCGAACAA CTTCAACCTC
GCGAACTTCA GAGACCCCGA GGTCGACCGC CTGCTCGCGC AGCAGAGCAG GACGACCGAC
GCCGCGGCGC GAACGCAGGC GCTGACGCGG GTGCTGCAGA TAGCCGGCGA GCAGCTGCCC
TATCTGCCGC TGTGGTGGGA GAGCGTGCCG ATGGGTCTCG CCGACAGATT CGTCTACGAG
GGCTTCAACC CGATCTACTA CGCGGAGAAC TGGCTCGGCA AGCTGCGCGT GCGCGCATGA
 
Protein sequence
MNEIVRAEMT RREALRRLGV GGAVLTLPAL LAACGSGGSG STSASGGTRA SGSVAGAGTD 
AEIDHVTWSL GGTPPTLDIA TGNLTVGEMV MALGMETLMG LDDRLRLKPV LAESSEEPDP
RTYVYRLREG VRFWDGSPLT VDDVVWSLRR HMDPKVSSQI STYFTHVRSI EATGPREVTV
RMKQPDPLFP YAHVHIFIMP KAFGERLGKK LGAPAATVSV MGTGPYRITS FTGDNEIVVE
RNDDYWGERQ RVRRASLKFI GDPRTNLLAM RSGEIDGMFE FAISTAREWD RLPDARTEWA
PGMSVVLLSF DLSQAPWNDV HVRRAVAHAA DRAGYVRAFL GGHGEPATTI PAPLQWGDVA
TPDEVRAIYA RLPAYAYDLE AAKAELAKSQ HPDGFTADVV FPNSAAPAGR ALVSLSETLK
QLGITLNVRE VPQNTWLAKL YAHKDLGLQY LRLSPDYVDP SNFPGALLPS ANAVPNNFNL
ANFRDPEVDR LLAQQSRTTD AAARTQALTR VLQIAGEQLP YLPLWWESVP MGLADRFVYE
GFNPIYYAEN WLGKLRVRA
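
The gene and protein sequences above are printed in space-separated blocks of ten letters. The sketch below shows one way to strip that layout, compute GC content, and translate the nucleotides. It is an illustration rather than the database's own method: the helper names are hypothetical, and it assumes the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ only in which codons may act as start codons). Only the first 60 bases of the gene are used as the excerpt.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G); '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def normalize(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon.

    A trailing partial codon, if any, is ignored.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record above.
excerpt = normalize(
    "ATGAATGAGA TCGTGCGCGC CGAGATGACC CGGCGCGAGG CGCTGCGGCG CCTCGGCGTC"
)
print(translate(excerpt))  # prints: MNEIVRAEMTRREALRRLGV
print(round(gc_fraction(excerpt), 2))
```

The translated excerpt matches the first twenty residues of the protein sequence listed in the record. Running the same functions on the full 1680 bp gene sequence is a way to check the Protein Length and GC content fields.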