Gene Cwoe_0456 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0456 
Symbol 
ID8730884 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp471717 
End bp473312 
Gene Length1596 bp 
Protein Length531 aa 
Translation table11 
GC content69% 
IMG OID646501070 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003392267 
Protein GI284041927 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.15867 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGA AGCTCGCCAG AGGCGTCGCG ACGTTGCTCG CGGCCGCCCT CGTGGTCGCC 
GGATGCGGCG GGACCAGCAG CAACAACGAC AGCACCGGTT CGACCGGCTC CGGCGGCGGC
ACCTCGGCGT CGTTCGACGC TGAGGCGACG ATTCGCACCG CGCAGTTCGG CGATGGCGTC
GGCGGGATGG ACCCGGAGAT CTGGTACGAC CTCAACGGCG GGTCGCTGCA CACCGCCGTC
TACGAAGGGC TGCTGCGCTA CAAGACCGGC ACGACCGAGA TAGAGCCCGC GCTGGCCGAG
TCCTACGAGG TCAGCAGAGA CGGCAGGACC TATACGTTCA AGCTTCGTCA GGGCGTCAGA
TTCCACGACG GCACGCCGCT GACGCCGGAG GCGGTCGCCG GCTCGTTCGC GCGCCGCGCC
GCGCTCAGAG GCCCGTCGGC GTACCTGACG GCCGGGGTCG CCGACGTGCG GCCGCGCGGG
AGCGACACGG TCGTGATCAA GCTGAGAAGC CCGGAGATCG GGTTCCTCGA CGCGCTCGCC
TCGATCTACG GGCCGCGCGT GATCAGCCCT GCGGCGCTCA GAGCGCACGG CGCGGGCGAG
GACGGCAAGC GCTGGTTCGC GTCCAACGCC GTCGGCACCG GGCCGCTGAG ACTGCTCAGC
TTCAGACTCG GCGACGGCGC GACGCTCGAG CGCTTCGACG GCTACTGGGG CGAGCAGGCC
AAGGCGAAGC GCTACGAGGT CGACACGCTG CCGAGCAGCG GCGAGCAGCA GCTGCAGCTG
CGCTCCGGCC AGCTCGACTA CCTCAGCGGC GGCTCGCTGC AGCCGGCGCA GCTGAAGGCG
TTCGACGGCA ACCCGAGATA CGAGGTGACG CGCCTGGACC AGGCGTTCCG GCCGATGCTC
GTGCTGAACA CGAACAAGCC GCCGTTCGAC GACGTCGAGA AGCGCAAGGC GTTCGTCGCG
GCGCTCGACG TCGACGCCGC GATCAGACAG GTGTGGGGCG ACGAGCTGAT GGAGGCGCCG
ACCTCCTACA TCTCGCCGTT CCTGCTCGAC CCGGCGCTGA ACAGAATCGA GCCGCTGACC
AGCGACGCCG CGCTCGACGA GCCGGTCACG TTCGAGTACG TCGGCGCGAT CCAGTCGCAC
CGTCAGTTCA GCGAGGTGAT CCAGCAGCAG CTGCGCGACG AGGACGTCGA GCTGAGACTG
AGCGCGACGA CCGGCGGCGA GGTCTTCTCG TGGCCGCAGG ACGTGCAGAA GGCGCCGAAC
GCCGCGATCG TCACCGTCTA CGGCGACTCG GCGTACGTGC AGAGCCTGGT CGACCCGTTC
TTCCGCACCG GCTCCGCGGT CAACTTCCTC GGCTACTCGA ACAGAACGGT CGACGCCACG
CTCGACGAGG CGGCCATCCA GACCGACCGT GACAAGGCGC TCCAGCTGTT CGCCGACGCG
AACAGAATCG TCGCGGTCGA CGACGCGTCG ATCATCCCGC TCGGCGACCT CAAGCAGCCG
ATCGTGGCGC GCAGGGGCGT CAGCGGCTTC CAGGGCACGC CGACGTCGAT CGACGTCGTG
CAGCTGGCCG CGATCGGGAA GTCCGCGGAC GCGTGA
 
Protein sequence
MTMKLARGVA TLLAAALVVA GCGGTSSNND STGSTGSGGG TSASFDAEAT IRTAQFGDGV 
GGMDPEIWYD LNGGSLHTAV YEGLLRYKTG TTEIEPALAE SYEVSRDGRT YTFKLRQGVR
FHDGTPLTPE AVAGSFARRA ALRGPSAYLT AGVADVRPRG SDTVVIKLRS PEIGFLDALA
SIYGPRVISP AALRAHGAGE DGKRWFASNA VGTGPLRLLS FRLGDGATLE RFDGYWGEQA
KAKRYEVDTL PSSGEQQLQL RSGQLDYLSG GSLQPAQLKA FDGNPRYEVT RLDQAFRPML
VLNTNKPPFD DVEKRKAFVA ALDVDAAIRQ VWGDELMEAP TSYISPFLLD PALNRIEPLT
SDAALDEPVT FEYVGAIQSH RQFSEVIQQQ LRDEDVELRL SATTGGEVFS WPQDVQKAPN
AAIVTVYGDS AYVQSLVDPF FRTGSAVNFL GYSNRTVDAT LDEAAIQTDR DKALQLFADA
NRIVAVDDAS IIPLGDLKQP IVARRGVSGF QGTPTSIDVV QLAAIGKSAD A