Gene Cwoe_3425 details


Gene Information

Locus tag: Cwoe_3425
Symbol:
ID: 8733874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 3653713
End bp: 3655404
Gene Length: 1692 bp
Protein Length: 563 aa
Translation table: 11
GC content: 65%
IMG OID: 646504042
Product: extracellular solute-binding protein family 5
Protein accession: YP_003395218
Protein GI: 284044878
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
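
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are inclusive and that the final codon is an untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3653713
end_bp = 3655404
protein_length_aa = 563

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is the stop codon,
# which is not translated, so the protein is one residue shorter.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(gene_length_bp, remainder, expected_protein_aa)  # 1692 0 563
```

Under these assumptions the computed values agree with the listed Gene Length (1692 bp) and Protein Length (563 aa).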


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGACGT GGGCACGGAT CGTCGGTGCG ATGCTGGTCG CACTGGCGCT CGCGGTGACC 
GTCGCTGCGT GCGGCGGAGA CGACGACGGC GGTTCGACGT CGGCATCCAC CACATCGGGT
GGCGGCGGCG CGACCACCGG AGGGGCAAGA CAGGGGGGCA CCGCGACGGT GCTCATGGGC
ACAGCGCCCG ACTACCTCGA CCCGCAGCTG GCGTACACGA CCCAGGCGGT CGAGCCGCAC
TGGATCAGCT ACACGGGTCT CCTGACCTAC CGGCACGAGG AGGGCCAGGC CGGCACGGAG
CTGATCCCGG GCCTCGCCGA CGCGCTGCCG AGAATCAGCC AGGACGGCAG AACGTACGAC
TTCACGCTGC GCAGAGACCT CAGATACTCC GACGGCACGC CCGTCAGAGC GACTGACTTC
CCGTACTCCG TCGAGCGCAT GATCAAGATC CCGTGGGGCG GCAGATCGTT CGTCACCAAC
TACGTCGTCG GCGCGCAGGA GTTCGACGAA GGCAGAGCCA GAAGAGTCTC CGGCATCACT
GCCAGCGACG CGACCGGCAG AATCACGATC AGACTGAGAG AGGCGTACGG CGCGTTCTCC
AACGTGCTCG CGTTCCCGGC GCTGGCGCTC GTCCCGAGCG GCACGCCGAT CAGAAACCTG
TCGGCCGATC CGCCTCCCGG CGTCGGCGCG TACATGCTGA CCGACGTCGT CCCGAACCGT
TCGTTCACGG TCAGAAGAAA CCCGGCGTTC GCCGCGTTCA GAATCCCCGA CATCCCGCTC
GGCAACCTCG ACGCGATAAA CGTGAGAATC GTGTCGAACA CGAACTCCGA GGCGCAGCAG
GTGCTCAACA ACCAGGCCGA CATCTTCGAC CCGGGTGACA CGCTGCCGCC GGCGCTGCTG
CCGCAGATCG AGAGCCAGGC GAGAGATCGC TTCGCGCGCA AGCCGGTGCC GTCGACGTTC
TACTTCTTCC TCAACACGAC CAAGCCGCCG TTCGACAACC AGAAGGCGCG CGAAGCCGTC
AACATGGCGC TCGACCGCGA CGCGCTCGTG CGGCTCTCCA GCGGGTTCTT CTCGCCGAGC
TGCTTCTTCA TCCCCGAGGG GATCGTCGGC CACCCCGACG CGGAGTGCCC GTACGGCGAC
ACGCCGGACA TCGAGGGCGC GCGCAGAATC ATCAGCGACG AGGGCCTCGA AGGCACCAGA
GTCGTCGTCT GGGGCCAGGA GCGCAGCCCG CGCAAGGAGT ACGTCGACTA CTACACCGAC
ATGCTCAACA GAATCGGCTT CGACGCGCAG CAGAGAATCA TCGCCGACAC CGTCTACTTC
CCGACGATCG GCAACGAGAG AACGGATCCG CAGACGGGCT TCGCGAACTG GCTGCAGGAC
TTCCCGAACC CGTCGGACTT CTATCTGCTG CTCGACGCCC GTTCGATCCA GCCGACCAAC
AACCAGAACT TCTCGAAGGT CGACGACCCG CACATCCAGA GAGAGCTTCT GAGACTCAAC
GCCGTCCCCG CGACCGAGCT GGACAGCGTC GCCGACGAAT GGAGAGCACT CGACGAGTAC
ACCGCGCAGA GAGCGTACAA CGCGGTGTTC GGCTCGATCA GCGTGCCGAA GTTCTTCTCC
GACAAGCTGA ACTTCGACTC GGTGTTCCAC CCGTTGTACT TCAACGACTG GTCCACCCTC
TCGCTGAAGT AG
 
Protein sequence
MRTWARIVGA MLVALALAVT VAACGGDDDG GSTSASTTSG GGGATTGGAR QGGTATVLMG 
TAPDYLDPQL AYTTQAVEPH WISYTGLLTY RHEEGQAGTE LIPGLADALP RISQDGRTYD
FTLRRDLRYS DGTPVRATDF PYSVERMIKI PWGGRSFVTN YVVGAQEFDE GRARRVSGIT
ASDATGRITI RLREAYGAFS NVLAFPALAL VPSGTPIRNL SADPPPGVGA YMLTDVVPNR
SFTVRRNPAF AAFRIPDIPL GNLDAINVRI VSNTNSEAQQ VLNNQADIFD PGDTLPPALL
PQIESQARDR FARKPVPSTF YFFLNTTKPP FDNQKAREAV NMALDRDALV RLSSGFFSPS
CFFIPEGIVG HPDAECPYGD TPDIEGARRI ISDEGLEGTR VVVWGQERSP RKEYVDYYTD
MLNRIGFDAQ QRIIADTVYF PTIGNERTDP QTGFANWLQD FPNPSDFYLL LDARSIQPTN
NQNFSKVDDP HIQRELLRLN AVPATELDSV ADEWRALDEY TAQRAYNAVF GSISVPKFFS
DKLNFDSVFH PLYFNDWSTL SLK
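
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used here, so it does not reproduce the GC content figure for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Join a blocked sequence listing by removing spaces and line breaks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGAGGACGT GGGCACGGAT CGTCGGTGCG ATGCTGGTCG CACTGGCGCT CGCGGTGACC"
last_line = "TCGCTGAAGT AG"

# Stop codons used by translation table 11 (the table named in the record).
STOP_CODONS = {"TAA", "TAG", "TGA"}

starts_with_atg = clean_sequence(first_line).startswith("ATG")
ends_with_stop = clean_sequence(last_line)[-3:] in STOP_CODONS

print(starts_with_atg, ends_with_stop)
```

To apply this to the full gene, pass the complete multi-line sequence text to `gc_fraction`; the result can then be compared with the GC content field in the record.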