Gene Cwoe_2114 details


Gene Information

Locus tag: Cwoe_2114
Symbol:
ID: 8732557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2219527
End bp: 2221143
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 71%
IMG OID: 646502732
Product: extracellular solute-binding protein family 5
Protein accession: YP_003393914
Protein GI: 284043574
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
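
The coordinates and lengths in the table above are consistent with each other. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2219527
END_BP = 2221143


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1617 538
```

Both results match the Gene Length (1617 bp) and Protein Length (538 aa) fields.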


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.59472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGAGGATC GGAACCTGAC GCGGCGCGAG TTCGCGGCGC GAGGCGCAGC CCTGGCGCTC 
ATCGGACTCG TGCCGGGCGC GCTGGCCGCC TGCGGCGGCG GCAGCAGCGG CGATGGTGGA
GCGGGCGACT CGCTCGACAT GATCGTCGGC GAGCTGCCGC TGCAGCTGAC CGGCACCCGG
CAGCTGATCC AGGGCGCCGG CGCGCTACTG ATCGCGCTCG AGCCGCTCGT CGTCGCCGAC
CCCAGGGGGC GGATCGAGCC GAGCCTCGCG CGGCGCTTCG AGACGCCCGA CCCGCGCACG
TTCGTCTTCG ACGTCCGGCC GGGCGTCAGA TTCTGGGACG GCGAGCCGCT GACGGCCGCG
GACGTCGCCT ACTCGCTGGA GCTGCATCGC ACCGACGTGG GCTCGATCCT GAACCGCTGG
TGGGCGCTCG CCGAGCGCGT CGAGACGGAC GGCTCCGACC GCGTCGTCGT CACGCTCGAG
CGGCCCTATG CCGGCTTCGT CTACGCCGTC GCCCAGACGC CGATCGTCCA GCGCGCCTAC
AACGAGCGGC ACAAGCAGGC GATCGGCACG CCGAAGGCGC TCAACATGGG CACCGGGGCG
TGGAAGTTCG AGCGCTTCGA CCCCGACAAG AGCATCGAGC TGGTCGCCAA CGACGACTAC
TGGGGGGAGA AGCCGCGTTA CCGCCGCATC ACCACCCGCA TGGTCGCCGA CCCGTCGACC
GCCGCGCTGT CGATCAAGAC GGGGGAGGTG ACCGGCAACC TCGCGGTGCC GGTCACCGAC
ACCTCGCACT ACGAGAAGCT CGGCGGGGTG CGGGTCGAGC AGCGGCCCAG CACCGCGGTC
TGCGTCATCA CGCTCAACAC GCTGTACGCG CCGTGGGACG ACGTCAACGT GCGCCGCGCG
ATGCAGCACG CCGTCAACCG GCAGGCGTGC GTCGAGGGCG CGCTCGGCGG CCACGGCCAG
CCGGAGCTGT CGATCGTCAG CGAGGCGGCG CTGCGCGAGG TGATGCCGGC CGAGGACGCG
AGCGCGCTGC TCGCCGAGAT CGAGCCGCTC GTCGAGTTCG ACCTCGACAA GGCGCGGGCC
GCGCTGGCGG AGTCGGCCCA CCCAGACGGC TTCGAGGTCG GCACCGTGAT CGACGGCGAG
ACCGAGATCG TGCGCACGCT GGAGCTGATC AAGCAGGATC TCGCCGAGAT CGGCATCACG
CTGAAGATCA CCCAGGCGCC GTCGTCGGTC TACGAGGAGC AGTACGGCAG CAGCAGATAC
TCGCTCGGCT CGTACACGGT CACACCGGAC AGCGGCGACC CGCTCACCAA CCTCGTCGGC
GGCGCCTTCG ACAAGGCCGG CGTCACCGAG ACCGGCGGCA ACGGCCCCAA CGCGACCAAC
TTCACGTCGC CCGAGCTGCA GCGGCTGCTC GAGCAGCTGC GCGCGACGCC GCTGAGCGAC
AGAGCGCGCC GGGCCGAGCT GTGCGCCGAG ATGGTGCGCT ACAACGCGCG TGAGGCGCTC
TACGTCGGTG TCTGGTCGCC GCGGGCGGTG CTCGCGATCA ACGACGCGTA CAGATACCCG
GCATTCAACG AGCTGTGGTG GCAGACGCGC TGGCCGGACC AGATCGAGCG GAGCTGA
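
The GC content field can be recomputed from a sequence like the one above. This is a minimal sketch, assuming GC content is the fraction of G and C among unambiguous bases; `gc_fraction` is a hypothetical helper, and the example uses only the first 60 bases of the gene, so its value differs from the whole-gene figure of 71%.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A, C, G, T; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    return sum(b in "GC" for b in bases) / len(bases)


# First line of the gene sequence, with the spacing used on this page.
first_line = "GTGGAGGATC GGAACCTGAC GCGGCGCGAG TTCGCGGCGC GAGGCGCAGC CCTGGCGCTC"
print(gc_fraction(first_line))  # 0.75 (45 of 60 bases)
```

Passing the full gene sequence as a single string gives the whole-gene value.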
 
Protein sequence
MEDRNLTRRE FAARGAALAL IGLVPGALAA CGGGSSGDGG AGDSLDMIVG ELPLQLTGTR 
QLIQGAGALL IALEPLVVAD PRGRIEPSLA RRFETPDPRT FVFDVRPGVR FWDGEPLTAA
DVAYSLELHR TDVGSILNRW WALAERVETD GSDRVVVTLE RPYAGFVYAV AQTPIVQRAY
NERHKQAIGT PKALNMGTGA WKFERFDPDK SIELVANDDY WGEKPRYRRI TTRMVADPST
AALSIKTGEV TGNLAVPVTD TSHYEKLGGV RVEQRPSTAV CVITLNTLYA PWDDVNVRRA
MQHAVNRQAC VEGALGGHGQ PELSIVSEAA LREVMPAEDA SALLAEIEPL VEFDLDKARA
ALAESAHPDG FEVGTVIDGE TEIVRTLELI KQDLAEIGIT LKITQAPSSV YEEQYGSSRY
SLGSYTVTPD SGDPLTNLVG GAFDKAGVTE TGGNGPNATN FTSPELQRLL EQLRATPLSD
RARRAELCAE MVRYNAREAL YVGVWSPRAV LAINDAYRYP AFNELWWQTR WPDQIERS
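
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is illustrative and not the database's own pipeline. It assumes that the table 11 codon assignments match the standard code, and that a GTG, TTG or ATG start codon is read as methionine; table 11 permits further alternative start codons that are omitted here. The names `CODON_TABLE`, `START_CODONS` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed shared by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}
# Subset of start codons handled by this sketch (assumption).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    nt = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(nt) - 2, 3):
        codon = nt[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon at position 0 is read as methionine.
        if i == 0 and is_cds_start and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence as printed on this page.
first_line = "GTGGAGGATC GGAACCTGAC GCGGCGCGAG TTCGCGGCGC GAGGCGCAGC CCTGGCGCTC"
last_line = "GCATTCAACG AGCTGTGGTG GCAGACGCGC TGGCCGGACC AGATCGAGCG GAGCTGA"
print(translate(first_line))                      # MEDRNLTRREFAARGAALAL
print(translate(last_line, is_cds_start=False))   # AFNELWWQTRWPDQIERS
```

The two outputs match the first 20 and last 18 residues of the protein sequence. The last line can be translated on its own because the 26 preceding lines hold 60 bases each, so it begins in frame.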