Gene Information |
Locus tag | Cwoe_2114 |
Symbol | |
ID | 8732557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2219527 |
End bp | 2221143 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 646502732 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003393914 |
Protein GI | 284043574 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
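The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes a single terminal stop codon. Neither convention is stated in the record itself, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2219527
END_BP = 2221143
GENE_LENGTH_BP = 1617
PROTEIN_LENGTH_AA = 538


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


computed_bp = span_length(START_BP, END_BP)
computed_aa = coding_length_aa(computed_bp)
print(computed_bp == GENE_LENGTH_BP, computed_aa == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 2221143 - 2219527 + 1 = 1617 bp, and 1617 / 3 - 1 = 538 aa.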
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.59472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGGATC GGAACCTGAC GCGGCGCGAG TTCGCGGCGC GAGGCGCAGC CCTGGCGCTC ATCGGACTCG TGCCGGGCGC GCTGGCCGCC TGCGGCGGCG GCAGCAGCGG CGATGGTGGA GCGGGCGACT CGCTCGACAT GATCGTCGGC GAGCTGCCGC TGCAGCTGAC CGGCACCCGG CAGCTGATCC AGGGCGCCGG CGCGCTACTG ATCGCGCTCG AGCCGCTCGT CGTCGCCGAC CCCAGGGGGC GGATCGAGCC GAGCCTCGCG CGGCGCTTCG AGACGCCCGA CCCGCGCACG TTCGTCTTCG ACGTCCGGCC GGGCGTCAGA TTCTGGGACG GCGAGCCGCT GACGGCCGCG GACGTCGCCT ACTCGCTGGA GCTGCATCGC ACCGACGTGG GCTCGATCCT GAACCGCTGG TGGGCGCTCG CCGAGCGCGT CGAGACGGAC GGCTCCGACC GCGTCGTCGT CACGCTCGAG CGGCCCTATG CCGGCTTCGT CTACGCCGTC GCCCAGACGC CGATCGTCCA GCGCGCCTAC AACGAGCGGC ACAAGCAGGC GATCGGCACG CCGAAGGCGC TCAACATGGG CACCGGGGCG TGGAAGTTCG AGCGCTTCGA CCCCGACAAG AGCATCGAGC TGGTCGCCAA CGACGACTAC TGGGGGGAGA AGCCGCGTTA CCGCCGCATC ACCACCCGCA TGGTCGCCGA CCCGTCGACC GCCGCGCTGT CGATCAAGAC GGGGGAGGTG ACCGGCAACC TCGCGGTGCC GGTCACCGAC ACCTCGCACT ACGAGAAGCT CGGCGGGGTG CGGGTCGAGC AGCGGCCCAG CACCGCGGTC TGCGTCATCA CGCTCAACAC GCTGTACGCG CCGTGGGACG ACGTCAACGT GCGCCGCGCG ATGCAGCACG CCGTCAACCG GCAGGCGTGC GTCGAGGGCG CGCTCGGCGG CCACGGCCAG CCGGAGCTGT CGATCGTCAG CGAGGCGGCG CTGCGCGAGG TGATGCCGGC CGAGGACGCG AGCGCGCTGC TCGCCGAGAT CGAGCCGCTC GTCGAGTTCG ACCTCGACAA GGCGCGGGCC GCGCTGGCGG AGTCGGCCCA CCCAGACGGC TTCGAGGTCG GCACCGTGAT CGACGGCGAG ACCGAGATCG TGCGCACGCT GGAGCTGATC AAGCAGGATC TCGCCGAGAT CGGCATCACG CTGAAGATCA CCCAGGCGCC GTCGTCGGTC TACGAGGAGC AGTACGGCAG CAGCAGATAC TCGCTCGGCT CGTACACGGT CACACCGGAC AGCGGCGACC CGCTCACCAA CCTCGTCGGC GGCGCCTTCG ACAAGGCCGG CGTCACCGAG ACCGGCGGCA ACGGCCCCAA CGCGACCAAC TTCACGTCGC CCGAGCTGCA GCGGCTGCTC GAGCAGCTGC GCGCGACGCC GCTGAGCGAC AGAGCGCGCC GGGCCGAGCT GTGCGCCGAG ATGGTGCGCT ACAACGCGCG TGAGGCGCTC TACGTCGGTG TCTGGTCGCC GCGGGCGGTG CTCGCGATCA ACGACGCGTA CAGATACCCG GCATTCAACG AGCTGTGGTG GCAGACGCGC TGGCCGGACC AGATCGAGCG GAGCTGA
Protein sequence | MEDRNLTRRE FAARGAALAL IGLVPGALAA CGGGSSGDGG AGDSLDMIVG ELPLQLTGTR QLIQGAGALL IALEPLVVAD PRGRIEPSLA RRFETPDPRT FVFDVRPGVR FWDGEPLTAA DVAYSLELHR TDVGSILNRW WALAERVETD GSDRVVVTLE RPYAGFVYAV AQTPIVQRAY NERHKQAIGT PKALNMGTGA WKFERFDPDK SIELVANDDY WGEKPRYRRI TTRMVADPST AALSIKTGEV TGNLAVPVTD TSHYEKLGGV RVEQRPSTAV CVITLNTLYA PWDDVNVRRA MQHAVNRQAC VEGALGGHGQ PELSIVSEAA LREVMPAEDA SALLAEIEPL VEFDLDKARA ALAESAHPDG FEVGTVIDGE TEIVRTLELI KQDLAEIGIT LKITQAPSSV YEEQYGSSRY SLGSYTVTPD SGDPLTNLVG GAFDKAGVTE TGGNGPNATN FTSPELQRLL EQLRATPLSD RARRAELCAE MVRYNAREAL YVGVWSPRAV LAINDAYRYP AFNELWWQTR WPDQIERS
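The gene starts with GTG while the protein starts with M, which is what translation table 11 (listed above) would give if GTG is treated as an initiator codon. The sketch below illustrates this on the first 30 bases of the gene sequence only. The codon table string and the start codon set are written from memory of the NCBI table 11 definition and should be treated as assumptions. The helper names `translate_cds` and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the codon order TTT, TTC, TTA, TTG, TCT, ... ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumed initiator codons for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = "".join(dna.split()).upper()
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence in this record.
FIRST_30_BP = "GTGGAGGATC GGAACCTGAC GCGGCGCGAG"
print(translate_cds(FIRST_30_BP))  # MEDRNLTRRE
print(round(gc_fraction(FIRST_30_BP), 2))  # 0.7
```

The ten residues produced match the first block of the protein sequence. Passing the full gene sequence to the same functions would be one way to compare against the listed protein sequence and GC content fields.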