Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5116 |
Symbol | |
ID | 8735582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5471427 |
End bp | 5473025 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646505741 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003396900 |
Protein GI | 284046560 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.682235 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAGAC GCAGCTCGCG TGCCGCCGGC GCGCTCCTCG CGCTCCTCGC CTGCCTCCTG CTCGCAGCCT GCGGCGGCAG CGGATCCGAC TCGGGTTCGG GCGGCGGCTC CGGCGACGGC TATGGCAGCG CGGCCGGCAG TGGCGGCTCC TCGCCGGACG ACGGCGGCAG CGCCGACGGC GGCGGCAGCG TCACCGTCGG CGCCGTGACG GGGATCCCGC AGCTGGACCC GTACAAGCTG GTCTCGCCGA TGGAGGCCTC GCTGATGCAC ACGCTCTGGT CCTCGCTCGT CAAGCACGAC GCCGACGGCG AGATCGTCGG AGAGCTGGCC GAGTCGTGGG ACGTCTCCGA CGACGGCCGC ACCTACACGT TCAGACTCGT CGAGGACGCG ACCTTCGCCG ACGGCAAGCC GATCGACGCG AGCGTCGTCG CGGCGAACCT GAAGCGCGCG ACCGATCCGA GAACCGCCTG GGTGTTCGGC TCCTACATCC CGAGACTGGC GAGAATCGAG GCGGTCGACG CGACGACGTT GAAGCTGACC CTCGCGAGAC CGGCGAGCAC GCTGCTGGGC GCGCTGACGC TGGCGATGGT CGCCGACCCC GACAACCTCA GAGCGATCAA CAGACGGCCG AATGCCTCCG GCCCGTTCGA GCTGGACCGC TTCAACGCGA ACGAGTCGGT CGTGCTCAGC AGACGCGACG ACTTCTGGGG CGAGCCGGCG GCGGTCGGGA CGCTCGAGTT CACCCGCGCG CGGGACACGA CCGCGGCCGT CACCGCCCTG CGCACCGGCG ACCTCGACGC GCTCTTCCAG GTGCCGTGGG CCGACGTCGA GAGCCTCCAG GACGCGGGCA TCTCGGTCGA GGTCTCTCCG CGGCCCGGCG ACGCGACGAT CCTGATGCCG GACAACACCT CCAAGCCGTT CGACGACGTA CGCGCCCGCC GCGCGCTGTC GCTCGCGACC AATCGCGAGG CGATCGTCGC GACGGCGTTC GCCGGCAAGA CCGAGGTCGC GACCGCCAAC GTGCCGCTGT CGAAGACGAG CCCGTGGTTC GATGCGGACC TGCCACAGAC GCGCTTCGAC CTCGACGAGG CCAAGCGCCT GTTCGACGAG GTCGGCGTCG AGCGGCTGAC CTACTGGGCC CCCTCGGAGG GCTATCCCGA GTTCGCCGCG ATGGGCCAGA TCCTGCGCTC GGACCTGGCG AGAATCGGGA TCGAGCTGAA GATCGAGTCG GTCGAGCTGA ACGCCTGGCT GGCGAAGTTC GCGCCCGCCG GCAAGAAGTG GCCGGACACG ATCATCCCGA CGGTCTACGT CGCGCCGCAC AACCCTGGGA TCTTCCTCGC GCAGTGGTTC CCCGGCATCT GCGAGTGCAA CTTCGACGAC CCCAGATACG TCGCCGCCGT CGAAGCCGGC GTCGCCGCGA CCGACGAGGC CGCCGCGAGA GCCAGCTTCG CGGAGGCGCA GCGGATCTTC GCCGAGCAGG TGCCGGTCAG CGTCGCGACG ATGATGAGCT TCCCGGTCGC GGTCCGCGAC GACGTCTCGG GGATCTTCCT CGACGAGACC GGGTACGGGC GCTTCGAGCA GGTGACGGTC GGTGACTAG
|
Protein sequence | MTRRSSRAAG ALLALLACLL LAACGGSGSD SGSGGGSGDG YGSAAGSGGS SPDDGGSADG GGSVTVGAVT GIPQLDPYKL VSPMEASLMH TLWSSLVKHD ADGEIVGELA ESWDVSDDGR TYTFRLVEDA TFADGKPIDA SVVAANLKRA TDPRTAWVFG SYIPRLARIE AVDATTLKLT LARPASTLLG ALTLAMVADP DNLRAINRRP NASGPFELDR FNANESVVLS RRDDFWGEPA AVGTLEFTRA RDTTAAVTAL RTGDLDALFQ VPWADVESLQ DAGISVEVSP RPGDATILMP DNTSKPFDDV RARRALSLAT NREAIVATAF AGKTEVATAN VPLSKTSPWF DADLPQTRFD LDEAKRLFDE VGVERLTYWA PSEGYPEFAA MGQILRSDLA RIGIELKIES VELNAWLAKF APAGKKWPDT IIPTVYVAPH NPGIFLAQWF PGICECNFDD PRYVAAVEAG VAATDEAAAR ASFAEAQRIF AEQVPVSVAT MMSFPVAVRD DVSGIFLDET GYGRFEQVTV GD
|
| |