Gene Cwoe_5116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5116 
Symbol 
ID8735582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5471427 
End bp5473025 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content70% 
IMG OID646505741 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003396900 
Protein GI284046560 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.682235 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAGAC GCAGCTCGCG TGCCGCCGGC GCGCTCCTCG CGCTCCTCGC CTGCCTCCTG 
CTCGCAGCCT GCGGCGGCAG CGGATCCGAC TCGGGTTCGG GCGGCGGCTC CGGCGACGGC
TATGGCAGCG CGGCCGGCAG TGGCGGCTCC TCGCCGGACG ACGGCGGCAG CGCCGACGGC
GGCGGCAGCG TCACCGTCGG CGCCGTGACG GGGATCCCGC AGCTGGACCC GTACAAGCTG
GTCTCGCCGA TGGAGGCCTC GCTGATGCAC ACGCTCTGGT CCTCGCTCGT CAAGCACGAC
GCCGACGGCG AGATCGTCGG AGAGCTGGCC GAGTCGTGGG ACGTCTCCGA CGACGGCCGC
ACCTACACGT TCAGACTCGT CGAGGACGCG ACCTTCGCCG ACGGCAAGCC GATCGACGCG
AGCGTCGTCG CGGCGAACCT GAAGCGCGCG ACCGATCCGA GAACCGCCTG GGTGTTCGGC
TCCTACATCC CGAGACTGGC GAGAATCGAG GCGGTCGACG CGACGACGTT GAAGCTGACC
CTCGCGAGAC CGGCGAGCAC GCTGCTGGGC GCGCTGACGC TGGCGATGGT CGCCGACCCC
GACAACCTCA GAGCGATCAA CAGACGGCCG AATGCCTCCG GCCCGTTCGA GCTGGACCGC
TTCAACGCGA ACGAGTCGGT CGTGCTCAGC AGACGCGACG ACTTCTGGGG CGAGCCGGCG
GCGGTCGGGA CGCTCGAGTT CACCCGCGCG CGGGACACGA CCGCGGCCGT CACCGCCCTG
CGCACCGGCG ACCTCGACGC GCTCTTCCAG GTGCCGTGGG CCGACGTCGA GAGCCTCCAG
GACGCGGGCA TCTCGGTCGA GGTCTCTCCG CGGCCCGGCG ACGCGACGAT CCTGATGCCG
GACAACACCT CCAAGCCGTT CGACGACGTA CGCGCCCGCC GCGCGCTGTC GCTCGCGACC
AATCGCGAGG CGATCGTCGC GACGGCGTTC GCCGGCAAGA CCGAGGTCGC GACCGCCAAC
GTGCCGCTGT CGAAGACGAG CCCGTGGTTC GATGCGGACC TGCCACAGAC GCGCTTCGAC
CTCGACGAGG CCAAGCGCCT GTTCGACGAG GTCGGCGTCG AGCGGCTGAC CTACTGGGCC
CCCTCGGAGG GCTATCCCGA GTTCGCCGCG ATGGGCCAGA TCCTGCGCTC GGACCTGGCG
AGAATCGGGA TCGAGCTGAA GATCGAGTCG GTCGAGCTGA ACGCCTGGCT GGCGAAGTTC
GCGCCCGCCG GCAAGAAGTG GCCGGACACG ATCATCCCGA CGGTCTACGT CGCGCCGCAC
AACCCTGGGA TCTTCCTCGC GCAGTGGTTC CCCGGCATCT GCGAGTGCAA CTTCGACGAC
CCCAGATACG TCGCCGCCGT CGAAGCCGGC GTCGCCGCGA CCGACGAGGC CGCCGCGAGA
GCCAGCTTCG CGGAGGCGCA GCGGATCTTC GCCGAGCAGG TGCCGGTCAG CGTCGCGACG
ATGATGAGCT TCCCGGTCGC GGTCCGCGAC GACGTCTCGG GGATCTTCCT CGACGAGACC
GGGTACGGGC GCTTCGAGCA GGTGACGGTC GGTGACTAG
 
Protein sequence
MTRRSSRAAG ALLALLACLL LAACGGSGSD SGSGGGSGDG YGSAAGSGGS SPDDGGSADG 
GGSVTVGAVT GIPQLDPYKL VSPMEASLMH TLWSSLVKHD ADGEIVGELA ESWDVSDDGR
TYTFRLVEDA TFADGKPIDA SVVAANLKRA TDPRTAWVFG SYIPRLARIE AVDATTLKLT
LARPASTLLG ALTLAMVADP DNLRAINRRP NASGPFELDR FNANESVVLS RRDDFWGEPA
AVGTLEFTRA RDTTAAVTAL RTGDLDALFQ VPWADVESLQ DAGISVEVSP RPGDATILMP
DNTSKPFDDV RARRALSLAT NREAIVATAF AGKTEVATAN VPLSKTSPWF DADLPQTRFD
LDEAKRLFDE VGVERLTYWA PSEGYPEFAA MGQILRSDLA RIGIELKIES VELNAWLAKF
APAGKKWPDT IIPTVYVAPH NPGIFLAQWF PGICECNFDD PRYVAAVEAG VAATDEAAAR
ASFAEAQRIF AEQVPVSVAT MMSFPVAVRD DVSGIFLDET GYGRFEQVTV GD