Gene Cwoe_5044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5044 
Symbol 
ID8735510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5376910 
End bp5378610 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content69% 
IMG OID646505671 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003396830 
Protein GI284046490 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.449208 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTTGCCA AATCCATCAA GCGCAATCTG ATGCGTCGCC GCACAGCCAT CACGGCCGGC 
GTCCTCGCAC TGGGGGTCGC CGCTGCAGGC TGTGGCGCCG ATGCGCCGAG ATCGTCGGAC
ACCGGCGCTT CCACGGGGGC CGGCACGGCG GCGCCCGCCG TCGGCGCCGC GGCGCAGACG
GTCTACACGA CGACCGCCGC CAGAGGCGAG GTCGATTCGT TCACCTGGAA CCTGCCGAAC
GGTGAGCCGG CAAGCCTCGA CTGGGCTAGA GCGTACGACT CCTCGCCGAA TCAGGTCCTC
TCGAACATGT GCGAGAGCCT GATGCGTCAG CAGCCTGACT TCAGCATCGT CCCCGGCCTC
GCCGAGTCGT TCGAGCAGGC CGACGACAGA ACGCTCGTCT ACAAGCTCCG CTCCGGCGTC
AGATTCTGGG ACGGCAGAGC GATGACGGCC GACGACGTCG TGTTCAGCCT CTCGCGCCAC
ATGGACCCCG ACCAGGGCTC GTTCTGGTCG ACGCCGTTCT ACTCCAACGT CAGATCGATC
GAGAAGACCG GTGACCTCGA GGTCACCGTC AAGCTCAAGC GTCCGGACGC CGTCTTCAAC
CGCATGATGG CGACCCCCGC GGGCGTCGTC GGCCAGCAGG CGTTCGTCGA GGCGAGAGGC
CGCCGCTACG GCACGCCCAA CGGCGGCGTC ATGTGCACCG GCCCGTTCCA GCTCGACAGC
TGGAAGCCCG GCTCGAGCGT CGCGCTGAAG CGCAACGACG CCTACTGGGA CGCCGAGCAC
AAGGCGAAGG CCGGCGCGAT GACGTTCAGA TTCGTGACCG ACGAGTCGAC GATGATCGGC
GGCCTGCAGT CCGGCGAGCT GGACGGCACG TTCCAGGTCC CGCCGGCCGG CGTCTCGCAG
CTCAGAAGCG CCTCCGGCAC GCTCACGTTC GGCGCCTCGA CCGAGTGGTT CGCGTTCCGC
CCGACGGAGA AGGACGGTCC GCTGAAGGAC CCCCGCGTGA TGAGAGCGCT GTCGCTCGTG
CTCGACCGCG ACTCGATCGC CAGAGTCGTC TTCGGTGGCG CCGCCGTCGC CGCCGGCACG
CCGATCCAGC CCGGCGCCTA CGGCTACGCC AGAGAGGTCT TCGCGGCCGC GGCCGAGCAG
CTGCCCGCCC CGACGCCCGA CCCGGACGCC GCGAAGGCGC TCCTCGCCGA GGCCGGCGCC
GCGGCGAGAC AGCCGATCGT CGTCGCCGTC CCCGCCGACG TGCGGACGTA CAACCAGGCG
GCCCAGACGC TGCAGGACGC CGCGCGCCAG ATCGGTCTCG AGGTGAAGGT CGAGTCGATC
TCGACCGCGC AGTTCACCAA CCTGTACTTC GACAAGGGCG CGCGCGCCCC GTACGACCTC
TTCGCCGTGC AGCAGTACGG TGCCGGCGTC GCGGAGCCGC TGATCTCGCT GAGCGAGTTC
ACGCCCCTCT CGGCCTACAA CTACGGCCAG CTGAGAGACC CGGTCGTGAC GAGATCGGTC
GAGCAGGGCC TGGCGACCTA CGACGACGAG AAGCGCGCCG AGCTGGCGAC CAGAGCCGAG
AAGGCGCTCG TCGACGCTCC CGGCCTGATA CCGGTCGTCA ACCTGCTCAC GTCGGTCTAC
CAAGGGCCGA AGATCACCGG CTCGGTCGCG TCGCTGGCCT ACCTCTACTA CCCGTGGGCG
GCTGACGTAG GCGCGCCATG A
 
Protein sequence
MVAKSIKRNL MRRRTAITAG VLALGVAAAG CGADAPRSSD TGASTGAGTA APAVGAAAQT 
VYTTTAARGE VDSFTWNLPN GEPASLDWAR AYDSSPNQVL SNMCESLMRQ QPDFSIVPGL
AESFEQADDR TLVYKLRSGV RFWDGRAMTA DDVVFSLSRH MDPDQGSFWS TPFYSNVRSI
EKTGDLEVTV KLKRPDAVFN RMMATPAGVV GQQAFVEARG RRYGTPNGGV MCTGPFQLDS
WKPGSSVALK RNDAYWDAEH KAKAGAMTFR FVTDESTMIG GLQSGELDGT FQVPPAGVSQ
LRSASGTLTF GASTEWFAFR PTEKDGPLKD PRVMRALSLV LDRDSIARVV FGGAAVAAGT
PIQPGAYGYA REVFAAAAEQ LPAPTPDPDA AKALLAEAGA AARQPIVVAV PADVRTYNQA
AQTLQDAARQ IGLEVKVESI STAQFTNLYF DKGARAPYDL FAVQQYGAGV AEPLISLSEF
TPLSAYNYGQ LRDPVVTRSV EQGLATYDDE KRAELATRAE KALVDAPGLI PVVNLLTSVY
QGPKITGSVA SLAYLYYPWA ADVGAP