Gene Cwoe_5016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5016 
Symbol 
ID8735482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5346082 
End bp5347509 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content71% 
IMG OID646505643 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003396802 
Protein GI284046462 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.131281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGCAC GGGAGCAGAG AACGACGATC GCGATCGGCA ACGCCGAGCC GCCTACGTCC 
GAGCATTGGG ACCCCCATGC CGCGCTCGGG CTCGCCGACA ACCAGATCTG GTCGCTCGTC
TACGACACGC TGGTCTCCTA CGACGCCGAC GGCCGCGTCG TCGGCCGGCT CGCGCGCAAG
TGGTTCCGCA CCGCGCCGAC GCGCCTGCGC GTCGAGCTGC GGCCCGGCGT CGTCTTCTCC
GACGGCACGC CGCTGACCGC GCGCGACGTG CGCGCCAGCC TGGAGCGGAT CGGCGATCCG
CTGTCGCGCC TGGTGCTGTC CGGCAAGGTC GTCCCGGGCC TGCGCGTCGA GGTGCTCGGC
GACCACCTGC TGGAGATCGT CACGCCGAAG CCGTTCGGGC CGATCGAGCG CGCGCTCGCG
ATCGCCGCGA TCGTGCCCGT CGCCGACGCC GAGGACCCCG CGCGCTTCGC GCAGCGGCCG
CTCGGCAGCG GGCCGTTCGT GTTCGCCGGC TCCGACGGCG GCGTCGTGCG GCTGACCGCG
AACCCGGCGC ACTGGCGCGG CGCGCCCGGC GTCGCCGCGG TCGAGCTGCA CTACATCGAG
GATCCGCGCG AGCGGCTGGA GGCGCTGCTG TCCGGGCGCA TCGACATCCA CACGCGCGCC
AGCTCGATCG TGCTCGACGC GATCGCGGGC GACGACGCGT TCGACGTCAC GACGATGGGG
CCGGCGAGCC AGTTCATCTA CATCCCCCAG CACGACGGCC CCCTGCGTGA CACGCGCGTG
CGCCGCGCGA TCGCCCACGC GATCGACCGC CGCGCGATCG CCAAGCGGAT CCTCAAGCTC
GGCGAGCCGG CCGTCTCGGC GCTGCCGTCG TCGTCGATCG GCTTCCAGCC CGGCAGATCG
GAGCTGGAGT ACAACCCGGC AAAGGCGCGC GAGCTGCTCG CCGAGGCCGG CCACGCGGAC
GGCCTGCGGC TCTCGCTCGC CTCCGCCAAC CTCTTCGCCC ACCAGCTCGA GATCGACCAG
CTCGTCCGGC GCTCGCTCCA AGAGGTCGGG ATCGCCGTCG ACCTCGACGT GCTGGAGAGC
GGTCAGTTCC GCAGCTCCTT CAATCGCTAC GCCCTCTCGT TCAATGCGAT CGGCGCCACT
TCGCGCGACC CCGACCACCT GCTGACGTTC TTCCGGCCGG TCGTCGCGCA GGAGTCGATG
CACCTCGACG ACCACAAGAT CACCGAGCTG ATCGAGCGTG AGCGGCAGAC CTCCGGCAAG
GCCCGCCTCG CCGCCGTCAA CCAGGCGGCG CAGTACCTCT GGGAGAACCA AGTGATGATC
TACGTGTCCG ACGACGTCTG GTATACGGCC GTCAGTCGCC GTGTCCAGGG CTATCAGCGC
ACGCCGCTCG AAGGCGAGCC GCTGCTGTGG AAGGCGACGA AGGCGTAG
 
Protein sequence
MSAREQRTTI AIGNAEPPTS EHWDPHAALG LADNQIWSLV YDTLVSYDAD GRVVGRLARK 
WFRTAPTRLR VELRPGVVFS DGTPLTARDV RASLERIGDP LSRLVLSGKV VPGLRVEVLG
DHLLEIVTPK PFGPIERALA IAAIVPVADA EDPARFAQRP LGSGPFVFAG SDGGVVRLTA
NPAHWRGAPG VAAVELHYIE DPRERLEALL SGRIDIHTRA SSIVLDAIAG DDAFDVTTMG
PASQFIYIPQ HDGPLRDTRV RRAIAHAIDR RAIAKRILKL GEPAVSALPS SSIGFQPGRS
ELEYNPAKAR ELLAEAGHAD GLRLSLASAN LFAHQLEIDQ LVRRSLQEVG IAVDLDVLES
GQFRSSFNRY ALSFNAIGAT SRDPDHLLTF FRPVVAQESM HLDDHKITEL IERERQTSGK
ARLAAVNQAA QYLWENQVMI YVSDDVWYTA VSRRVQGYQR TPLEGEPLLW KATKA