Gene Cwoe_2177 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2177 
Symbol 
ID8732620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2287866 
End bp2289530 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content70% 
IMG OID646502795 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003393977 
Protein GI284043637 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGAAGAAGC CACGTCGCGC CGACGCCGGC GCGCGCTCGC TCGCGGCGAT CGCCTGCGCT 
TGCGCCGCCG CACTCACGGC CTGCGGGTCG AGCGACAGCT CCGGCAGCGC GGACGACCGT
TCGCGCACGC TGACGGCGTA CTTCGCCGCC AGTCCCGACT TCGTCGACCC GGCGCTGACA
TACACGGCGC GCGGTTGGAC GGCGCTGTGG ACGACGTACA CGCCGCTGCT CACCTACCGC
CACGAGGCCG GCCTCGCCGG CACCGAGCTG ATCCCCGGCC TGGCGGAGTC GCTGCCGCGC
ATCTCGCCCG ACGGCAAGAC ATACAGACTG CGACTGCGAA GAGGCCTGCG CTACTCGGAC
GGAACGCCCG CGAGAGCGTC CGACTTCGAG CACGCCATCA AGCGCGTGCT CAACCTCGGC
TCAGGCGGCA GCTGGGCCTA CCTGCCGATC GCCGGTGCGC CCGAGTACCT GAAGGCGCGC
AAGCCCGACG GCGACATCTC CGGCATCGAG ACCGACGACG CCGGCGGCGC GATCACGATC
CGCTTGACAC GATCGGTCGG CTCGTTCGCG GACGCGCTCG CGCTGCCGTT CGCCGCGCTG
GTGCCGAGCA CGACGCCGTT CGAGAACGCG ACCAGCGACC CGCCGCCCGG CATCGGCGCG
TTCAAGGTCG CCTCGTCGGT TCCCAACCGC AGCTTGACGC TCGTGCGCGA CAGATCGTTC
CGCGGCCTGC CCGGCGTCCC GGTCGCGAAG CTCGACCGCG TCGAGGTGAA GGTCGTCACC
AACCAGCACC GCCAGATCGA GGACGTGATC GACAACAAGG TCGACTGGGT CCAGAGCCCG
CCGCCGCCGG ACCAGATGAC GCTCGTCGAG CAGCGTGCGA AGGGGCGGTT CGAGCCGTTC
GTGCTCAACT CGCTCGCGTA CGTCTTCCTC AACCACCGCG TCGCGCCGTT CGACGACGAA
CGCGTGCGCA AGGCGGTCGC GTACGCCGTC GACCGGCGCG CGGTCGTCCG CCTGTTCGGC
GGCCTGCTGA GAACGACGTG CAACTTCCTG CCGCCGAACA TGCAGGGCTA CGAGCGGCTC
GACCCCTGTC CATACGGCGA CCCGGACGGC GCGCCCGACG TCGAGCGCGC GAAGGCGCTG
ATCAGGGAGG CCGGCGCCGA CGGCGCCGCG GTCACCGTCT GGGGCAGCAA CGACGCCGTC
GCGGCGCCGG TCACGCAGTA CCTCGCGGAC GCGTTCAACG CGATCGGCCT CGACGCGACG
CCGAAGCTCG TCGACGCCTC GACCTACCGC CAGACGGTTG GCAACGAGAA GACGCGTGCG
CAGGCCGGCT TCGGCCAGGT CGTCTCGGAC TTCCCACATC CCTCGAACCT GATGTCGCTC
GTCACCGGCA ACGGAATCCA GCCGACCAAC AGCCTCAACT GGGGCAACGT CGACAACCCG
CGGATCAACG CGCTCGCCGA CCGCGCCGAC GCGATGCCCG ACCTCGCCAG AGCCGCGCCC
GACTACGCGC AGATCGACCG GCTGCTGGTG GAGGACGCCG ACCTCGTGCC GCTCGGCAAC
TTCGAGCTGG CGAAGCTGGT CTCCGAGCGC GTCGACTTCG ACTCGTTCAT CGACCATCCC
GTGTTCTCCG GCGACCTCTC GCTGCTGACG TTCAGAGCGC CATGA
 
Protein sequence
MKKPRRADAG ARSLAAIACA CAAALTACGS SDSSGSADDR SRTLTAYFAA SPDFVDPALT 
YTARGWTALW TTYTPLLTYR HEAGLAGTEL IPGLAESLPR ISPDGKTYRL RLRRGLRYSD
GTPARASDFE HAIKRVLNLG SGGSWAYLPI AGAPEYLKAR KPDGDISGIE TDDAGGAITI
RLTRSVGSFA DALALPFAAL VPSTTPFENA TSDPPPGIGA FKVASSVPNR SLTLVRDRSF
RGLPGVPVAK LDRVEVKVVT NQHRQIEDVI DNKVDWVQSP PPPDQMTLVE QRAKGRFEPF
VLNSLAYVFL NHRVAPFDDE RVRKAVAYAV DRRAVVRLFG GLLRTTCNFL PPNMQGYERL
DPCPYGDPDG APDVERAKAL IREAGADGAA VTVWGSNDAV AAPVTQYLAD AFNAIGLDAT
PKLVDASTYR QTVGNEKTRA QAGFGQVVSD FPHPSNLMSL VTGNGIQPTN SLNWGNVDNP
RINALADRAD AMPDLARAAP DYAQIDRLLV EDADLVPLGN FELAKLVSER VDFDSFIDHP
VFSGDLSLLT FRAP