Gene Cwoe_4369 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4369 
Symbol 
ID8734831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4652484 
End bp4654094 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content70% 
IMG OID646504995 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003396158 
Protein GI284045818 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGCA CGACCCGTGC CGTGGTTCTC GCAGGAGCGC TCGGCGTCTT GGCGCTCAGC 
GGCTGCGGAA GCGGCGGGAA CGACTCGCCG ACCGGCACCC AGCCCGCCGA CGGCTCGGTG
CCCGCCACCA AGCCGGTGAG AGACGGTGGA ACGCTCCGCG TGGGGCTGAC CGCCGAGCCT
GACTACATCG ACCCGGCCCG GATGCAGTCG CTCGACTCGT GGCAGGTGCT CACGGCGATG
TGCGAGGGCC TCTACAAGAT CGGCGCGAGA GGGCAAGCGG TCCCGCAGCT CGCCGTCGGC
GCACCGCGGG TGTCCAAGGA CGGCCTGACC GCGACGATCA AGCTGCGCGA CGGCGTGCAG
TTCAACGACG GCACGCCGTT CGACGCGAGA GCGGTCAAGC TGTCGCTCGA GCGCAACGGC
AGAACGTCGG TCCTGTTCCA GGGCAACGGC ATCGAGCGGA TCGACGCGCC CGCCGACGAC
ACCGTCGTCC TGCACCTGGC CAGACCCTAT GCGCCGCTGG AAGGCGACCT CGCCGGCCCC
GGCGGGATGA TCGGCTCGCC GAAGCAGATC GCTGCGCTCG GCGACAAGTT CGGCGATCGC
CCGGTGTGCG TCGGCCCGTT CGAGTGGGTC AGCCGGCGCG GCGGCGACTC GATCAGGCTC
AGACGGTCCG ACGTCTACTA CGACAAGGAG AACGTCCACC TCGACGGGCT CGACTTCAAG
GTGATCCCGG ACACGAACGC CCGTGGCGTC AGCCTGCGCG CCGGTGAGAT CGACATCGCG
GCCGAGCCGC CGGAGCCCGG GGCGCTCAAG TCCGACTCCA ACCTCGACGT CACGACGATC
ACCGGCGCGG GCTGGAAGGG CTTGTACGTG AACGTCGGAA ACGTCGACGG CGCGGGCAAG
CCGCCCAAGC CGCGCGACAC GCCGTTGTCG ACGTCGGCCG AGGCGCGCCA GGCGCTGTCG
CTCGCGATCG ACCGCCAGGC GCTCATCAAC CTCACCAGCG GTGACGGGTC AGCGCCGGCG
TGCAGCGCGA TCCCGCCCAG CAGCCCGTTC TACGACGACC CGCCGTGCCC GCAGAAGGCG
GATCCCGACG CGGCGAGAGC GCTGCTCGAG AGAGCAGGCG TCAGAACGCC CGTCAAGGGC
ACGATGGTCG TCGCGGGCAG CCCTGAGGAG ACACGCACCG CGCAGGCCGT GCAGGGCATG
GCGCGCGACG CCGGCTTCGA CTTCGAGATC GAGACCTGCG ACGTCGCCAC CTGCATCAGA
CGACTGCTCG CGGGCGACTT CGACGTCACG CTGGGCGGCT TCGACGGTGT CGTCGATCCA
GACCAGAGCC TCAGTCCGTT CGTCGCGAGC ACCGGCGGCT TCAACTTCGT CGGCGAGTCC
GACGCGGAGC TCGACCGGCT GCTGGCAAGC GCGCGAGCCG AGTCGACCGA CGTCGATGCG
CGGCGCAAGC TGTACAGACA GGCGCTCGAC CGCATCCGCG AGCGCGCCGC GCTGATCGTC
TTCTACAACA CGGGCAGCTC GGCGGCGGCA CGCAAGAACG TCAGCGGATA CGTGCTGACG
CCCTCGGTCC TGCTGGACTA CAAGCAAGCC GGCTTCACCA CCGGTCCATG A
 
Protein sequence
MRSTTRAVVL AGALGVLALS GCGSGGNDSP TGTQPADGSV PATKPVRDGG TLRVGLTAEP 
DYIDPARMQS LDSWQVLTAM CEGLYKIGAR GQAVPQLAVG APRVSKDGLT ATIKLRDGVQ
FNDGTPFDAR AVKLSLERNG RTSVLFQGNG IERIDAPADD TVVLHLARPY APLEGDLAGP
GGMIGSPKQI AALGDKFGDR PVCVGPFEWV SRRGGDSIRL RRSDVYYDKE NVHLDGLDFK
VIPDTNARGV SLRAGEIDIA AEPPEPGALK SDSNLDVTTI TGAGWKGLYV NVGNVDGAGK
PPKPRDTPLS TSAEARQALS LAIDRQALIN LTSGDGSAPA CSAIPPSSPF YDDPPCPQKA
DPDAARALLE RAGVRTPVKG TMVVAGSPEE TRTAQAVQGM ARDAGFDFEI ETCDVATCIR
RLLAGDFDVT LGGFDGVVDP DQSLSPFVAS TGGFNFVGES DAELDRLLAS ARAESTDVDA
RRKLYRQALD RIRERAALIV FYNTGSSAAA RKNVSGYVLT PSVLLDYKQA GFTTGP