Gene Cwoe_5121 details

Gene Information

Locus tag: Cwoe_5121
Symbol:
ID: 8735587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 5476681
End bp: 5478249
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 70%
IMG OID: 646505746
Product: extracellular solute-binding protein family 5
Protein accession: YP_003396905
Protein GI: 284046565
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
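
The coordinates and lengths listed above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are inferred from the numbers and are not stated by the record; the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5476681, 5478249)
print(length)                  # 1569, matching "Gene Length"
print(protein_length(length))  # 522, matching "Protein Length"
```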


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.969979
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGGGTTC AGATTCGGCA CCTGGCGGTG CTCGCCGCCG GCTGCGCGGT GCTGGCCGGC 
TGCGGTGGCG GCGGCAGTAC GAGCGGTGGC GAGACCGGCG ACACGGCGGC TTCGACGCAG
GCGGCGAGAG CCGGCGGCAG ACTCGTCTAC GGCACCGCCG CGGGCATCTC GCAGCTGGAC
CCGCACACGC TTTCGGCCGC GCAGCAGCTG GTGGTGCAGC CGCTGCTGTT CAACGGCCTG
ACGAAGGCCG ATCCAAGCGG TGAGACCACA CCCGACCTGG CGGCCTCATG GAGAGCGTCC
GCCGACCAGA GAACGTGGAC CTTCACGCTC CGCGACGGTG TCAGATTCCA CGACGGCACG
CCGTTCGACG CGGCGGCCGC GAAGGCCAAC CTCGAGCGCG TGCTCGACCC CAGAGTCCCG
AATCCCGACC GCACGAAGAT CGAGACGATC GCGAAGATCG AGACGCCCGC GCCGACGACG
CTGGTGCTGA AGCTGAGAGC GCCGAACGCG CTGCTGCCGG ACGCGCTCGC CTCGGGCACG
ATCAAGATGA TCGCGCCGAG AAGCTTCTCC AGCGCGAGCA AGACCGCGGT CGGCACCGGT
CCGTTCAAGC TCGGCGAGAT GGTCCCCGAC GACCACGTCA CGCTGCTCAG AAACGACGGC
TACTGGGGCG AGCCGGCCAA GCTCGACGAG ATCGACGTCG TCCGCTCGCC CGACTCGACC
GCCGCCGCGA CCGCGTTCCG CGCCGGCGAC CTCGACGTGC TGTGGGCGGT CACGCCCGCC
GACGTCGACG GGCTCGTCGC CGCGACGCGC GGCCGGGCGC TGGAGCCCGA CGACGTCTCG
GCCGGCGCTT ACTGGGAGGT CGACAACACC AGCCCGCCGT TCGACGACGT GCGCGCCCGT
CAGGCGCTGC TGCACGCGAT CGACCGCGAG ACGATGCTGA AGGTCGGCTA CGCCGGCAAG
GGCCTGGTGC CGGAGACGGC GTCGATGCTG TCGCCCAGAA ACGCCGCCTT CGACAGCTCG
CTGACGACCT ACCCGTTCGA CCTCGACAAG GCGAGAGCGC TGTTCGCCGA GGCCGGCGTC
GACGCCGGCA CGACGCTGAC GTTCCACACG GTCGCGGGCC AGTACCCGGA GTGGGTGCAG
ATGGGCCAGA TCCTCCAGCA GAACCTGGAG GAGATCGGGA TCAGAATGAA GATCGAGCGC
CAGGAGTTCA GCACCTGGCT CGACACGTTC TACCCGGCCG GCAAGAGATT CCCGGGCGGC
ATCGTCGCCA ACTACCTGTC GCTGCCGACC GTTCCCAGCT ACGCGCTCAG CTTCCTCGAC
GAGGGCGTCT GCGAGTGCAA CGCGAGACTG CCCGGCTGGA GAGAGCTGTC CGCCAGAGCG
GTCGCGACCG GCGAGCAGGC GGAGCGCGAC GCGATCTACG CCGAGATGCA GCAGCTGCAG
AACGACGCCG TGCCGATCAT GCCGATCGTC TTCTCGACGC TCCAGACGGT CGTGCGCGAC
GGCGTGACCG GCGCCTGGGT CGACCCGCAG GGCAACGTCA ACCTCGAACA GGCCGGCTTC
GCGCCGTGA
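
The gene sequence is printed in blocks of 10 bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind that would underlie the "GC content" field. The helper names are hypothetical, and how the database rounds its reported percentage is not known.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAGGGTTC AGATTCGGCA CCTGGCGGTG CTCGCCGCCG GCTGCGCGGT GCTGGCCGGC"
print(len(clean_sequence(first_line)))
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1569 bp sequence to `gc_fraction` instead of the first line gives the value for the whole gene.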
 
Protein sequence
MRVQIRHLAV LAAGCAVLAG CGGGGSTSGG ETGDTAASTQ AARAGGRLVY GTAAGISQLD 
PHTLSAAQQL VVQPLLFNGL TKADPSGETT PDLAASWRAS ADQRTWTFTL RDGVRFHDGT
PFDAAAAKAN LERVLDPRVP NPDRTKIETI AKIETPAPTT LVLKLRAPNA LLPDALASGT
IKMIAPRSFS SASKTAVGTG PFKLGEMVPD DHVTLLRNDG YWGEPAKLDE IDVVRSPDST
AAATAFRAGD LDVLWAVTPA DVDGLVAATR GRALEPDDVS AGAYWEVDNT SPPFDDVRAR
QALLHAIDRE TMLKVGYAGK GLVPETASML SPRNAAFDSS LTTYPFDLDK ARALFAEAGV
DAGTTLTFHT VAGQYPEWVQ MGQILQQNLE EIGIRMKIER QEFSTWLDTF YPAGKRFPGG
IVANYLSLPT VPSYALSFLD EGVCECNARL PGWRELSARA VATGEQAERD AIYAEMQQLQ
NDAVPIMPIV FSTLQTVVRD GVTGAWVDPQ GNVNLEQAGF AP
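
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below builds a codon table from the standard compact ordering of bases (T, C, A, G). It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed. Alternative start codons are not handled, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... GGG ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()  # drop block spacing and line breaks
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final 9 bases of the gene sequence from the record.
print(translate("ATGAGGGTTC AGATTCGGCA CCTGGCGGTG"))  # MRVQIRHLAV
print(translate("GCGCCGTGA"))                         # AP
```

The two outputs match the first block and the last two residues of the protein sequence listed above.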