Gene Cwoe_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1014 
Symbol 
ID8731449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1068045 
End bp1069616 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content66% 
IMG OID646501632 
Productextracellular solute-binding protein family 5 
Protein accessionYP_003392822 
Protein GI284042482 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCCAA GAGCTATGGG GGTCAGCCGG ATGCGGCGCT TGGCCGTGGC TGTCGGGGTG 
CTGGTCGTTG GCGTCGGCGC CGCGGGGTGC GGCGGCGAAA GCACGACGAC GACGACCAGA
TCAGGCGGGG GAGCGAAGGT GCTGCGCTAC GGCGTTCCCG CTTCGATCGT GGCCAGAGGC
GTGAGCAATC CGGCCGTGAT CCCGAGCACG GACGGACCGA TCCTGAGCAT CGCCTACGCC
CCTCTGTTCC ATGCCGCTCC GGACGGCAGA ATCGAGCCGG CGCTGGCCGT CAGATGGCGC
TACACCGATG ACGACCAGAA GGTCTTCGAG TTCACCCTGC GCGAGGATGC GCGCTTCTCC
GACGGGACCC CGGTCACCGC GGAAGCGGTG GTGGGTTCGC TCGAGTACTA CTACAAGAGC
AGAAACATCT ACTCGGAGCT GCTGGGGAGA AACCCGAGGT TCGAGGCGGT CGACAGATGG
ACGGTGCGCG TCACGTTGAC GGCGTCGCTG CCCAACCTGC CGTTCCTGTT CTCCGAGGCG
AACGTCAACT GGGGCTTCGT GATGGCGCCG AAGGGAGTGG CGAATCCCAG ACTGTTCACG
AAGGCCACCT ATGGCGCGGG GCCGTACAAG CTCGACTACT CGAAGTCGGT GCCCGGCGAC
CACTACACGT TCGTGCCGAA CGAGTACTTC TACGACAGAT CGGCGATCAA GTTCAAGGAG
ATCTACCTGA AGGCGTTCGC CGATCCGTCG GCAGCGCTGC AGGCGCAGCA GGCCGGTCAG
ATCGACGTCG GCTGGACGAT GGACTCGTCG ACGGCCGAAG CGGCCGAGTC GGCCGGTCTC
GACGTCGTGT CTGCCCCATT CGCCGTTCTC TACATGACGA TGAACGCGCG CAGAGGCACC
GAGGCGCTTC GCGACGTCCG TGTGCGCCGG GCGCTGAACT ACGCGATCGA CCGCAAGGCG
ATCTCGAACG CGCTCTTCGG CAGATACGGC GTCCCGATGT CTCAGTTCAC GGTTCCGCCG
GACTCCAATC CCGGGTTGGA GAACGCGTAC CCCTACGATC CGGAGAGAGC GCGCGCGCTG
CTGGCCGAGG CGGGATACCC GAGAGGGTTC GAGTTCTCGA TCAACACCGG CAAGGACGAT
CCCCAGGCGA AGGCCGTCGA GCTGGTGGCC AGCTACCTCG ACAGAATCGG CGTCAAGTCG
AACGTCAGAA CGTTCCAGAA CCGCGCGGGC TACCTCGACG CAGCGCTGTC GTTCAAGGAC
GACTCGGCGA TCTTCGCCGG CGACGTCGGC GTGCCGACGA CGATCGAGTA CCCCAGCTAC
ATCGGGCCGA GCAGCACGCT CGGCGGGGGC GACCCGGTCA ACCCGAGAGT CAACGAGCTC
TACGAGGCCG GTCTCAGAGC GAGCGACCCC TCCAGAGACT GGAAGGAGAT GTGGGCGATC
ACGGTGAACG ACGCCTGGTT CCTGCCGATA TCGGGGTTCA GTGACCTCGC GTACGTGTCG
GACGGGATCG GCGGGGTGCA GATGACCCCG GCCCGCCCGT ACTCCTTCCC GACGGAGTGG
TTCCCCAAGT AG
 
Protein sequence
MSPRAMGVSR MRRLAVAVGV LVVGVGAAGC GGESTTTTTR SGGGAKVLRY GVPASIVARG 
VSNPAVIPST DGPILSIAYA PLFHAAPDGR IEPALAVRWR YTDDDQKVFE FTLREDARFS
DGTPVTAEAV VGSLEYYYKS RNIYSELLGR NPRFEAVDRW TVRVTLTASL PNLPFLFSEA
NVNWGFVMAP KGVANPRLFT KATYGAGPYK LDYSKSVPGD HYTFVPNEYF YDRSAIKFKE
IYLKAFADPS AALQAQQAGQ IDVGWTMDSS TAEAAESAGL DVVSAPFAVL YMTMNARRGT
EALRDVRVRR ALNYAIDRKA ISNALFGRYG VPMSQFTVPP DSNPGLENAY PYDPERARAL
LAEAGYPRGF EFSINTGKDD PQAKAVELVA SYLDRIGVKS NVRTFQNRAG YLDAALSFKD
DSAIFAGDVG VPTTIEYPSY IGPSSTLGGG DPVNPRVNEL YEAGLRASDP SRDWKEMWAI
TVNDAWFLPI SGFSDLAYVS DGIGGVQMTP ARPYSFPTEW FPK