Gene Cwoe_3860 details


Gene Information

Locus tag: Cwoe_3860
Symbol:
ID: 8734315
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4094534
End bp: 4095643
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 70%
IMG OID: 646504482
Product: extracellular solute-binding protein family 1
Protein accession: YP_003395652
Protein GI: 284045312
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0687] Spermidine/putrescine-binding periplasmic protein
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
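
The start, end, gene length and protein length listed for Cwoe_3860 can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Cwoe_3860 record.
START_BP = 4094534
END_BP = 4095643


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1110 369
```

Both results agree with the "Gene Length" and "Protein Length" fields of the record.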


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.668247
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTCAT CGTCGTTGCT GCGCCGCGGC CTGCTGGCCG CGATGGCCGT CCCGGCGCTC 
GCGCTGGGCG CCTGCGGAAG TGACCCCGGC TCGCCTGGGA GCAGCGCCTC CGAGGGCGAC
TCGGACGGCG GCAGTCTCGT GATCGCCAGC TGGGGCGGGG ACTTCTCCGC CGCCACCAAG
GAGGACCTGG CAGATCCGTT CGCCGCGCAG GCGGGCGTGA AGGTGCAGAT GGTCGAGGCG
TCCTCCCAGC ACGTCGCTCA GCTGGAGGCC CAGAACAAGG CAGGCAAGGT CACCTGGGAC
GTGATCGACT CGCTCGGCGA GGCGAACACG GCCTACCTCG TCAAGCAGGG GCTGCTGGAG
AGACTGCCGG CGGACCTGAA GGCGGAGCTG GAGAGAGTCT CGGTGCCCGG CGGCGTGACC
GACCACGGCG TCCTGCAGTC GACGATCGGC ACGCTGCTCG CGTGCATGCC CGAGCACGCG
AAGGCGTGCC CGCAGACGCC GGCCGAGTTC TTCGACACCG AGCGCTTCCC CGGATCGCGC
ATGATGTACG ACGACCCGTA CTACGGCATC CAGTTCGCGC TCGCCGCGGA CGGCCTCACG
CAGGATCAGA TGTGGCCGAT GACCGACGCG AACGTCGAGC GCGCGTTCGC CAAGCTGGAG
GAGATCGTGC CGGCCGTGCG CGTGTGGTGG ACGTCCGGCG ACCAGACGAT CCAGGCGCTG
CGCGACGGCG AGGTCGACAT GGGGCAGATC TGGAACCGGC CGGCCAAGGA GCTGTCCGAG
CAGGACCCGA GCGCGAGATT CAGCTGGGAC GGCGTGCTGC TTGCCGAGGC GTACAGCGCG
GTGCCCAAGG GCGCGCCGAA CGTCGAGACC GCGCTCGACT ACCTGAGGTT CTACGGCACC
GATCCCGAGG CACAGGCGAG ATGGGCCGCC CGCACGGGCT ACGGCGTCTC GAACGCCAAG
GCGGCTGACT TCATCTCGCC GGAGGACCTC GAGTTCTCGG TGCTCAACCC GGACAACGTC
GCGACCGCGA TCCGCGGCGA CGGCGAGTGG TGGGTCGACA ACCGCGACGA GCTGACCAGA
CGCTGGCGGA CCCTCATCAG CGGGTCCTGA
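
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any analysis. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed; the helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value is not the whole-gene GC content reported in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAGTCAT CGTCGTTGCT GCGCCGCGGC CTGCTGGCCG CGATGGCCGT CCCGGCGCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence through the same two functions would give a value to compare against the "GC content" field.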
 
Protein sequence
MKSSSLLRRG LLAAMAVPAL ALGACGSDPG SPGSSASEGD SDGGSLVIAS WGGDFSAATK 
EDLADPFAAQ AGVKVQMVEA SSQHVAQLEA QNKAGKVTWD VIDSLGEANT AYLVKQGLLE
RLPADLKAEL ERVSVPGGVT DHGVLQSTIG TLLACMPEHA KACPQTPAEF FDTERFPGSR
MMYDDPYYGI QFALAADGLT QDQMWPMTDA NVERAFAKLE EIVPAVRVWW TSGDQTIQAL
RDGEVDMGQI WNRPAKELSE QDPSARFSWD GVLLAEAYSA VPKGAPNVET ALDYLRFYGT
DPEAQARWAA RTGYGVSNAK AADFISPEDL EFSVLNPDNV ATAIRGDGEW WVDNRDELTR
RWRTLISGS
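
The protein sequence can be related back to the gene sequence by translating codons. The sketch below builds the codon table by hand rather than using a bioinformatics library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code, so the special handling of alternative start codons is ignored (the gene here starts with ATG). The function name is illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of the standard code,
# assumed here to be shared by translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 30 bases of the gene sequence, copied from the record.
print(translate("ATGAAGTCAT CGTCGTTGCT GCGCCGCGGC"))  # prints: MKSSSLLRRG
print(translate("CGCTGGCGGA CCCTCATCAG CGGGTCCTGA"))  # prints: RWRTLISGS
```

The two outputs match the first ten and last nine residues of the protein sequence shown in the record.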