Gene Cwoe_3813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3813 
Symbol 
ID8734268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4051054 
End bp4052067 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content71% 
IMG OID646504435 
Productaliphatic sulfonates family ABC transporter, periplasmic ligand-binding protein 
Protein accessionYP_003395605 
Protein GI284045265 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.939168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.433555 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAAGC GACTCGCCTG GCTGGCCACC TCGATCGCGA TGCTGCTCGC GGCGACGGTG 
CTGGCGGCGT GCGGCGGCGA CGACTCCACC GACGCGACGG CCGCGAGCGC GTCGTCGTCG
GCGTCGTCGG AGGGGGCGAC CGTCAAGATC GGCACGCTCC AGGGCGTGAC GCTCGCCGCG
GTCGCGAGAC ACATCGGCTC GATCGACAGA GCGCTTGCGA GCGTCGGCGC GAGAGCGAGC
TACGAGGGCC CGTTCCCCGC GATGGTGCCG GCGATCGAGG CGATGAACGC GGGCGACGTC
GACATCACCT ACGGCTCGAT CTCGGCCGCG ATCGGCGCGC TGGCGGGAAA CTCCGACTTC
AAGATCTTCG CGATCGAGCC CAACCAGCCC GAGAACGAGG GGATCATCGC CGGCAGAGAC
AGCGGGATCG CGACCGCCGC CGACCTGAAG GGCAAGAAGA TCGCGGTCAA CAGAGCGGGC
ACGGGCGAGT ACCTGACGCT GCTCGCGCTC GACAGAGCCG GCCTCAGCAG AGACGACGTC
GAGCTGGTCT ACCTGCCGCC GGCCGACGCG GCGAGCGCGT TCGGCAGCGG GCAGGTCGAC
GCGTGGGCGA CGTGGTCGTC GTTCACCGGC CTGGCGCAGG ACAAGCTCGG CGGCAGACTC
GTGATCTCCG GCGGCGAGCT GGGCTCGCTC AACGACACGC CGTACATCGT CTCCAGCGAG
TTCGCCGAGC GGCACCCGGC GCTCGTCGCC GCGGTCTACC GCGGTCTCCA GGACGCGGCC
GCGTGGATCG CGGCGAACCC CGCCGAAGCC GCGAGACTGT ACGCCGACGC CGGCCTGCCG
GACACGGTCG CCAGAGCGCA GGTCGACGCG GCCGAGAGAC TGGAGCCGAT CACGCCGGCG
ATCTTGGCGC GCTTCCAGCA GGTCGCGAGG TACGTCGCCG AAAGAGGCGT CGTGCCGGGC
GAAGTCGACC TGAGCGACCG CACGATCGAC GACGTGGAGG AGGCACGCAG ATGA
 
Protein sequence
MTKRLAWLAT SIAMLLAATV LAACGGDDST DATAASASSS ASSEGATVKI GTLQGVTLAA 
VARHIGSIDR ALASVGARAS YEGPFPAMVP AIEAMNAGDV DITYGSISAA IGALAGNSDF
KIFAIEPNQP ENEGIIAGRD SGIATAADLK GKKIAVNRAG TGEYLTLLAL DRAGLSRDDV
ELVYLPPADA ASAFGSGQVD AWATWSSFTG LAQDKLGGRL VISGGELGSL NDTPYIVSSE
FAERHPALVA AVYRGLQDAA AWIAANPAEA ARLYADAGLP DTVARAQVDA AERLEPITPA
ILARFQQVAR YVAERGVVPG EVDLSDRTID DVEEARR