Gene Cwoe_3398 details


Gene Information

Locus tag: Cwoe_3398
Symbol:
ID: 8733847
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 3620675
End bp: 3621712
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 71%
IMG OID: 646504015
Product: aliphatic sulfonates family ABC transporter, periplasmic ligand-binding protein
Protein accession: YP_003395191
Protein GI: 284044851
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
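
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length, has_stop=True):
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the length counts the stop codon when has_stop is True.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop else codons


length = gene_length_bp(3620675, 3621712)
print(length, protein_length_aa(length))  # 1038 345
```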


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.860812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGACCT CGTTCCGCCG TGCAATCTCG ATGCTGCTCG TACTCGCGCT GCTGCCGCTC 
GGCCTGGCGG CGTGCGGTGG TGATGACGAC GGCGGCGGCG CGGACACGGC GGCGGCCGCG
AGCGGCGACG GTGGCGGCGA GAGCGGCGGC GAGAAGGTCA AGCTGCGCGT CGGCGTGCAG
AAGGACGGCA TCCGCGCCGT GCTCGGCAAG TCCGGCCAGC TCGACGACCT GCCGTACGAG
ATCGAGTGGT CGACGTTCCA GGCCGGGCCG CCGCTCGTCG AGGCGGCCGG CGCCGACAAG
ATCGACATCG CGTGGGTCGG CTGCGCGCCG CCGATCTTCG GCGCCGCCGC CGGCGCGGAG
TTCAAGGTGA TCGCCGCGGT GCAGGAGCGC GACAGACAGG AGAACCGCCT GCTCGTGCCG
AGAGACTCGG AGATCAGAGC GATCGCGGAC CTGAAGGGCA AGAAGATCGC CGTTCCGAAG
GGAACCTCCG GCCACGCCTT CATCCTCAAC GCGCTGCAGA GCGAGGGGCT GAGCACCGAC
GACGTCGAGT TCGCGTTCCT CGCGCCGCCC GACGCCCTCG CCGCCTACCA GAACGGCGCG
GTCGACGCGA TCTCGATCTG GGACCCGTTC GCGATCCAGG CGCAGCAGTC GCTCGGCGCG
CGCGAGATCG TCGCGGGCGA GCCGCACGAG CGCGGGCTCG GCTTCGAGAT CGCCTCCGCG
AAGGCGCTGG AGGACCCGGC GAAGGTCGAG GCGATCAGAG ACTACGTCAG ACGGCTGAGC
GCGGCGTGGG AGTGGGCCGG CGAGAACCCC GACGAGTGGG CGGCGGCATG GACCGAGGAC
ACGAGACTGC CGCTGTCGGT GACGAGAGCC GCGGCGCGCA GAAAGGCGTC CGACATCATC
CCGCTCGACG ACACGATCGT CGCCTCCCAG CAGAGACTCG CCGACCTCTT CACCGAAGAG
GGCGAGCTGC CCGGCGAGGT GACGTTCACC GACATCGTCG ACACCTCCGT GCTGGGGGAG
AGCGGAGGCG CGAGATGA
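
The GC content reported for this gene is the percentage of G and C bases in the nucleotide sequence. A minimal sketch of that calculation follows, assuming the sequence is supplied as text in the blocked layout used above (groups of ten bases separated by spaces and line breaks). The function name `gc_content` is hypothetical, and the rounding used by the source database is not known.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces, line breaks) is ignored, so the blocked layout
    can be pasted in directly.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten-base block of the gene sequence: 6 of 10 bases are G or C.
print(gc_content("ATGCCGACCT"))  # 60.0
```

Passing the full gene sequence to `gc_content` gives the value to compare against the GC content field.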
 
Protein sequence
MPTSFRRAIS MLLVLALLPL GLAACGGDDD GGGADTAAAA SGDGGGESGG EKVKLRVGVQ 
KDGIRAVLGK SGQLDDLPYE IEWSTFQAGP PLVEAAGADK IDIAWVGCAP PIFGAAAGAE
FKVIAAVQER DRQENRLLVP RDSEIRAIAD LKGKKIAVPK GTSGHAFILN ALQSEGLSTD
DVEFAFLAPP DALAAYQNGA VDAISIWDPF AIQAQQSLGA REIVAGEPHE RGLGFEIASA
KALEDPAKVE AIRDYVRRLS AAWEWAGENP DEWAAAWTED TRLPLSVTRA AARRKASDII
PLDDTIVASQ QRLADLFTEE GELPGEVTFT DIVDTSVLGE SGGAR