Gene Cwoe_2431 details

Gene Information

Locus tag: Cwoe_2431
Symbol:
ID: 8732874
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2580251
End bp: 2581372
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 69%
IMG OID: 646503047
Product: Substrate-binding region of ABC-type glycine betaine transport system
Protein accession: YP_003394229
Protein GI: 284043889
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
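
The coordinates and lengths listed above are mutually consistent, which a little arithmetic can confirm. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in this record.
length = gene_length(2580251, 2581372)
print(length, protein_length(length))  # prints: 1122 373
```

Both results match the Gene Length and Protein Length fields of the record.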


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0946473
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.133086
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATGA CGGAGGCGGT GCACGCGGTG CGCCGGCGGC TGTTGCAGGT GTCTGTGGTG 
GTGGGCGTGG CGGCGCTGAT CGGCGGCTGC GGAGGCTCCT CTGACGACGA CAAGGGGTCG
TCGCAGGCGG ACAGCGGAGC GGCGGCGTCC GGGCCCGCGA CGTCGATGTC GGTCGAGCAG
CTGCGCTCCG AGCTGGAGGG CAAGGAGCTG AAGATCGGCA GCGCGGCCTT CCCGAACCCG
AGCCTCGTGG GCCTCTACAA GGTCGTCGAG CTGCTGAGAG AGGACTTCGG CATGGAGCCG
GAGCTGCAGC TGCTCGACTC GGCGCCGCTG ACGGCCGCGC TGCTGTCCGG CGACGTCCAG
CTCGCGCACG TCTCGCTGTC CGGCCTCGCG GCGGCGGCCG ACGCCGGCGG CGAGCTGACC
GCGGTCGCGG GCGACGACCA GAAGAACGTC TTCCTCGTGA CCGCGAAGGC GCCGATCAGA
ACGATGGAGG AGCTGGACGG CAAGAAGTTC GCGATCTCGC AGTCGGCCAC CTCGATCGTC
GGACAGACCG GCGCGAAGTG CTTCGAGGAC GCCGGGATGG AGATGCAGAA GGACACGCAG
CTGCTCCAGC TCGACAACGT CGGCTCGATC GTCGAGGCGC TGATGTCCGG CGCGGTCGAC
GGCGGCGTCT CCGCGACGTT CCGCCAGGTC GAGCTCGACG CGACCGACCC CGGCGAGTTC
AACGTCCTCT GCAAGGGTTG GGAGGCCGAT CCCCAGCTCA ACGACGTGAT GGTCCTCAAC
GACGACTACC TGAAGGACAA CCAGGCGCTC GCCCAGGCGG TCGCGATCGC CGAGCTGAGA
GCGGCGCGCT GGATGCAGGA GGACCAGGCC GGCTGGGAGG CGCTCGCGCA GCGGGAGCTC
GACGGGCTGA CGCCCGAGCA GGCGAGCGCC AACTACGACA CGCTCGTCAG AGAGCTCGAC
GACTGGCCCG TCAACGGCAG CCTCGATCGC AGAATGTGCG ACTACACGCT CGCCGAGGGC
AAGGCCAGCG GCGCCCTCAG GACCGAGACC AGCTGCGATG ACCTGGTGAC GTTCGAGTAC
CAGGACGCCG CCGTCAGACT GCTGGGTCCA AGCAGACGAT GA
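
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence. The sketch below shows one way to do so in Python, assuming GC content is defined as the fraction of G and C bases over all bases and that the spaces and line breaks in the listing are layout only. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First ten-base block of the gene sequence listed above.
print(gc_content("ATGGCGATGA"))  # prints: 0.5
```

Passing the full gene sequence as a single string gives the value to compare against the reported percentage.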
 
Protein sequence
MAMTEAVHAV RRRLLQVSVV VGVAALIGGC GGSSDDDKGS SQADSGAAAS GPATSMSVEQ 
LRSELEGKEL KIGSAAFPNP SLVGLYKVVE LLREDFGMEP ELQLLDSAPL TAALLSGDVQ
LAHVSLSGLA AAADAGGELT AVAGDDQKNV FLVTAKAPIR TMEELDGKKF AISQSATSIV
GQTGAKCFED AGMEMQKDTQ LLQLDNVGSI VEALMSGAVD GGVSATFRQV ELDATDPGEF
NVLCKGWEAD PQLNDVMVLN DDYLKDNQAL AQAVAIAELR AARWMQEDQA GWEALAQREL
DGLTPEQASA NYDTLVRELD DWPVNGSLDR RMCDYTLAEG KASGALRTET SCDDLVTFEY
QDAAVRLLGP SRR
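
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following Python sketch is a simplified illustration: it uses the standard amino acid assignments, which translation table 11 shares for elongation codons, and it does not model the alternative start codons that table 11 allows (this gene begins with ATG, so that simplification does not matter here). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop layout whitespace
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence listed above.
print(translate("ATGGCGATGA CGGAGGCGGT GCACGCGGTG"))  # prints: MAMTEAVHAV
```

The output matches the first ten residues of the protein sequence. Translating the whole gene sequence the same way allows a full comparison.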