Gene Cwoe_4256 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4256 
Symbol 
ID8734718 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4522228 
End bp4523283 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content66% 
IMG OID646504882 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_003396045 
Protein GI284045705 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0797632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.637243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACAGC TGAGGACGAC GTGGCGCCAG CTCACCCTTG CGATTGCGAC TGTCGCGTTG 
GTCGCGACGT TCGCCGCCTG CGGTGGCAGC GACTCTGGGG GGACCGCCGA CAACGTCAGA
ACGACGTCTG GTGGCGAGCT CGCCGAGATG ACCAAGGCGA CGCTCGTGCT CGACTTCGTT
CCCAACGCCG TCCACGCGGG CATCTACCGG GCATTGGCAG CCGGCTACTA CAGAGACCAC
AACATCGATC TGCGCGTGAT CCAGCCGACC TCGACCGCTG ACACGCTGCG CCTCATCAAC
GCGAACAAGG CCGACTTCGG CCTCGCGGAC GGCCTCGACG TCGCCAACCA GATCGGCGAG
GGGCTCGACA TCGAGGCGTT CCTGGGGATC GTCCAGCGGC CGCTCGGCGG CGTCATCACG
CTGGAGAGAG ACAACATCGC CTCCGGCAAG GACTTCGAGG GCAAGACCGT CGGCGTCACC
GGCGTGCCGT CCGACAACGC AACGCTCGAC ACCGTCGTCA GAAACGACGG CGGCGACCCG
TCGAAGGTGA AGGTCGTCAC GATCGGCTTC AACGGCGTGC AGAACCTCCA GAACGGCAAG
GTCGCCGGCT TCATCGGCTT CTGGCCCTCC GACGGCGTCC AGCTCGACGT CGACGGCTTC
CCGACCAAGA GCTTCAAGCT CGACGAGAAC GGCGGACCGG TCTACCCGGG CCTCGTCGCC
TTCTCGACCC AAAAGCACAT CCAGCAGGAC CCGGCGCTGA TAAGAGCGTT CACGGCCGCG
ACGGTCCAGG GCTACGAGGA CACGATCAGA GACCCGCAGC AGTCGCTGGC GGACCTCCTG
TCGGAGAACA AGTCGCTCAG AAGAGACCTG ACGGCCGCGC AGCTGAGAGC GTTCGAGCCG
CTGTTCCAGG GCGACGCCGC GCGCTTCGGC ACGCTCGACC CGAGAAACGT CGAGGCGCTC
TCGAGCTGGA TGGTCGACAA CAGACTCGCG AGAGAGCCGT TCACGCCGGA GCGCTACGGC
GGCGACAGAT ATCTCCCTGC GGCCGGCGGA TCATGA
 
Protein sequence
MRQLRTTWRQ LTLAIATVAL VATFAACGGS DSGGTADNVR TTSGGELAEM TKATLVLDFV 
PNAVHAGIYR ALAAGYYRDH NIDLRVIQPT STADTLRLIN ANKADFGLAD GLDVANQIGE
GLDIEAFLGI VQRPLGGVIT LERDNIASGK DFEGKTVGVT GVPSDNATLD TVVRNDGGDP
SKVKVVTIGF NGVQNLQNGK VAGFIGFWPS DGVQLDVDGF PTKSFKLDEN GGPVYPGLVA
FSTQKHIQQD PALIRAFTAA TVQGYEDTIR DPQQSLADLL SENKSLRRDL TAAQLRAFEP
LFQGDAARFG TLDPRNVEAL SSWMVDNRLA REPFTPERYG GDRYLPAAGG S