Gene Cwoe_2140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2140 
Symbol 
ID8732583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2249357 
End bp2250445 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content72% 
IMG OID646502758 
Productprotein of unknown function DUF917 
Protein accessionYP_003393940 
Protein GI284043600 
COG category[S] Function unknown 
COG ID[COG3535] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0447006 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCT GGCGGGTGGA AGAGGACGCG CTGGAGTCGA TCGTGATCGG CGCCGGCATC 
CTCGGGACCG GCGGCGGGGG CAACCCGTAC GTCGGCAAGC TGCGGGCGCG CAAGCTGCTG
CGTGCGGGCC ACGAGATCGA GGTGATCGCG CTGGAGGACG TCCCCGACGA GTGGCGCCTC
TGCACCGCGG GCGGCATGGG CGCGCCGACG ATCGCGGTCG AGAAGCTCCC CCGCGGCAGT
GAGACGACCG ACGCGGTTCG CGCGCTCGAG GAGCACGTCG GGCACCGGAT CGACGCGATC
CTGCCCGCGG AGATCGGCGG CGGCAACTCG ATCGAGCCGA TGATCATCGC CGCGACGCTC
GGCATCCCGA TGGTCGACGC CGACGGGATG GGGCGCGCGT TTCCGACCCT CCCGATGATC
ACGTACTTCA TCTACGGCGT CTCGCCGTTC CCGTGCGCCT TGGCGGACGA GAAGGGCAAC
CAGATCGTCT ACCCGCGCGG CGTCGACGAC CACTGGCTGG AACGGCTGAC GCGCTCCAGC
GCGGTGCAGA TGGGCGGCTT CGTCGGCTGC GCGGTGGCGT ACATGTCCGG CGCGGACGCC
AAGCGCACGG CGATCGGCGG GACGCTGTCG TGGGCGCGCG CGCTCGGCGA CCGCGTCCGC
CGCGCCCGCG CCGCCCGCGA CGAGGACGTG CTCGACGGGG TGCTCGAGGC GGCCGGCGGT
CGCGTGCTGT TCGAGGGCAA GGTCGTCGAC GTCGAGCGGC GCAGCACCGA CGGCTTCGCG
CGCGGCCAGC TCGTGCTCGA CGGGTTCGGC GGCGACGCCG GCGCGCAGCT GACGATCTCG
TTCCAGAACG AGTACCTCGT CGCGTGGCGC GATGGCGAGG TTGTCGCGAC CGTGCCGGAC
CTGATCTGCA TGGTCAACCG CGAGGACGGC GAGCCGATCA CGGTCGAGCG GCTGCGCTAC
GGCTACCGCG TCGCGATCCT GGGTGTTCCG TGCTCGGAGC TGCTGCGCAC GCCCGAGGCG
CTGGACGTCG TCGGCCCGCC GGCGTTCGGC TACGACCTTC CCTACGAGCC GATGGAGGTG
GTCCGATGA
 
Protein sequence
MSSWRVEEDA LESIVIGAGI LGTGGGGNPY VGKLRARKLL RAGHEIEVIA LEDVPDEWRL 
CTAGGMGAPT IAVEKLPRGS ETTDAVRALE EHVGHRIDAI LPAEIGGGNS IEPMIIAATL
GIPMVDADGM GRAFPTLPMI TYFIYGVSPF PCALADEKGN QIVYPRGVDD HWLERLTRSS
AVQMGGFVGC AVAYMSGADA KRTAIGGTLS WARALGDRVR RARAARDEDV LDGVLEAAGG
RVLFEGKVVD VERRSTDGFA RGQLVLDGFG GDAGAQLTIS FQNEYLVAWR DGEVVATVPD
LICMVNREDG EPITVERLRY GYRVAILGVP CSELLRTPEA LDVVGPPAFG YDLPYEPMEV
VR