Gene Cwoe_1922 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1922 
Symbol 
ID8732363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2017079 
End bp2019088 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content70% 
IMG OID646502539 
ProductEndopygalactorunase-like protein 
Protein accessionYP_003393723 
Protein GI284043383 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.449199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCGGC TTCGACCATC TGCCCGTGCC GCGCTGCTGT GTTGCGCGGC GCTCGCGGCG 
CTGCCCGCGC TCGCCGCCGC CAGAACGTAC GACGTCACCG CACCGCCGTA CTCGGCGGCC
GGCAACGGCA CGACGAACGA CCGCCTCGCG ATCCAGCAGG CGATCGAGGA CGCGTCGGCC
GCGGGCGGCG GGACGGTGCT GGTGCCCGCC GGGAGAACGT TCCTCTCAGG CGGCATCCGC
CTGCGCTCGA ACGTCACCTT CCAGCTCGAC GGGACGTTGC AGCAGAGCTT GAACACCGCC
CACTACGCGG TCGCGCCGAT GGTCGGCTGG GACGTGCCCG GCTCGACGCT CAACTGGGAC
TCGACGGCGT TTCACAACCA GCCGTTCGTG TTCGCCGCCG ACGCCGAGAA CGTCACGCTG
ACGACGTCGG GCAGAGGGAC GATCCAGATG GGCGTGACGC CGACCTCCGC AACCGCGATC
CGCGTCGACG CGATCGGCTT CCGCGACGTC GAGAGATGCT GGATCAACAA CATCACGACG
CGTGACGTGA TCGGCTTCAA CATCGTGCTC GACCGCGCGA ACCACTGCGA CATCACCGGC
ACGTACCTCA ACTCGAAGGC CGGCAGCCTC GGCAGCGACG GCATCAACAT CACGGGCTCC
CAGCACGTGA AGGTGCTCTA CAACCACGTC AGCGCGGGCG ACGACGGCCT CTACATCGCC
GTCAGCTACG GCGACCCGCG CTTCACCGGC CCGTGGCGGG CGCCCGACAC CGGCGGCGCC
GCGCGCTACA TCGAGATCGC GAACAACGAG GTCGTCGACT TCGGCCACCA GAACGCGTTC
ACGCTGATCC CGTGGGGCAG CCTCGACCCC GACCAGCGCA ACGTCGAGAT ATCCGACGTC
TCGATCCACG ACAACACCTT CATCGCGGAC GTCGCGCAGG CGGTCGACTG CCGCTGCGAC
AACCCCTGGA GAGGCACCAG AAGATACTTC CAGGACACCG ATCGCGGCGA CCAGTCGCCG
ATGACGCGCT TCTCGATGTG GAACAACGTC TTCATCTCGA GAACGAGAGT GCCGAACTTC
CCGACCTGGG TCGGGGCGAC CTTCACCGAC TCGCAGTTCG GCGGCCTCTC AGGTGCGCCC
GGGGCGATCA GAAGCAGTCC GTCGATCCAG AACGGCGGCT TCGAGCGGAC CGGCAGCGCG
TGGTGGAGCA TCGGCGGCGT CGGCGGCGCC ACGAACGACC CGGCGATGCT GCCGGCCGGC
GCGGGCGCGG CGCTCAGAGC ATTGGGCGGC TGGGCCGGGT TCGTGCTGCC GAGCGGCACC
GCCACGACAA GCCTCGTGCA GGGGCTCGGG TTGGAGAACG CCGCCGACCT CGGGCTCCCG
CTCGTCGGCG TCGCCGGCGC GGCGACATAC CGGCTCGACG CGACCGTCGT GACGAACGGG
CAGCCGTTCC GCGTCTACGC GCACGACACG TGCGAGAACA GAGTGCTCGC GCAGCAGACG
GTCTCCGCGA CGACGGCGAC GCGCGTGTCG CTCCCGTTCA GCGTCACCAG AAGCTGCGGC
AACGTCCATC TCGGGATCGA CCGGGGCGGA GCCACGAGCG GCTGGGCGCT GATCGACGAC
GTCGAGCTGC GCGCTCCCGT CGTCGGCAAC GAGGATCCGT CGCTGCGCAC GGTCGGCACG
TGGGGCCGCG ACTGGGCCGG CGGCGACATG GGCGGCACCC ACCACCATGA CAACGGCACC
GGCAGCACGG TGACGATCCC GTTCACCGGC ACGCGCGGCA GAGTCCTCGC GCCGAAGGGG
CCGGGCTGGG GCATCGCGTC GGTCTCGGTC GACGGCGGGC CGGCGGTCGA CGTCGACCTC
TACGGCGCCG CGGCCGCGTG GCACGCGACC GTCTTCGACA CCGGCGTCCT CCCGTTCGGG
AGACACACCG TCACGATGAC CGTCTCCGGC CGCAAGCACC CCTCGTCGAC CGGCACCTGG
ATCGCGTTCG ACGCGCTGCT CGTGAGCTGA
 
Protein sequence
MHRLRPSARA ALLCCAALAA LPALAAARTY DVTAPPYSAA GNGTTNDRLA IQQAIEDASA 
AGGGTVLVPA GRTFLSGGIR LRSNVTFQLD GTLQQSLNTA HYAVAPMVGW DVPGSTLNWD
STAFHNQPFV FAADAENVTL TTSGRGTIQM GVTPTSATAI RVDAIGFRDV ERCWINNITT
RDVIGFNIVL DRANHCDITG TYLNSKAGSL GSDGINITGS QHVKVLYNHV SAGDDGLYIA
VSYGDPRFTG PWRAPDTGGA ARYIEIANNE VVDFGHQNAF TLIPWGSLDP DQRNVEISDV
SIHDNTFIAD VAQAVDCRCD NPWRGTRRYF QDTDRGDQSP MTRFSMWNNV FISRTRVPNF
PTWVGATFTD SQFGGLSGAP GAIRSSPSIQ NGGFERTGSA WWSIGGVGGA TNDPAMLPAG
AGAALRALGG WAGFVLPSGT ATTSLVQGLG LENAADLGLP LVGVAGAATY RLDATVVTNG
QPFRVYAHDT CENRVLAQQT VSATTATRVS LPFSVTRSCG NVHLGIDRGG ATSGWALIDD
VELRAPVVGN EDPSLRTVGT WGRDWAGGDM GGTHHHDNGT GSTVTIPFTG TRGRVLAPKG
PGWGIASVSV DGGPAVDVDL YGAAAAWHAT VFDTGVLPFG RHTVTMTVSG RKHPSSTGTW
IAFDALLVS