Gene Cwoe_5164 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5164 
Symbol 
ID8735630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5525413 
End bp5528601 
Gene Length3189 bp 
Protein Length1062 aa 
Translation table11 
GC content74% 
IMG OID646505789 
Producthypothetical protein 
Protein accessionYP_003396948 
Protein GI284046608 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGCGC GCGCTGTTCG ACCGACCCTG GCCTGTCTCG CCCTGACGCT GATCCCGGCT 
TTGCTGCCGG CCGCCGCCTC GGCCGCCACC TCGCAGAGAT TCACGACCCC CGGCGCGACG
CAGTTCACGG TCCCGGCGGG CGTCACGAGC GTCTCGATCG ACGCGATCGG CGGGCAGGGC
GGCGGCGCGC TCCAGACGCC GCCGTTTCGG TGCCAAGGCG GTGCCGGCTC GCGCGTGAGA
GGGACGATGA CCGTCACGCC GGGGCAGAGC TTCTGGGTCG TGGTCGGCGG CGCCGGCGGC
GACGCGATGC CGCCGGCCCA GAGCAGCGCC GGCGGGTACA ACGGCGGCGG CAACGGCCGC
TCGGAGCAGT GGGGAAGCGG CGGCGGAGGC GGCGCGTCGG ACATCCGCAC GCTGCCCGTC
GGCAGCGGGC TGACGCCGAC CGATTCGCGC CTGCTGGTCG CCGGCGGCGG CGGCGGTGCC
GGTGGCGACA ACCGCGACGC CTGCCCCGGC GGCGGCGCGG GCGGCGCGAC CCCCGGTGCC
GGGCACGAAG GCGCGTTCGG CACGACCGCC GGCCAGCCCG GCACGCAGTC GGCCGGCGGC
GCGCGCGGCC TCTCGGCCCT CTGCACCACA CCCGTCCCGT CGACCGACGG CGCGCTCGGC
TTCGGCGGCA TTGGCGCCTA CGAGAGTCAG TCCAACGGCC TCTGCAGCGG CAGCGGCGGC
GGCGGTGGTG GCGGTCGCTA CGGCGGCGGC GGAGGCGGAG CGAGCCAGCT GAGCCTGTTC
TACGGCGGCG GAGGCGGCGG CGCGGGCTCC AACTACACGA GCCCGGCGAG AATCGGCAAC
GTCGCGATCA CGACCGCGGT GTGGGAGAGA ACGAACCCCC CGCTGAACGG CTCGATCACG
ATCGACTGGA CACCCGAGTC GCCGACCGCC TCGATCGCCT CGCCGGCCGA CGGCGGGAGC
TACGCGGTCG GGCAGACCGT CGCGACCTCG TTCTCGTGCG CGGACGCGGT CAACGGCGGT
GGGATCACCT CGTGCGTCGA CGGCGCGGGC AGAACGAGCC CGGGCAGCCT CGACACGACG
ACGCCCGGCT CCCACACTTA CACCGTCACC GCGACCTCGG CCAGCGGCCT GAGAGGGACC
GACTCGATCA CCTACACCGT GGCGGCTGCG CCGAGCGCGA CGATCAGCTC GCCGGCGGGC
GGCGGGACCT ATGCGGTCGG TCAGAACGTG GCGACGGCCT TCGCCTGCGC GGAGGGTGCG
AGCGGCCCGG GGATCAGCTC GTGTCTCGAC GGCGCCGGCT CGACCAGCCC CGGCAGACTC
GACACGACGA CGCCTGGGAC CCGCACCTAT ACGGTGACGG CGACCTCGGC GAGCGGTCAG
AGAGGGACCG ACTCGATCAC CTACACCGTG GCGGCTGCGC CGAGCGCGAC GATCAGCTCG
CCGGTGAGCG GCGGGACCTA TGCGGTCGGT CAGAACGTGG CGACGGCCTT CGCCTGCGCC
GACGGTGCGA GCGGGCCGGG GATCTCCACG TGCGTGGACG GCGCCGGCAG AACGAGCCCG
GGGAGACTGG ACACGACGAC GCCCGGCTCG CGCACCTACA CCGTGACGGC GACCTCGAGA
AGCGGGCAGT CGAGAGCGGC GTCGATCACC TACACGGTCG CGGCGGCGCC GAGTGCGACG
ATCACGGCGC CGGCGAGCGG CGGGACCTAC GCGGTCGGCC AGAACGTGGC GACGGCCTTC
ACCTGCGCCG ACGGTGCTAG TGGGCCCGGC ATCGACAGCT GCCTCGACGG CGCCGGGAGA
ACGAGCCCGG CGAGACTGGA CACGACGACG CCCGGCACCC GCACCTACAC GGTGACGGCG
ACCTCCAGCA GCGGCCAGAG AGGGACCGAC TCGATCACGT ACACCGTGGC GGCCGCGCCG
AGCGCGACGA TCACCGCGCC CGCCGACGGT CAGGTCTACG CCGTCGGCGC GGACGTCGCG
ACGGCCTTCG CCTGCGCCGA CGGCGCGAGC GGTCCGGGGA TCACGACCTG CCTCGACGAC
GTCGGCGCGG CGAGCCCCGG CAGACTCGAC ACGACGACGA CCGGCCAGCA CATCTACACC
GTGACGGCGA CCTCCGGCAG CGGCCAGACG AGAGCGGCCT CGATCGCCTA CACCGTCGCG
AGAGTGCCGA CGGCGACGAT CAGCGCGCCG GAGGACGGCG CGATCTTCGC GATCGGCGAG
TCGGTGACGA CGAGCTTCAG CTGCGCGATC GGCCAGGACG GCGGCCCGAT CGCGACGTGC
GAGGACAGCG GCGGCGCGAG CGGCGGCAGC GGCAGACTCG ACACGAGCAG AGCCGGCAGA
TTCACCTACA CGGTCGACGT GAGAAGCGAC GACGGCCTGA GCGGCAGCAC GTCGGTCGCA
TACACGGTCG CCGAGGCGCC GAGAGCGACG ATCAGCGCGC CCGGCGACGG CGGCACCTAT
ACCGTCGGCG ACCGCGTTGA GACCGCGTTC GCGTGCGCCG AGGGCGACTT CGGCCCCGGC
ATCGCCTCGT GTGTCGACGG CGTCGGTGCG AGTGGCACCG GGGTGCTCGA CACGGCGGCG
GCGGGCAGCC ACGCGTACCG CGTGACCGCG ACGTCGAGAG ACGGGCAGGC GGCGAGTGCG
AGCATCTCCT ACACGGTCGT GAGACCGGCC GATCCGCCGG CCAGAAGAGG TCCGGACAGC
CCCGGCACGC CGCCCGACGG CGGGAGAACC GGCCCGAGCG CTCCGAACGG CTCCGGCGAC
ACGCCGAACG GCCCGTCCGG GACGCCGAGC GGCACGCCGC CCTCCAACGA CGTCAGATTC
ACCCGCGTCC GCACGAACGC TGACGGCAGT CTCCGCTTCA CCGTGCGCTT CCCCGGCCGC
GGCCGCGCGG AGACGATGCT GACCGCGCGG AGAGCGACCC TCGCCGGCGC CTCCGCCACC
TTCACCCCGC TGGTGAGCCG CTTCGCGTTC GCGACCGGGC GTTTCACGGT GAGACGCGCC
GGCCCGTTCA CGCTGACGCT GCGCCCGAAC AGACGCGGCC GTGAGCTGAT CGCGCGCGGT
CGCGGCGCCA CCCGCCTGCG CCTCTGGGTC GCGTTCACAC CGACCGGCGG CAGACAGCGC
AAGATCAGCG TCTTCGGCGT GAGAGTGCCG GCGCCGGCGC CGGCGGGCGC GGCGCGAGCG
CGCCACTGA
 
Protein sequence
MHARAVRPTL ACLALTLIPA LLPAAASAAT SQRFTTPGAT QFTVPAGVTS VSIDAIGGQG 
GGALQTPPFR CQGGAGSRVR GTMTVTPGQS FWVVVGGAGG DAMPPAQSSA GGYNGGGNGR
SEQWGSGGGG GASDIRTLPV GSGLTPTDSR LLVAGGGGGA GGDNRDACPG GGAGGATPGA
GHEGAFGTTA GQPGTQSAGG ARGLSALCTT PVPSTDGALG FGGIGAYESQ SNGLCSGSGG
GGGGGRYGGG GGGASQLSLF YGGGGGGAGS NYTSPARIGN VAITTAVWER TNPPLNGSIT
IDWTPESPTA SIASPADGGS YAVGQTVATS FSCADAVNGG GITSCVDGAG RTSPGSLDTT
TPGSHTYTVT ATSASGLRGT DSITYTVAAA PSATISSPAG GGTYAVGQNV ATAFACAEGA
SGPGISSCLD GAGSTSPGRL DTTTPGTRTY TVTATSASGQ RGTDSITYTV AAAPSATISS
PVSGGTYAVG QNVATAFACA DGASGPGIST CVDGAGRTSP GRLDTTTPGS RTYTVTATSR
SGQSRAASIT YTVAAAPSAT ITAPASGGTY AVGQNVATAF TCADGASGPG IDSCLDGAGR
TSPARLDTTT PGTRTYTVTA TSSSGQRGTD SITYTVAAAP SATITAPADG QVYAVGADVA
TAFACADGAS GPGITTCLDD VGAASPGRLD TTTTGQHIYT VTATSGSGQT RAASIAYTVA
RVPTATISAP EDGAIFAIGE SVTTSFSCAI GQDGGPIATC EDSGGASGGS GRLDTSRAGR
FTYTVDVRSD DGLSGSTSVA YTVAEAPRAT ISAPGDGGTY TVGDRVETAF ACAEGDFGPG
IASCVDGVGA SGTGVLDTAA AGSHAYRVTA TSRDGQAASA SISYTVVRPA DPPARRGPDS
PGTPPDGGRT GPSAPNGSGD TPNGPSGTPS GTPPSNDVRF TRVRTNADGS LRFTVRFPGR
GRAETMLTAR RATLAGASAT FTPLVSRFAF ATGRFTVRRA GPFTLTLRPN RRGRELIARG
RGATRLRLWV AFTPTGGRQR KISVFGVRVP APAPAGAARA RH