Gene Cwoe_0893 details

Gene Information

Locus tag: Cwoe_0893
Symbol:
ID: 8731327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 935461
End bp: 937395
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 72%
IMG OID: 646501510
Product: Collagen triple helix repeat protein
Protein accession: YP_003392701
Protein GI: 284042361
COG category:
COG ID:
TIGRFAM ID:
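
The coordinates and lengths listed in this record are mutually consistent: the inclusive span from the start to the end coordinate is 1935 bp, which is 645 codons, and dropping the terminal stop codon leaves 644 residues. A minimal Python sketch of that arithmetic follows; the function names `gene_length_bp` and `expected_protein_length` are hypothetical helpers for illustration, not part of the source database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final codon is the stop and encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(935461, 937395)
print(length)                           # 1935
print(expected_protein_length(length))  # 644
```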


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.216124
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.507094
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGTGCC TTTCGCTCAG CAGCCTCGTC GCGCTGAGCT TCCCCGCCGG CGCGGGCGCG 
GGCGTAGCCT TCGCGCCGAG CGTCAGCCTT CCGACGGGCA CCGCGCCGAG TGCGGTCGCC
AGCGCTGACG TCAACAGAGA CTGGCGTCCG GATCTCATTA CGGCCAACGA CGTCGTCGGG
GACGACGTGT CGGTGCTGGT GAGCGCGGGC GCGGGCGTGT TCGCACCCGC CACCAACTTC
TCGTTCGGGG CGTCTCCCGT GGCGATCGCG GCCGGCGACT TCGACTGGGA CAACCGGGTG
GACCTCGTGA CCGTCGACCG CGGGTCCGAC ACGGTGTCGG CGCGCAGAGG GATGGTCGGC
GGAACGTTCG GCGCGCCGAC CGTCATCCCG GTCGGCGCCG CCCCGGCTGG GCTCGTCCTC
GGCGATGTCA CAGGCAACGG CGCTGTGGAC ATCGTGACCG CCGACGCCGG TGCGAACACG
GTGACGGTGG TGCGCGAGGT CTCAGGGAGC GGGTGGGACT CCCCGGTGGG CTTCCAGGTC
GGTGCCACGC CCAGGTCGGT CGCCGCGGGC GACCTCGACG ACGGCTACTA CCGCGACCTC
GTGACAGCCA ACGCGGGTGG GAACTCCGTG TCTGTGCTGA TGAACACGAT GCCGTCCATG
GCCGCGCCCA TCACGTTCGC GCCAGCCGTC GACTATCCGG CGGGCAGCGC CCCAAGCGCG
GTCGCCCTCG GGGACTTCGA CCGCGATGGG CAGGTCGACA TCGCGACGGC CAATCCGTCG
ACGAGCAGCG TCTCCGTGCT GCTGAACACC GGCAGCGGGT TCTCGGCGCC GACCAGCTTT
TCCGTCGACG GCACGCCGCA GTCGATCGCG GTGGGGGACT TCAACAGCGA CATGACGCAG
GACATCGCGA CCGGCAGCAC CGCCGAGAGC ACGGTGTCGG TGCTCGTGAG CGACCGAGCC
GGCGGCTTCG CGGCCGCGCG CGTCTTCGGT GTCGGCGCCG CTCCGAAGGG GATCGCGGTC
GGCGATTTCA ACGCGGACGC CGCGACCGAC ATCGCGGTCG CCTCATCTGG CGCGAACGCC
GTGTCGGTCC TGATGAGCAA GCCGGCCGCC GACCTGAGCA GTGCGTCGCT GACGTTCGGC
ACGGCCGGCG CGCCGGTCCC CCAGGGTGCG CTCAGCGGCA CGCAGGGCGT GACGATCACC
AACGACGGCG GGGTGTCGAT GGCGATCGCG GGCTTCATGT TCGGCGGGGC GAACGCTGAG
GACTTCCTCG TCTCCGCGGA CACCTGTCGC GAGGCGCTCG CGCCGGACCG CAGCTGCACC
GTGTGGGTGC GTTTCGTGCC GCAGGCCCAG GGCACCCGTG CGGCGACGCT GAACATCGTC
TCCAACGATC CGGCGAACCC GTCGGTCCAG CTGCAGGGCT TCGCCGGTCC GCCGCAGGGC
GGCGCGACCG GGCTGCAGGG GCCGAAGGGC GACGACGGCG ACACCGGCCC GAAGGGCGAC
ACCGGCGCGA AGGGCGATGC CGGCACGCCG GGCTCCAAGG GCGACGCCGG CGCGAACGGT
GCGCGGGGCG ACGCGGGCGC GTCGGGCGCC AAGGGAGACA GAGGCACCAA GGGCGACGCC
GGCCCGCAGG GTCCGGCGGG CAAGGCCGGT CAGATCCGGC TCGTGACGTG CACGACCCGG
ACGGTCAGAG GCAGGAGAGT CCGCAGGTGC GAGACGAAGA CGATCGAGGG AAGCACCAGC
TTCACGACCG CCGCGACCGC TCGCGCGTCG TTGACGCGCG ACGGCCGCGT GTACGCGACC
GGCACCGCCA GCCGCTCCAC GGGCCTGCGT CTGCGCACGC GCGGCGCGGT GCCTGCCGGC
CGCTACACGC TCACGCTCCG CTACCGACAG GCGCGGCAGC AGATCACGGT GACCGCGAAG
GTCACGCTCC GCTGA
 
Protein sequence
MTCLSLSSLV ALSFPAGAGA GVAFAPSVSL PTGTAPSAVA SADVNRDWRP DLITANDVVG 
DDVSVLVSAG AGVFAPATNF SFGASPVAIA AGDFDWDNRV DLVTVDRGSD TVSARRGMVG
GTFGAPTVIP VGAAPAGLVL GDVTGNGAVD IVTADAGANT VTVVREVSGS GWDSPVGFQV
GATPRSVAAG DLDDGYYRDL VTANAGGNSV SVLMNTMPSM AAPITFAPAV DYPAGSAPSA
VALGDFDRDG QVDIATANPS TSSVSVLLNT GSGFSAPTSF SVDGTPQSIA VGDFNSDMTQ
DIATGSTAES TVSVLVSDRA GGFAAARVFG VGAAPKGIAV GDFNADAATD IAVASSGANA
VSVLMSKPAA DLSSASLTFG TAGAPVPQGA LSGTQGVTIT NDGGVSMAIA GFMFGGANAE
DFLVSADTCR EALAPDRSCT VWVRFVPQAQ GTRAATLNIV SNDPANPSVQ LQGFAGPPQG
GATGLQGPKG DDGDTGPKGD TGAKGDAGTP GSKGDAGANG ARGDAGASGA KGDRGTKGDA
GPQGPAGKAG QIRLVTCTTR TVRGRRVRRC ETKTIEGSTS FTTAATARAS LTRDGRVYAT
GTASRSTGLR LRTRGAVPAG RYTLTLRYRQ ARQQITVTAK VTLR
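
The sequences above are printed in space-separated blocks of ten characters, so they need the whitespace removed before any analysis. The sketch below shows one way to do that in Python and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used here as an excerpt. The gene starts with GTG, which translation table 11 accepts as an initiation codon read as methionine, matching the leading M of the protein sequence, and it ends with the stop codon TGA.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already cleaned nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Excerpt only: first and last lines of the gene sequence from this record.
first_line = "GTGACGTGCC TTTCGCTCAG CAGCCTCGTC GCGCTGAGCT TCCCCGCCGG CGCGGGCGCG"
last_line = "GTCACGCTCC GCTGA"

print(clean_sequence(first_line)[:3])   # GTG (start codon)
print(clean_sequence(last_line)[-3:])   # TGA (stop codon)
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the GC content listed in the Gene Information section.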