Gene Cwoe_3449 details


Gene Information

Locus tag: Cwoe_3449
Symbol:
ID: 8733898
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 3675502
End bp: 3677493
Gene Length: 1992 bp
Protein Length: 663 aa
Translation table: 11
GC content: 78%
IMG OID: 646504066
Product: Collagen triple helix repeat protein
Protein accession: YP_003395242
Protein GI: 284044902
COG category:
COG ID:
TIGRFAM ID:
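The start, end, gene length, and protein length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon (the gene sequence in this record ends in TGA). The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3675502, 3677493)
print(length, protein_length(length))  # prints "1992 663"
```

Both values agree with the Gene Length and Protein Length fields of this record.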


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.344079
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0780597
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTGGC GCATGCGCAC GATCAGCACC CTTCTCGCCG CCGCAGCCGC GCTGCTGGCG 
CTCACCGCTC CGGCGACCGC AGCGACGTTC AGCCCCGTCG CCGGCTCGCC GTTCGCGACG
AGATCTACCG ACACGATCGA CCTCGACCTC GCCGACTTCG ACCTCGACGG CCGCCTCGAC
GCGGTCGTCG CGGATCGCGG CGCCAGCGAG CTCGCGGTCC TGCGCGGCGC CGCGGGCGGC
GGATTCGGCG CCCCGGAGGT CACGCCGATC GCGGGCCCCG GGCCGGTGCG GGTACAGGCC
GCTGACCTCG ACGACGACGG CTTCCCGGAC GCGATCGTGC AGCGGCAGAA CCAGTCGGAC
GTCACGGTGC TGCGCGGGGA CGGCACGGGC GCGCTGACGC CGGTCGCCGG CTCGCCGTTC
CCCGCCGCCG CGGAGATCTG GAGCATGGAC GTCGCAGACG TGAACGGCGA CGGCCGCCCT
GACGTCGTCG CGGGGCTGCT CGACGGCCGC GTGCAGCCGC TGCTCGGCAC CGGCGGCGGC
AGACTGAGCG CGGGAGCCGT CGTCGACGTC GGCGGGGGCA TGCTGCCGCT CGTCACGGCG
GGGCGCTTCA ACGCCGGCAG ACGGGTCGAC CTCGTGCTGG CGAGCCGCGT CGCGGCGACC
GTCACCGTCC TGCGGGGCAA CGGCGACGGC ACGTTCGCGC CGGTGCCGGG CGGCTCGCTC
GGCGTCGGCG CCGACCCCAG CGCGCTGGTC GCCGAGGACT TCGACGGAGA CGGCGACCTC
GACGTGGCGG CGAGCAGCCA GACGGACGGC ACCGTGACGG TCGCGCTCGG CGACGGCACG
GGCAGACTGG TCGTCGGCTC GACGGTCGCG GTGACGCAGA CGCCGACGGA CCTCGCCGCC
GGCGACGCCG ACGGAGACGG CGACCCTGAC CTCGCCGTGG CACGGTACGA CGCGAGCCGG
GTCCAGTTGC TCGTCAATGC CGGCGACGGC ACGTTCTCGA CCGACCTCGA CCCGCAGACG
CCGGTCGTCG CCGATCCGAT CGCCCTGACG GCCGGCGACC TCGACGGCGA CGGGCTCGCC
GACCTGCAGG TGCTCGGGCG CGGCCTCGTC GGCTCGCTGC GCAACGAGAG CGTCGCCGCC
GTCACCGTCG ACCAGCCCGC GCTCGGCTTC GCCGACCAGG CGGTCGGGAC GCTCGGCGGC
GGGCAGACCG TGAGAGTGAC GAGCGCGGGC GCCCTGCGCC TGGACGTCGA GCAGCTGCGG
CTCGACGGCG CGGCCGACGA CTTCCTTCTG CTGCAGGACA CGTGCAGCGG GCGCTCCTTC
GCACGCGGCG CGTCCTGCTC GGTGCGCGTC CGGTTCGCGC CGACCGCCGC CGGCCCGCGC
GCGGCGACGC TCGTCGTCGA GAGCGACGCC GCCGGCTCCG CGCCGGCCGT CGCGCTGTCG
GGGACCGGGA TCCTGATGCC GCCTCCGGTC GGCGGGAGAG ATGGCGCCGA CGGGAGAGAC
GGCGCGGCCG GCAGAGACGG TGCGGCCGGC AGAGACGGTG CGGCCGGCAG AGACGGTGCG
GCCGGGCCCA GCGGCGCGGC GGGCAGAGAC GGCGCGATCG GACCCAGCGG CCGGCGTGGT
CGAGACGGTG CGGCCGGCCC CAGCGGCCCT GCCGGGCCTC GCGGCCCCGC GGGGCCGGAA
GGCCCCCGGG GACGGCCTGG AGCACCCGCA GGAACGGCGC CGGCGGCCGC TTGCACGCGA
GCAGGCACCC GCGCCGTGCG CTGCAGCATC ACGCTCCGCG GCGCCCTGGC GGGCCGGCGC
GGCGCGGTGG CGGTGCGACT GCTGCGCGGC CGCGCCCAGG CCGCGGCGAC GCGGGGACGC
GCCCGTGGCG GCCGCGTGTC CGTGACGCTG CGCTCACGCC GCGCGCTCCG CCGCGGGCGC
TACCAGCTCG TCGTCGCGGC GAGCGCCCGC AAGGGCGCAC CTGTGAGCCG GGCGTGGGTG
ACGATCCGCT GA
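The gene sequence is printed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how a GC fraction could be computed from text in this layout. The helpers `clean_sequence` and `gc_fraction` are hypothetical names, and this is not necessarily how the database derived its reported GC content. Only the first line of the sequence is used here, for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of bases that are G or C in a whitespace-formatted sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied as displayed in this record.
first_line = "ATGATCTGGC GCATGCGCAC GATCAGCACC CTTCTCGCCG CCGCAGCCGC GCTGCTGGCG"
print(len(clean_sequence(first_line)))  # six blocks of 10 bases
print(round(gc_fraction(first_line), 3))
```

Passing the full multi-line sequence as a single string works the same way, since `str.split()` discards newlines as well as spaces.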
 
Protein sequence
MIWRMRTIST LLAAAAALLA LTAPATAATF SPVAGSPFAT RSTDTIDLDL ADFDLDGRLD 
AVVADRGASE LAVLRGAAGG GFGAPEVTPI AGPGPVRVQA ADLDDDGFPD AIVQRQNQSD
VTVLRGDGTG ALTPVAGSPF PAAAEIWSMD VADVNGDGRP DVVAGLLDGR VQPLLGTGGG
RLSAGAVVDV GGGMLPLVTA GRFNAGRRVD LVLASRVAAT VTVLRGNGDG TFAPVPGGSL
GVGADPSALV AEDFDGDGDL DVAASSQTDG TVTVALGDGT GRLVVGSTVA VTQTPTDLAA
GDADGDGDPD LAVARYDASR VQLLVNAGDG TFSTDLDPQT PVVADPIALT AGDLDGDGLA
DLQVLGRGLV GSLRNESVAA VTVDQPALGF ADQAVGTLGG GQTVRVTSAG ALRLDVEQLR
LDGAADDFLL LQDTCSGRSF ARGASCSVRV RFAPTAAGPR AATLVVESDA AGSAPAVALS
GTGILMPPPV GGRDGADGRD GAAGRDGAAG RDGAAGRDGA AGPSGAAGRD GAIGPSGRRG
RDGAAGPSGP AGPRGPAGPE GPRGRPGAPA GTAPAAACTR AGTRAVRCSI TLRGALAGRR
GAVAVRLLRG RAQAAATRGR ARGGRVSVTL RSRRALRRGR YQLVVAASAR KGAPVSRAWV
TIR