Gene Cwoe_1720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1720 
Symbol 
ID8732160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1813134 
End bp1814810 
Gene Length1677 bp 
Protein Length558 aa 
Translation table11 
GC content76% 
IMG OID646502337 
ProductCollagen triple helix repeat protein 
Protein accessionYP_003393522 
Protein GI284043182 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.493637 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0315117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCACCCA TCTCCGGCCT GCGCCGGTTC CTCCCAGGCG TCGTCGCCGC GTTCGCGGTC 
GTCCCCGCGA CCGCGACGGC CGCGGACGGC CCGGCCGTGA CGGTCCGCGT CGAAGGCGTC
CAGAGTACGC TCGCCTCGCC GTTCGCCTCC GCCCAGGCGC CGCTCAGACG GATCGTGCTC
GGCGCTGCGT CCGACGCCGC GGTCGACGCC CCGGCCAGCG TGTACTGCAA CGGCCTGCCG
AGCGGGAAGG TCCCGGCCGA CTCGGTCGGC GCCGCGCTCG CGAGGGTCGA CGCGAGCTGG
CAGTTCGATC CGTTCGGCTT CCCGTCGTCG ATGACGCTGC TGAGCGAGAG ACACGGTCTC
GACGGCGTCA CGTTCGACGG CTCCGCCGGC GTCTGGAGCG TCTGGATCGG CCGCGACTAC
CACAACCTCT CGCTCGAGTC GGGTGCTCCG TGCCAGCCGC TGGCCGACGG CGCGACGCTG
CTGTTCCAGG CGTCCGAGCA GCGCAAGGCG ACGCCGGAGG AGATGTTCGC GACGCCGACG
ACCCCGCTCG TCGAGATCGA CGGCGTGCCG CAGACGGTCC TGCGTGGGAC GTCGGTCCAG
GTGACCGTCT CGACCTACGC GCCGAGCACG TGGGGCGGCG CCGCGATGCC GGGCGTCCGC
GCTCCCGGCG CCGGCTTCCA CGTCTACCCC GACGGCGCGC CGGCGGACTT GATCGCCGAC
GCCGCCGGCA ACGCGACGGT GACGCTCGAC GGGAGCGGGA ACGTCGCGAT CTCGGCGTAC
CACGCCGACG CGTGGGGCGG AGAGGCCGGC ACGTTCCCGA CGCCGACGGG CAACTCCGGC
CGCGCGCTGC CGCGGGTCGT CTGCGTCTTC GACCCCGACG CGATCGCCTC GCCGTGCGTC
GGCAACCTCG TCGGCCCCGC GACCGCGCCG GACTTCGGCA CGCAGGCGCG CGAGACGCTC
GGCGCCCCGC GCACGATCGC GCTCGGCTCG CAGCTCGGCA GAGTCGGCGT GACCGGCGTG
AAGGTGGTCG GCAGCGGAGC CGACCAGGAC GGCGCGGACG ACTTCCTGCT CAGCTACGAC
GGCTGCAGCG GGCGCACCGT CGACAGGGCG GCGCCGACCT GCTCGGTGCG TGTCCGTTTC
GCGCCCTCCG TCGTCGGCCC GCGCAGCGCG ACGCTGCGCG TCGAGAACAG CGGCGGAGGC
GGCACGCTCG ACGTGCCGCT GACGGGCACC GGCGGCGGCG CCGCGCCGGG CGCGCCCGGC
GCGGACGGCG CCGACGGCGC GAAGGGCGAC AAGGGCGACG CGGGCGCGAA GGGCGACGCC
GGTCCCGCCG GTGCGCCGGG CGCGACCGGC CCCAGCGGGC CGCAGGGTCC CGCCGGTGGC
GTCGGGCCCA CCGGTGCCGC CGGGTCCACC GGCCCCGCCG GCCCGCAGGG CAAGCCGGGC
AGAAACGGCA AGGACGCGAC GTGCACCGTG AAGCGCGGCA AGGGCGCGCC GAAGATCGTC
TGCAGGCTCG TCAACGGCGC TGGCACGGCG CGCGCGTCGA TGACCCGCGG CGGCACGACC
TACGCCCGCG GCACGGTCAG CTCGCTGCGC GCGACGCGTG CGGTGCGCGC CGGCAGCTAT
CTCCTGCGCT TCCACGTCAA GGGCAAGCGC GTGACGCAGC CGGTGCGCGT CCGCTGA
 
Protein sequence
MSPISGLRRF LPGVVAAFAV VPATATAADG PAVTVRVEGV QSTLASPFAS AQAPLRRIVL 
GAASDAAVDA PASVYCNGLP SGKVPADSVG AALARVDASW QFDPFGFPSS MTLLSERHGL
DGVTFDGSAG VWSVWIGRDY HNLSLESGAP CQPLADGATL LFQASEQRKA TPEEMFATPT
TPLVEIDGVP QTVLRGTSVQ VTVSTYAPST WGGAAMPGVR APGAGFHVYP DGAPADLIAD
AAGNATVTLD GSGNVAISAY HADAWGGEAG TFPTPTGNSG RALPRVVCVF DPDAIASPCV
GNLVGPATAP DFGTQARETL GAPRTIALGS QLGRVGVTGV KVVGSGADQD GADDFLLSYD
GCSGRTVDRA APTCSVRVRF APSVVGPRSA TLRVENSGGG GTLDVPLTGT GGGAAPGAPG
ADGADGAKGD KGDAGAKGDA GPAGAPGATG PSGPQGPAGG VGPTGAAGST GPAGPQGKPG
RNGKDATCTV KRGKGAPKIV CRLVNGAGTA RASMTRGGTT YARGTVSSLR ATRAVRAGSY
LLRFHVKGKR VTQPVRVR