Gene Cwoe_4578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4578 
Symbol 
ID8735043 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4880179 
End bp4881513 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content73% 
IMG OID646505206 
ProductCollagen triple helix repeat protein 
Protein accessionYP_003396366 
Protein GI284046026 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.567616 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGGGA TCGGACGTAA GGAAGCAACC ATCCTCGCTG TGCTTGTCGC AGCAGCGATG 
AGCCTGCTGT GGACTTCAGG CGCCAGCGCG GCCGCGACGA GCACCGCGAC GACGAGCGCC
ACCAGAAGCA CGAGAACGAA CAGAAACAGA GGGAGAGGCG GCGTCAAGGG CGTCTCCCAG
AGAAGCGGTC CGAGAGGCGC CAGAGGCGCC ACCGGCGCGA CGGGCCCCGC CGGTCCTGCC
GGCCCGATCG GCCCGGTCGG CCCCGCCGGC CCGCAGGGCG CCGCCGGCCC CCAGGGACCG
CAGGGCGAGA GAGGCGCGGC CGGCGCCGAC GGCAGAAACG GCGCGAACGG CCTCAAGGGC
GACACCGGCG CTGCCGGCAC CGGCGCCAAG GGCGACACCG GCGACCAGGG CATCCAGGGC
ATCCAGGGCG TCAAGGGCGA TACCGGCGGC CAAGGCATCC AGGGCGCCAA GGGCGACACC
GGCGGCCAAG GCATCCAGGG CGATACCGGC GGCCAAGGCA TCCAGGGCGC CAAGGGCGAC
ACCGGCGACC AAGGCACCCA AGGCGTCAAG GGCGACAAGG GCGACACCGG CGACCAAGGC
ACCCAAGGCG TCAAGGGCGA CCAAGGCACC CAGGGCATCC AGGGCGACGT CGGCCCCGAG
GGGCCGCAGG GCCCCATCGG TCCGAATGGC CTCCAGGGCG TCAAGGGCGA TAAGGGCGAC
AAGGGCGACG ACGGTGCCGC TGGTGGCGAC GGCGCGGCTG GCGCAACCGG CGCGGCTGGC
GGCACGGGCG CGACCGGCGC TCAGGGTCCG CAGGGCGAGA CCGGTCCGAT CGGTCCGCAG
GGTCCCGCAG GCCCGCAGGG CGATAAGGGC GACCCCGGCA CACCCGGCGA TCAGGGTCCG
GCCGGCAACA CCGGCCCGCA GGGTCCGGCC GGTGCCGACA GCACCGTCGC AGGTCCGAGA
GGCGAGACCG GCGCGACCGG CGCGCAGGGT CCCGCAGGTC CGAGAGGCGA GACTGGCTCG
ACCGGCGCCA CCGGTGCCCA GGGTCCGAGC GGTGCGGTCG GTGCGGTCGT CACCTCGTCG
ACGTTCACCG TCACGAACGA CGGCTACGCG ACGATCGCCT GCCCGAACAG CGGCGTGGCG
CTCGGCGGCG GCGGCCAGTT CACGAGCACC AGTGGCGGTG GCGGAACGGT GTACACGGTG
CGCGGTTCGT TCCCGTCGGC CACCAACGGC ACTCCGACCA CGAGCGGGTC GGCAGGCAGA
GCGTGGACGA TCCGGTCCTC CAGCGCGAAC GACACCCAGG GCGGCACCGT CTACGTGATC
TGCGTCCCAC AGTGA
 
Protein sequence
MKGIGRKEAT ILAVLVAAAM SLLWTSGASA AATSTATTSA TRSTRTNRNR GRGGVKGVSQ 
RSGPRGARGA TGATGPAGPA GPIGPVGPAG PQGAAGPQGP QGERGAAGAD GRNGANGLKG
DTGAAGTGAK GDTGDQGIQG IQGVKGDTGG QGIQGAKGDT GGQGIQGDTG GQGIQGAKGD
TGDQGTQGVK GDKGDTGDQG TQGVKGDQGT QGIQGDVGPE GPQGPIGPNG LQGVKGDKGD
KGDDGAAGGD GAAGATGAAG GTGATGAQGP QGETGPIGPQ GPAGPQGDKG DPGTPGDQGP
AGNTGPQGPA GADSTVAGPR GETGATGAQG PAGPRGETGS TGATGAQGPS GAVGAVVTSS
TFTVTNDGYA TIACPNSGVA LGGGGQFTST SGGGGTVYTV RGSFPSATNG TPTTSGSAGR
AWTIRSSSAN DTQGGTVYVI CVPQ