Gene Cwoe_2459 details


Gene Information

Locus tag: Cwoe_2459
Symbol:
ID: 8732902
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 2613855
End bp: 2614883
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 73%
IMG OID: 646503075
Product: Collagen triple helix repeat protein
Protein accession: YP_003394257
Protein GI: 284043917
COG category:
COG ID:
TIGRFAM ID:
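
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming exactly one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2613855, 2614883)
print(length_bp, protein_length(length_bp))  # prints: 1029 342
```

Both results match the Gene Length and Protein Length fields of this record.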


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00356575
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCTATG TCCTGAGCCT TGCTCGCCGC CATGCGGTGG CGTTCCTGGC CGTCTTCCTC 
TTGCTCGGCG GCACCGCCTA CGCGGTCGCC GATCGCGTCG TGATGTCGAG AGCAACCCCC
AAGATCTACG CGTGCGTCAC GGAGACGCAC GGCACGCTGA ACCTGTCGAG CGCGAAGGCG
AGATGCGAGC CCGGCCAGCG CAAGATCTCG TGGAACGCAG AGGGTCAGCG CGGCCTGCCC
GGCACTGACG GAGCGCGCGG CGCGAGAGGC GCGGCGGGCG CCGCCGGTCC CGCCGGTCCG
GCGGGACCTG CCGGTGCCGC CGGTGCGAAG GGCGCGCCCG GTGAGAAGGG CGCGAAGGGC
GACCGCGGCG AGACCGGCCC GGCTGGTGCG GGCGAGCGCG GCCCCGCCGG TACGCAGGGC
GAGAGAGGCG CCGCCGGCGC CGTCGGTCCG ATCGGCCCGA TCGGCCCGCA GGGTCCGACT
GGCCTGCAGG GGCCGGCCGG CACCAACGGC ACCGACGGCG CCAGCGCGAT GCTGACGGCG
AGCGGACCGA CGAGCCCGAC GACGATCGCC GGCGGGGTGT CGGGCGACGT CGGCCTGCTG
CCGCTGTCGG GTCAGCTGAC CGCTTCCAAC ACCAGCATCC TCATGGGCGG CACGCTGGAC
GGGGACACGC CGGAGGTCCG GGCCGCGGCG CAGGTCTTCC CGCGCGATGG GACGCTGACC
GCCATCGGCG GGACGTTCGT CGTGACGCAG GCGATGAGCC TCATCAGCAC GACGATCACC
GTCGAGGCCC AGTTGTTCAC GGGCTCGGGC ACCACGCTGA CGCCCGTCCC GGGCGCGCAG
TGCATCGCCG TGCCGGCGCT CACGGGCATC GTCTCGATCG GCGTTCTGTC GAGATGCATC
ACGACCGGCC TCTCGATCCC GGTCACGGCG GGCACGCGTG GCGTGGTCGT CGTCACGATC
ACCGCCGCAG GCATCCAGCT GACTCAGTCG GCCCAGCTGC AGAGCGCCGT CTCGCTCGCG
GTGGCCTGA
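
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs light cleaning before any computation. The sketch below strips the whitespace and computes a GC fraction. The helper names are hypothetical, and the demonstration uses only the first two blocks of the sequence rather than the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, as a small demonstration.
first_blocks = "ATGTCCTATG TCCTGAGCCT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # prints: 20 0.5
```

Passing the complete gene sequence to the same two functions gives a whole-gene value that can be compared with the GC content field of this record.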
 
Protein sequence
MSYVLSLARR HAVAFLAVFL LLGGTAYAVA DRVVMSRATP KIYACVTETH GTLNLSSAKA 
RCEPGQRKIS WNAEGQRGLP GTDGARGARG AAGAAGPAGP AGPAGAAGAK GAPGEKGAKG
DRGETGPAGA GERGPAGTQG ERGAAGAVGP IGPIGPQGPT GLQGPAGTNG TDGASAMLTA
SGPTSPTTIA GGVSGDVGLL PLSGQLTASN TSILMGGTLD GDTPEVRAAA QVFPRDGTLT
AIGGTFVVTQ AMSLISTTIT VEAQLFTGSG TTLTPVPGAQ CIAVPALTGI VSIGVLSRCI
TTGLSIPVTA GTRGVVVVTI TAAGIQLTQS AQLQSAVSLA VA
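
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table by hand instead of relying on an external library, assuming that translation table 11 assigns the same amino acids to internal codons as the standard genetic code. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG. The function and variable names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three and last blocks of the gene sequence from this record.
print(translate("ATGTCCTATG TCCTGAGCCT TGCTCGCCGC"))  # prints: MSYVLSLARR
print(translate("GTGGCCTGA"))  # prints: VA
```

The two outputs agree with the first ten and last two residues of the protein sequence listed in this record.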