Gene Cwoe_3031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3031 
Symbol 
ID8733477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp3236096 
End bp3237310 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content77% 
IMG OID646503646 
ProductVWA containing CoxE family protein 
Protein accessionYP_003394825 
Protein GI284044485 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0793428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0122091 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGC GCGTCGACCT GCCGGCGTTC GCAGCCGCGT TCGGCCGGGC GGTCCACGAC 
GCCGGGATCC CGTCGCCGCC CGAGCGGGCG GTCCGCTTTG CGCAGGCGCT CGCGCTCGCG
CCGCCGGCGC GCCGCGGCAG CCTCTACTGG ACCGCGCGCA CCGTCTTCGT CTCCTCGCGC
GAGCAGCTCG AGACGTTCGA CCGCGTCTTC GCGCGGGTGT TCGGCGGGAT CGCGGACCCG
GCGCAGGCGC GCGGCGATCC GAACGCGCCG CCGCCGCCCG GCGCCCAGGC CGGGCCGCGA
CCGGCGCGCA CGCAGGCCGC GCTGCCAGAG GCCCCGCCTG AGGGTGGCGC GGCGCCGCGT
TCCTCCTTCG GCGACCTGCG TGAGCCGCGC CGCGACGGCG ACGACGCCGA GCGGCGCGAG
CGCAACGTGC AGCTCGCCGC CGCGAGCGCC GAGGAGCGGC TCGCCGAGCG CGACTTCGGG
CAGCTGACGC CCGACGAGCT GCGCGCGCTG TGGCGGCTGA TGCGCGAGCT GGCGCTGGCG
CCGCCCCTCC GTCGCTCGCG CCGCGCCCGC CGCGACCGTC ACGGCGGGCG GCTCGACGTG
CGTGCGACGC TGCGCGCGAG CCGCCGGACC GGTGGCGACC CCGCCCGCCG GATCATGCGC
AGACGGGTGC TGCGCCGCCG CCGGCTGGTG CTGCTGTGCG ACATCTCCGG CTCGATGGAG
CCGTACTCGC GCGCCTTCCT GCAGTTCTTG CACGCCGCGG TCGGCGGGAC CGACGCGGAG
GCGTTCGTCT TCGCGACGCG ACTGACCCGG CTGACGCGCG CGCTCCAGGG CCGCCAGCCG
GAGCTTGCGA TCGAGCGCGC GACGGCGGTG GCGCACGACT GGGCCGGCGG GACGCGGATC
GGCGAGACGC TACGGCGCTT CAACGACAGC TATGGCCGGC GCGGGATGGC GCGCGGCGCC
GTCGTCGTGA TCGTCTCTGA CGGCTGGGAA CGCGGCGACC CCGCGCTCGT CGCCGAGCAG
ATGGAGCGGC TGCACCGGCT CGCGCACCGC GTCGTGTGGG TCAACCCGCA CAAGGCGAGC
CGTGACTTCG CACCGCTCGC GGGCGGGATG GCGGCGGCGC TGCCGTGGTG TGACGCCTTC
GTCAGCGGGC ACAACCTGAG TGCGCTCTCG GCGGTGGCCG AGGCGATCTC GACCTCTCGG
AGGAGCACGC GATGA
 
Protein sequence
MSERVDLPAF AAAFGRAVHD AGIPSPPERA VRFAQALALA PPARRGSLYW TARTVFVSSR 
EQLETFDRVF ARVFGGIADP AQARGDPNAP PPPGAQAGPR PARTQAALPE APPEGGAAPR
SSFGDLREPR RDGDDAERRE RNVQLAAASA EERLAERDFG QLTPDELRAL WRLMRELALA
PPLRRSRRAR RDRHGGRLDV RATLRASRRT GGDPARRIMR RRVLRRRRLV LLCDISGSME
PYSRAFLQFL HAAVGGTDAE AFVFATRLTR LTRALQGRQP ELAIERATAV AHDWAGGTRI
GETLRRFNDS YGRRGMARGA VVVIVSDGWE RGDPALVAEQ MERLHRLAHR VVWVNPHKAS
RDFAPLAGGM AAALPWCDAF VSGHNLSALS AVAEAISTSR RSTR