Gene Cmaq_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1459 
Symbol 
ID5709007 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1536957 
End bp1537997 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content49% 
IMG OID641275968 
Productglycosyl transferase group 1 
Protein accessionYP_001541273 
Protein GI159042021 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00514335 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.00279802 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGGGTTG CTTTCATTGT TAGTAAGTCT TTTACTGAGA GGAGGCATGG TGGTTTTGGC 
TGGCTTGTGA GGTTGGTGGG TGAGGAGCTT GTTAGGCGTG GTTTTGAGGT TTATGTCCTT
GCTTGGCGTG ATACTGGCTA TCCCAGGGAG TACTCCGTGG GTAGCGTTAG GGTTGTAACA
TATGATTATC ACTTCGAGAC CAGGTCTGTC TTTAGGCACT TGCGTGATTA CTGGGGTGCT
CTGGAGGTTA TCAGGGATGT GGATGCTGAT GTTTACATAA GCATTGAGGC TATGGTGGAG
ACTTTACTGG CTGAGCTTGT TAAACGCCAT AGCGCCCATG TGGTTTGGGC GCAGGATCCC
TTTGAGTGGA GTGATTATGA GCTCCTGGCC TCCGTGGATC CTTACTACAG GATTTCCAGG
GCTAGGTTTT ACATGAATAG GTTGGTCTTT GGCGCGGCTT ATAGGTGGGC TGATTTAGTG
CTTACGCAGG CCAGATTTTA CATGAGGAAG CTCAGGGAGC TTTACGGCCT GGACCCAGGG
AGGGTTCATT ACCTGCCGAA CCCTGTACAC CCAATACATG AGTATGAGGT TAAGAAGTCC
GAGGAACCGC TGATTTGCTA CCTAGCTAGG ATGGATCCTC AGAAGCGTTA CTGGTTATTC
TTTGAGCTCA CTAAGCGCTT CCCTGACATC AGGTTTGTGA CTATGGGTAA GCCTAACGTG
CTCTATGAGG ATAGGTATAA GGAGGTTATT AGTAAGTACG TGGATTTAAG CAACCTTGAG
GTACTAGGCT TCGTACCTGA GAAGAGGAAG AGGGAGATTC TTGATAGGTG CTGGGTGCTT
GTGCTGCCGA GCATTAGGGA GGGGTTGCCC ATAGCGATGC TCGAGGCCTT GGCTCACAGG
GTTGCATTGC TTAGTTCTGT GAATCCTGAT GGGTTAACGG AGAGGTTTGG CTATTGGGCC
AGGAACGATG ATTTCGATGT GGGGTTAAAA TGGTTATTGA GTAATGATAG GTGGAGGGTG
CTTGGTGAGG AGGGCTATTG A
 
Protein sequence
MRVAFIVSKS FTERRHGGFG WLVRLVGEEL VRRGFEVYVL AWRDTGYPRE YSVGSVRVVT 
YDYHFETRSV FRHLRDYWGA LEVIRDVDAD VYISIEAMVE TLLAELVKRH SAHVVWAQDP
FEWSDYELLA SVDPYYRISR ARFYMNRLVF GAAYRWADLV LTQARFYMRK LRELYGLDPG
RVHYLPNPVH PIHEYEVKKS EEPLICYLAR MDPQKRYWLF FELTKRFPDI RFVTMGKPNV
LYEDRYKEVI SKYVDLSNLE VLGFVPEKRK REILDRCWVL VLPSIREGLP IAMLEALAHR
VALLSSVNPD GLTERFGYWA RNDDFDVGLK WLLSNDRWRV LGEEGY