Gene Cmaq_0444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0444 
Symbol 
ID5709653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp478736 
End bp479899 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content44% 
IMG OID641274947 
Productglycosyl transferase group 1 
Protein accessionYP_001540279 
Protein GI159041027 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.990869 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGTAC TTCATCTTTC CTGGGAGTAC CCACCACACA TAGTTGGTGG CTTAGGTAGG 
CACGTGTACT ACATAACCCA TGAGCTCATT AAACTGGGTG TTAATATTGA TGTAGCCACT
GTGGGTTATG AAGACACCCA CGTTATTGAT GAGGGTGTTA ACGTGCATTT AATCGACGCA
TTCAAGGTTA GGGTACCTGA CTTCTCATCA TGGGTTCACT CCTTCAACAT ATTCATGATG
ATGGATTTAA GCCACATAAG TGAGGTTGAT GCAATTCACG TTCACGACTG GTTAACTGCA
CCGGCAGGTA TTGTGCTTAA GCATAGGTTT AAGATACCCT TAATAGCCAC AATACACGCC
ACGGAATACG GCAGGAGGGG TGGATTGCAT AGCCTTGAGT CCAAGCATAT TCATGAGTGG
GAGTGGTTAC TTGCCTATGA GGCATGGAAG ATCATAGTCT GCAGCAACTA CATGGCCAAT
GAGGTGAAGA GCGTCTTCGG TGTGCCTGAT GATAAAATAG TTATGATACC TAACGGCATA
GATAAGGCGC TACTCAGCTT TAAGCCTAAG TACGACCGCT CCAGGTACGC TTACCCCTGG
GAATTACTAA TAGTGTTCTA CGGTAGGTTA GTTTACGAGA AGGGTCCTGA CTCTGTGATT
AGGGCTTTCG CCAAGTTAAT GAGCAGGATG AGTAACATTA AACTCGTAAT AATTGGTGAT
GGGCCGATGA GGGAGTACTT AGTTAACCTG GCTAATCAAC TTGGGTTAGG TAGTAAGGTT
TACTTCACAG GTAAGGTGAG TGACGATGAG TTATACAGCA TAATAGCTCA CTCAAATCTA
GTCATATTGC CAAGTAGATA TGAGCCATTC GGTATAAGTG CACTTGAGGC CATGGCGCTT
GGTAAACCAT TAATAGCAAC TAATAGGGGT GGGCCAACGG ACTTCATTAG ACATATGGAG
AATGGGGTAT TAATCAACCC AGATAACCCT GATGAAATAG CCTACTACGC CGAGATGCTG
CTTAAGGATG AGGGCTTAGC CCGTAGGTTA GCTAATGAGG CTAGGGGAAC GATAATGAAG
GGGTACACTT GGGATATTAT AGCTAAGAAA ACTTATGAAC TCTATAAAAC AATAATTGAG
GAGAGGGCTA AGGTTAATTG GTAA
 
Protein sequence
MRVLHLSWEY PPHIVGGLGR HVYYITHELI KLGVNIDVAT VGYEDTHVID EGVNVHLIDA 
FKVRVPDFSS WVHSFNIFMM MDLSHISEVD AIHVHDWLTA PAGIVLKHRF KIPLIATIHA
TEYGRRGGLH SLESKHIHEW EWLLAYEAWK IIVCSNYMAN EVKSVFGVPD DKIVMIPNGI
DKALLSFKPK YDRSRYAYPW ELLIVFYGRL VYEKGPDSVI RAFAKLMSRM SNIKLVIIGD
GPMREYLVNL ANQLGLGSKV YFTGKVSDDE LYSIIAHSNL VILPSRYEPF GISALEAMAL
GKPLIATNRG GPTDFIRHME NGVLINPDNP DEIAYYAEML LKDEGLARRL ANEARGTIMK
GYTWDIIAKK TYELYKTIIE ERAKVNW