Gene Cmaq_1655 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1655 
Symbol 
ID5709848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1729881 
End bp1731542 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content39% 
IMG OID641276163 
Productglycosyl transferase family protein 
Protein accessionYP_001541468 
Protein GI159042216 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.00101138 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTGCTA ATACTACCAC GCAAAGCCAA ATACTGCTTA GTAATATAAG TAGAATTATT 
GAATTTAATA CCAGCAAAAT AATTGAGAAT ATTGAATTAA GGTTTGACCA ATATATAAGT
GCAATAAACG TGGTGTTGAA TAATTCAACG TATTACACTC AATCAAATTC AACCATTGGA
ATAGCCACTA AGCCAGTACC TAAGGGTTAT GTACCAGTCA TATTAATCAC ATACATAATT
TACGCAGTAT TCATGGTGAT TGTGCTTGCT CAATTACTAA TAATGATACA TTCCGCTTAC
GCTAGGTTAA GGGTAATGAG GGACGCTAAA CCAACACACA TTAGTAACAT TAATTGGTTA
CCCTTAGTTT CAATAATAAT ACCAGTCAGG GGGGAGGGGA TTGACGCCAT TGAGGATGCT
GTTAAGAGGA TTACTGGCCT CGATTACCCA AGGGATAAGC TTGAAGTAAT AGTGGCCACT
GATGATGATG AAGCCACAGC TAATTTAATT AAGAATTCAG TGGAGAGAAT AGGTAAGACC
TATGGATTAC GAACGATAGT TAACTGGAGG AGTAAACCAG TAGGCTACAA GGGTGGTGCA
ATTAATGAGG CCGCTAAATT AGCCAGAGGA GATGTTCTAC TGATACTTGA CGTGGACACG
ATATTACCGA GGAATTACCT GAAGGTTGCC TTAAGTTACC TTAATGAGGG ATACGATGCC
GTTGGAGCAC CATTCCTGGG TGTACCCAAG GTGCCTAATA ACTTTAGCTG GCCTCTAATG
ATTTTATTCA ATACACTTAG TGAAGTTCAG ATAGTTGGTA GGGCTCTCTC TAGGTTTAAG
AGGGGTTTCT ACATGATTAT AGGTAATAAT CTCCTGATTA GGAGGGATTT CTTTAATAGG
ATTAATGGGT TATGCTACTG TAAATCCGAT GACATTGACG TAGCCCTAAG AATATGGTTA
ATGGGTGGTA GGATAGGGGT AATGAATGAG AGGGTATTAA CTGAAATACC GAGTACGTAT
GATGCGTTCA GATCCCAGAC AATAAGATGG GCCACCAATG ACATGTGGGC TCTTAAGAAG
TACTTCACTA AGATACTTAA GTCAAGGAAC AGGAGCCTTG TTGATAAGGT GGATGCCTAC
CTATGGTTAC TTAAGTACCC GTTAGTGTAC TTAGGCGTTA TCTCAATAAT AACAATGATA
ATAATGCAGG TATTCAATAT ACTGATACCA CCATTACCAA TACTGGTACT CTCAGTACTC
ACTGATGTAG TTGGGGCAGC ATTATTAATT CTAATAATTA TGGTTGGGAG AAGGATGAAT
TACAGTTATT GGGACTTATT TAAGTCACTT GTTATTGGTG GGTTAACCAT GTATGCGTTA
GCGTTCCCGC TAATGATATA CTTAGCTAAG GCCCTATTAA GTGACTTACC ATGGTTATAC
ACACCTAAGG CTTCGAAGGC ATTATTAAGG CAGAAGTTCC TTATTGAACG CTTATCAATA
ATAGCCCTAA TTGCTGTTGG AGCAGCATTA CTAATGATGG GGCATGTGAT AGTGTCATTA
TACGTCTTCG CTAACTCAAT ACTAATACTA ATGGGGTACA GTGTTGGTAC TGTTAAACCA
AGCAATAGGT TGATTCACTC CACTCAAATG ATTACTCATT AA
 
Protein sequence
MIANTTTQSQ ILLSNISRII EFNTSKIIEN IELRFDQYIS AINVVLNNST YYTQSNSTIG 
IATKPVPKGY VPVILITYII YAVFMVIVLA QLLIMIHSAY ARLRVMRDAK PTHISNINWL
PLVSIIIPVR GEGIDAIEDA VKRITGLDYP RDKLEVIVAT DDDEATANLI KNSVERIGKT
YGLRTIVNWR SKPVGYKGGA INEAAKLARG DVLLILDVDT ILPRNYLKVA LSYLNEGYDA
VGAPFLGVPK VPNNFSWPLM ILFNTLSEVQ IVGRALSRFK RGFYMIIGNN LLIRRDFFNR
INGLCYCKSD DIDVALRIWL MGGRIGVMNE RVLTEIPSTY DAFRSQTIRW ATNDMWALKK
YFTKILKSRN RSLVDKVDAY LWLLKYPLVY LGVISIITMI IMQVFNILIP PLPILVLSVL
TDVVGAALLI LIIMVGRRMN YSYWDLFKSL VIGGLTMYAL AFPLMIYLAK ALLSDLPWLY
TPKASKALLR QKFLIERLSI IALIAVGAAL LMMGHVIVSL YVFANSILIL MGYSVGTVKP
SNRLIHSTQM ITH