Gene Cmaq_1456 details


Gene Information

Locus tag: Cmaq_1456
Symbol:
ID: 5709217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1533317
End bp: 1534471
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 40%
IMG OID: 641275966
Product: glycosyl transferase group 1
Protein accession: YP_001541271
Protein GI: 159042019
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
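
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming 1-based inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1533317, 1534471)
print(length)                     # 1155
print(protein_length_aa(length))  # 384
```

Both results agree with the Gene Length and Protein Length fields listed in the record.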


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.550034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.000088456
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATAATA AGAATATCAT TACTATCGTT CTCGGTGAAA TTAATAGTTT ATACGTTAAG 
AAAGACCCTG TTTGTATTCC TTACGAATTC AACAAATTAG GATTTAATTC AATTCTCATT
TGTAGCAGGT GCAGCATCAA AATTCCAGGT GTTAGGGTTT TCGAGATTGG TGGTGTGAGT
TTACGTAGTG GATTTGTGCC TCTTTATTTA ATCAATGTAA TTAATAGTAT AGTCAATGCT
GTAATATTGA TGGCTAAGCT TGTCCCTATA CTTAGGAGGG TTAGGCCATG GTTTGTATTA
ACATATTACT ACCCAGCATT ATTACCGTTA CTTTACTTAC TTGGTAGGGT TCTTAACTAT
TACGTTGTAG TTAAGATGGA TTGGGATGGT AATTTAACGG GTAATTTTCT TAAGGTTCTT
TTTAGAAAAC TTATGTTGAT TGCGCTTTCG AGGTTTGCGG ACGCGGTGAT TATAGAGAGT
TACGATGCAA TGCGTAAAGC CATTGAGGCT ATACCGGCTT TACGGGGCAA ATTAAGGGTA
GTGTATAATG GATGGTGCGA TGAGCTACTC AGAGAATTTA ACCTCGGTAA TCGTGAGAGG
ATTGTTTTAA CGGTGGCTAG GGTCGTGCGT GTTAAGGGGA TTCACGACTT AATAAGAGCC
TTTGCTATGG TTGCAAATAA GCACAAGGAT TGGGTTTTGA GGATTGTTGG TCCTATTGTT
GACGCGAATT ACTATAGGGA GCTTATGACC CTGGTTAGAC AGCATAATTT AGAGGGGAGG
GTTTACTTCC TAGGCGCAAT TAGTGATAAG GAGTTGATTA GGGAGTATAG TAAGGCATCG
ATATTTGTGT TACCATCATA CGCTGAAAGC TTTGGTATTG CAAGGCTTGA GGCGCTCGCT
CATGGGTTGC CTGTGATCAC CACTGATACT GGAGGTTCTG AGGTTGTGAT GGGTGTTGGT
GTTATAATAG AGCCTGGTGA TGTAGCTTCA TTGGCGTATT GGTTAGATAG GTTAATGGGC
GATGATGCGT TGAGGTATAA CATGGGTATG AGGGCCCGTA TGAAGGCGGC CGCATTAACA
TGGAGGTTTG TATCTACAAG AATAATTAGT ATTGTGAATG AGCTAGAGTC TTTACGCTTA
AAAAATAAAA TCTAG
 
Protein sequence
MNNKNIITIV LGEINSLYVK KDPVCIPYEF NKLGFNSILI CSRCSIKIPG VRVFEIGGVS 
LRSGFVPLYL INVINSIVNA VILMAKLVPI LRRVRPWFVL TYYYPALLPL LYLLGRVLNY
YVVVKMDWDG NLTGNFLKVL FRKLMLIALS RFADAVIIES YDAMRKAIEA IPALRGKLRV
VYNGWCDELL REFNLGNRER IVLTVARVVR VKGIHDLIRA FAMVANKHKD WVLRIVGPIV
DANYYRELMT LVRQHNLEGR VYFLGAISDK ELIREYSKAS IFVLPSYAES FGIARLEALA
HGLPVITTDT GGSEVVMGVG VIIEPGDVAS LAYWLDRLMG DDALRYNMGM RARMKAAALT
WRFVSTRIIS IVNELESLRL KNKI
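
The protein sequence can be rederived from the gene sequence, and the GC fraction recomputed, with a short script. This is a minimal sketch, not the database's own pipeline: the helper names are hypothetical, and the codon table is written out by hand in the NCBI TCAG ordering. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code; the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG.

```python
# Codon table in NCBI order (first, second, third base each cycle through TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGAATAATA AGAATATCAT TACTATCGTT"))  # MNNKNIITIV
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the listed protein sequence and the GC content field.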