Gene Cmaq_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1040 
Symbol 
ID5710377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1091145 
End bp1092491 
Gene Length1347 bp 
Protein Length448 aa 
Translation table11 
GC content44% 
IMG OID641275540 
Productglycosyl transferase family protein 
Protein accessionYP_001540859 
Protein GI159041607 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4346] Predicted membrane-bound dolichyl-phosphate-mannose-protein mannosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0000509262 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGTATGT TAAGCTTATT GAAGAGGTGG CTGCCACTAG TCATAGTGCT GGTGGTTTTC 
ACGCTTTACC TAGTGGCTAT TCAACCCCAC TTAAACTCCT ACACCAGTGA TGAGATTTGG
TACGTTTCAG CGGCTAGGAA TTTCATAAAT GAGGTTGGGT TAGGCAGGTA CTTAGTCAGT
TACGGAGTTA CAGTAGCCCC TTCACCACCA TGTAATGTAA CCGGTGCTAC ACCGGCTATA
TACTATAAGG ATAATTTAAT GGCGTGGTAC AGTAATGTAA CCTACACCCA GTTGAAGAGC
CTGATGGATA ATTCCACCTG CATAGTTAGG TTCGGTTACT ATTACCCGGA TAAGCAAAGC
ATAGTTACTT ACCTTAACAT TGAGCATCCA TGGTTCGGTA AATGCTTCAT GGTTCTCTCA
ATGCTCCTAC TGGGTGATAA GCCAGTTACC TGGAGGGTGC CTTCAATGAT ACTATCAGCC
CTAATGCTAA TACTAGTCTA CTACATTGCA TTGTACATAA GTAATAACGT GGCCTACGCA
TCCCTAGCAT CCCTGGCTCT ACTCATGGAT GCATCATTCC GCGACGTTGG TATAATAGCC
CTCCTGGATG TTTACATAGG CTTCTTCACA CTACTCACAG TATACCTATA CGTAACCAAG
CGCTTCACCT CATCATCAGT GGCCGCCGGC CTAGCGGCAG CCAGCAAGTA CCCAGGAGTC
TTCACAGTCT TCGCTGGGGC CTACTTAGAG GCTTATAGGC GTAATGGCTT ATGGGCATTA
GTCTACTTAG CGATATCACT CATAGTATTC ATAATACCTC AATTACCCAT AATTCACCTC
GTCGGCGGCT TAAGTAACTA CATTTCAGAC GTATTCTTCT ACCTAAAATG GTTTACTGAA
TCCAGGCCAC CTGGACCCGT TGCATCTAAT CCATTCGACT GGCTCATAGG CGTAGACAGT
TTTGTCAATA ATGTTTCACC ACCATTATAT GCAATGGGGT TACCTGGGGC ATACCTAGTT
GCCTTAGTTT ACTCATTCCT AATGATTATT CCCCAATCCA GGAAATTACT ATTCAGTAAT
ATTCAATTCA ACGAGTACTC AATACCAGTC ACTATACTGA CCATGTGGTT AGGCTTCTGG
ATGATATACC TACTGGGCAA TACTACGCTA TACAGTTACT ACACCATGGA CTTCGCACCA
TTAATACCCC TTGAACTAGT ATTAGCCATG CATAGGGCTA AGCCCAACAC CAAGCCATGG
ATCATTATCG CGGCATTAGC CGGCATAGCC TATGGCATCG CCGTTCAATG GAGCATGATA
TACTCCCTAA TTAAGGCTAT TACCTAA
 
Protein sequence
MGMLSLLKRW LPLVIVLVVF TLYLVAIQPH LNSYTSDEIW YVSAARNFIN EVGLGRYLVS 
YGVTVAPSPP CNVTGATPAI YYKDNLMAWY SNVTYTQLKS LMDNSTCIVR FGYYYPDKQS
IVTYLNIEHP WFGKCFMVLS MLLLGDKPVT WRVPSMILSA LMLILVYYIA LYISNNVAYA
SLASLALLMD ASFRDVGIIA LLDVYIGFFT LLTVYLYVTK RFTSSSVAAG LAAASKYPGV
FTVFAGAYLE AYRRNGLWAL VYLAISLIVF IIPQLPIIHL VGGLSNYISD VFFYLKWFTE
SRPPGPVASN PFDWLIGVDS FVNNVSPPLY AMGLPGAYLV ALVYSFLMII PQSRKLLFSN
IQFNEYSIPV TILTMWLGFW MIYLLGNTTL YSYYTMDFAP LIPLELVLAM HRAKPNTKPW
IIIAALAGIA YGIAVQWSMI YSLIKAIT