Gene Cmaq_1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1002 
Symbol 
ID5710454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1054707 
End bp1055984 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content41% 
IMG OID641275503 
Productglycoside hydrolase family protein 
Protein accessionYP_001540823 
Protein GI159041571 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4833] Predicted glycosyl hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.508727 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAGGTA AGCTAACCTT ACTCACGTTA CTTGCACTGG CGTTAATAGT GCTGAGCTTA 
CTTAACCTGT ATAATGCATC ATCAATAAGC TTAATTGAGC TTTACTCCAA GAGGGCTGTA
GCCACCTTTA ATGCCCTCCA AGACCACTAC TACGTGAATT CCCTAAATCT ATATAAAGGC
TCCACCTGTG GGGATTATAG TTGCCTTTGG ACTTACTCAC AGATTTTATC AGCATTAGTT
TACTTATCCC TAACACCTGG TTTAGGTAAC TTCACGAATC TATTTAATCA ATACTACTTA
GGTTTAAGTC ATTACTCTAA TCCACTTAAC CCATCATCAG GGTATGAGTC CGCTGTAACA
CCACCCATTG GTCCAGGTGG TGACACTTAT TATGATGATA ATGAGTGGGT TACGTTAGCA
TTAATAAAAA TGTACCTGGC GACCAATAAT ACTAGGTACT TGAGGAGGGC TGAGGAGTTA
TTCAACTTCA TAATCAGTGG GTGGAGTACC AATGAGTCAT TAAGGTGCCC TGGTGGAATA
TACTGGAGGG TTGGTGACTT ATCAAGGAAC ACTGTATCCA ATTCCCCAGC CGCTGAGGCT
GCTGCTGAAT TATACCTAAT AACCGGTAAC CCAAGTTACT TGAAGTGGGC TATTAGGATC
CTTAACTGGG TTAATCAATG CTTAAGGTCA CCTAGTGGAC TATATTATGA TCACATTAAC
CCAGATGGTA CCATTGATGA AACAATATGG AGCTATAACC AAGGCACCAC AGCCGCAGCG
GCGGCATTAA TATATGAAGC AACCCATAAT GAGTCATACC TGGTTCTAGC TGAAGACTCC
GCATACGCAT CCTTAAGCTA CTTTAGCCAG GGAGCCATAT ACTCGCAGCC GCCTGAATTC
AATGCAATAT ACTTCAGAAG CCTTGAAAAG GTCATTGAGA TAAGCCGTAA TAACACGCTC
TCTAAAATGT ACTGGAACCT ACTCTTAACG TATGTTAATG ATACGTGGAT AACCTATAGG
GATCCTGAAA CTGGATTAAT AACAATGGGT CAACCATTGA ATTCAGTGAA TCCTGATGAT
GCGGAAATAT GGACTGCTGC AATGGTGCAG TTATACGCTA TAATAGCTGG TTCACGGCAA
CCAATTGCAT TTAAGGCACC TAGCATTAAA CCAGGCACTG GACTTGTGCA GGTGCAATGG
CCTATAATCA TTATAATCGT TATCGTATTA ATAATTGTAA TCACATATAT TGCCTTGAGA
ATCAGAGAGA AGAGTTAA
 
Protein sequence
MVGKLTLLTL LALALIVLSL LNLYNASSIS LIELYSKRAV ATFNALQDHY YVNSLNLYKG 
STCGDYSCLW TYSQILSALV YLSLTPGLGN FTNLFNQYYL GLSHYSNPLN PSSGYESAVT
PPIGPGGDTY YDDNEWVTLA LIKMYLATNN TRYLRRAEEL FNFIISGWST NESLRCPGGI
YWRVGDLSRN TVSNSPAAEA AAELYLITGN PSYLKWAIRI LNWVNQCLRS PSGLYYDHIN
PDGTIDETIW SYNQGTTAAA AALIYEATHN ESYLVLAEDS AYASLSYFSQ GAIYSQPPEF
NAIYFRSLEK VIEISRNNTL SKMYWNLLLT YVNDTWITYR DPETGLITMG QPLNSVNPDD
AEIWTAAMVQ LYAIIAGSRQ PIAFKAPSIK PGTGLVQVQW PIIIIIVIVL IIVITYIALR
IREKS