Gene Cmaq_0171 details


Gene Information

Locus tag: Cmaq_0171
Symbol:
ID: 5709029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 201167
End bp: 202513
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 43%
IMG OID: 641274674
Product: Alpha-amylase
Protein accession: YP_001540010
Protein GI: 159040758
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1449] Alpha-amylase/alpha-mannosidase
TIGRFAM ID:
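
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch in Python, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both are assumptions, though they are the reading consistent with the listed 1347 bp and 448 aa. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 201167
END_BP = 202513


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1347
print(protein_length(length_bp))  # 448
```

Both values agree with the Gene Length and Protein Length fields.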


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.400898
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.121411
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAAAA ACATCGTATT CTTCATGGAG ATGCATCAAC CTAGGAGACT TAATAGGCTT 
CTTCATTATC AATCCTCAAT GGAACCCCTT GACCTTTTAT TCGATGATGA ACTTGATAAG
CTTATTTTAA GTAGGATTGC AGCTAGATCC TACAGTAAGG TTCTTGATAT TATTAAGGAG
GCTAATAGGG AATACGGCTA CAGGTTTGCG ATTAGTATAA CTGGGGTATT GGTTGAACAG
TTGAGGAAAT GGGCCCCGGA GGTTTTAGGG AAGTTAATTA ACTTAATTAA TGATGATGCC
GCTGAGCCAG TGGCTGAAAC CTATTACCAC TCCTTAGCTT ACTTAATTGA TGAAGCTGAA
TTCAGGGAGC AGGTTATGAT GCATGTTAAT TTAATTGAGA AGTTAACCGG GAAGAGACCT
GTTACTGTGC AGAATACTGA ATTCATGTAT AGTGATGATG TTGGTAGGGT GTTTTCAGAA
ATGGGGTTTA AGGTAGCCTT AACCGAGGGT GTGGAGAGGG TTCTTGGGTT TAGGCAGCCA
ACTTACCTTT ACAAGAGCCC AAGCGGCTTA CTGCTCCTGC TTAGGCATTA TAGGCTTTCC
GATGATGTTG GTTTCAGGTT TACGAATAAG TCATGGGACC AGTACCCGTT AACTGCCGAT
AAGTACGTTG CTTGGTTAAG GGCGACATGG GGTGATTTAG TAATGATTGG GTTAGACATG
GAGACCTTCG GTGAACACAT GCCTGAGGAG TCGGGAATAT TTGAGTTCCT GAGGTGGATG
TTTAGGCATG CTTATGAATC AGGCATAAGG TTCATAGCGC CAAGTGAGGT TAAGGGGTAC
GTGTCATCAT CATATGAACT TAACGTTAAT GAAGTAATAT CGTGGGCTGA TGCTGAGAAG
GATACCTCAG CGTGGATTGG CAATGAAATG CAGTGGACGT CGTTTAACCA AGTGGCTATG
CTTCACGGTT TAGTTAAGGA GCTTGGTGAT GAGTATCTGA GGAATTACGT TAGGTTACTC
ATGGTTAGTG ACCACTTCTA CTACATGTCA ACTAAACATG GTGCACCTCA GGATGTTCAC
AATTACTTTA ACCCATACTA CAGCCCCTAT AGAGCCTTTA CCCTGCATCA ATCCGCGGTG
CATAGGGTAC TTAGCTACAT GGCTAGGACG CATGGTAATG CCTCAGTTAT TAGGCATTTA
GCCAGGATTA AACTACCCAG TGAGTTAGCT ACCTGGGTTA GGGGTGAGTC ATTTAGTAAG
GCTCAATGCA CCAATGCCCA ATACACGGCT AGGCTTATGA CTATTAATCA CCCTAAGTTA
ACTGAAGCCT GCAGTAAATC AAGCTAA
 
Protein sequence
MVKNIVFFME MHQPRRLNRL LHYQSSMEPL DLLFDDELDK LILSRIAARS YSKVLDIIKE 
ANREYGYRFA ISITGVLVEQ LRKWAPEVLG KLINLINDDA AEPVAETYYH SLAYLIDEAE
FREQVMMHVN LIEKLTGKRP VTVQNTEFMY SDDVGRVFSE MGFKVALTEG VERVLGFRQP
TYLYKSPSGL LLLLRHYRLS DDVGFRFTNK SWDQYPLTAD KYVAWLRATW GDLVMIGLDM
ETFGEHMPEE SGIFEFLRWM FRHAYESGIR FIAPSEVKGY VSSSYELNVN EVISWADAEK
DTSAWIGNEM QWTSFNQVAM LHGLVKELGD EYLRNYVRLL MVSDHFYYMS TKHGAPQDVH
NYFNPYYSPY RAFTLHQSAV HRVLSYMART HGNASVIRHL ARIKLPSELA TWVRGESFSK
AQCTNAQYTA RLMTINHPKL TEACSKSS
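
The gene and protein sequences are printed in space-separated blocks of ten, so they need to be cleaned before use. The sketch below, in Python, shows one way to strip the whitespace, compute GC content, and translate the gene sequence. It is an illustration under stated assumptions, not the method the database used. The codon-to-amino-acid assignments of translation table 11 are the same as those of the standard code, and the table is built here by hand; alternative start codons are not handled, which does not matter for this gene because it starts with ATG. All function names are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G). Translation table 11
# uses the same assignments as the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(seq_text)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(seq_text)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record.
first_line = "ATGGTTAAAA ACATCGTATT CTTCATGGAG ATGCATCAAC CTAGGAGACT TAATAGGCTT"
last_line = "ACTGAAGCCT GCAGTAAATC AAGCTAA"

print(translate(first_line))  # MVKNIVFFMEMHQPRRLNRL
print(translate(last_line))   # TEACSKSS
```

The two outputs match the start and end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.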