Gene Cmaq_1626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1626 
Symbol 
ID5708619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1701545 
End bp1703413 
Gene Length1869 bp 
Protein Length622 aa 
Translation table11 
GC content43% 
IMG OID641276134 
Productglycoside hydrolase 15-related 
Protein accessionYP_001541439 
Protein GI159042187 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0120463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.415513 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTAA GCGGCATAGT TGTAGTAATG GTAGTATTGG TGGTATTAAT AATTGGTTTA 
GGCGTTGTGT TAACTCACGT TAAGAGTCAT GGCGCTGTGG CAGGTGGTTT AAACCTAGGT
AACTGCACCT TCCCCAGTAT AATTAACCCA ACGCTTCCAT ATAATACACC ACCCTCAGCC
TCATATCTAC TACTCAGTAA TTGGGTTAAC ATGAGTGCCT TAATATCCAC TGGGTTCCCT
ATTTTCGGAT TAAGTCAAAG CGGTGGGTTG GTTACAATGC CAACATCATT GAGGTGGCTT
TTAATTAAGG GGATTGGGCT AGTTAACGTA AGCCTATACG TTGAGAACCT GGGTAGTATT
CGCAACGCCT ACCTCACGTT AAATAACTCT ATTGTTATAA AGGGTACTAA TGGGTGGGTT
GAATTCACCA TGCCCCCATA CACTAACGCT GTTTTAATTA CTCAAAATTC AACGGTGCCG
TTGAGTTACG TTATTGAGGC TATAGGTAAT TATAGTTATG GGCCAATTAA GGATGGTGTG
GTTATTATTG GTGAACCAAA CATAACCGTT TACTCACCTG GCTCCTCATT CAGTGTCAGC
GTAATGTATA AGTCGATTCA AGTCACAGTG AAGGCTAGTG GATTCAGTTA CATTGAGTTA
ATGATTAATA ATCACCCTGA TCCTCCCCAG TACTTACTTA GTATTAATAA TGAGGAGGTT
GAGTCATGGC TCATGAAGTC AAGGAAACCG AACCTAAGTG GTTGCCTACT TAATGAATAC
TACTTAAGCC TACTCCTGAT TAAGGATTCC CAAAGCCCAA TAACCGGTGG CTTCGCTGCA
TCGCCTGAAC CAATTTACCT ATACACCTGG GTTAGGGACT CCTCCTTCGC CGCCATGGCT
CTTCAGGAGT CAGGACATTA TAATTCAGCA GCGGAGTACT GGCTTTGGAT GGCTAAGGCT
CAGAATAATT CAGGGGCTTG GTTCACTAGG TATAGTTTCT TTAACGGGAA CCCTGACTAC
GGGTACGGTA TACCTGAATA CGATAGTGTC GGGTTGTTTC AAATAGGGGT TTACCAGTAC
TACGAGTTGA CTCATAATGA ATCCTTCATA AATGAGGTAA TCCCTGCGGT GAATAAGTCA
CTGAACTGGG AGTATAGGGT GATTAATAAT ACTGGACTCA TACCCCAGGA CTTAAGCATA
TGGGAGGACC TATACGCCTA CAATTTCTGG ACCCAGGGAG TGGACTTGGA TGGGTTAGTG
GCGTCGTATA GACTCTACAG TGAGTTAGGG TATGATGCAT CATGGATACT GGCTATGATT
AATGAATTGA ATAATACGAT TCAACACGAT TTTTACCTTG ATGGATGCTA CGTTAGGGCA
CTGGAGCCGA GTGAGGTTTA CTACCAGGGT AAGACTCAGA TTACTTTAGT ACCAACCAGT
GTAATCTATG ATTCATCAGT AATACTTCCC ATTGACCTCG GGTTACTAAA CCCCTCAAGC
AGTAGGGCGG TTAATGCCGT TGACTGCGTT ATTAGTAATT TATGGAATAG TAAGGTGGGT
GGTTTAGCTA GGTATACTGG TGATATTTAC CATTACGCAG CCTACCTCTA CGACAGTAGC
GGAGAGGAGC CACCCTGGGT TATAACCACT CTCTTCCTAG CCCTATACTA TGAGGAGTTG
GGGAATTACA CAGCATCCTT AAATTTAATG AATTGGGCAA TCAACCACAC TGAATACGGT
CTACTACCTG AGGCCATTGA CCCCAATTAC GGTAACCCAC TTCCATCAAC GTCCCCATTA
GTTTGGTCGG CCGCAATGTA CGTTATAACG GCGCTTAATT ATAATCAAAC CAACGTAAAC
CACGGCTAA
 
Protein sequence
MRLSGIVVVM VVLVVLIIGL GVVLTHVKSH GAVAGGLNLG NCTFPSIINP TLPYNTPPSA 
SYLLLSNWVN MSALISTGFP IFGLSQSGGL VTMPTSLRWL LIKGIGLVNV SLYVENLGSI
RNAYLTLNNS IVIKGTNGWV EFTMPPYTNA VLITQNSTVP LSYVIEAIGN YSYGPIKDGV
VIIGEPNITV YSPGSSFSVS VMYKSIQVTV KASGFSYIEL MINNHPDPPQ YLLSINNEEV
ESWLMKSRKP NLSGCLLNEY YLSLLLIKDS QSPITGGFAA SPEPIYLYTW VRDSSFAAMA
LQESGHYNSA AEYWLWMAKA QNNSGAWFTR YSFFNGNPDY GYGIPEYDSV GLFQIGVYQY
YELTHNESFI NEVIPAVNKS LNWEYRVINN TGLIPQDLSI WEDLYAYNFW TQGVDLDGLV
ASYRLYSELG YDASWILAMI NELNNTIQHD FYLDGCYVRA LEPSEVYYQG KTQITLVPTS
VIYDSSVILP IDLGLLNPSS SRAVNAVDCV ISNLWNSKVG GLARYTGDIY HYAAYLYDSS
GEEPPWVITT LFLALYYEEL GNYTASLNLM NWAINHTEYG LLPEAIDPNY GNPLPSTSPL
VWSAAMYVIT ALNYNQTNVN HG