Gene Cmaq_1371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1371 
Symbol 
ID5709866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1444760 
End bp1446580 
Gene Length1821 bp 
Protein Length606 aa 
Translation table11 
GC content46% 
IMG OID641275881 
Productglycoside hydrolase family protein 
Protein accessionYP_001541187 
Protein GI159041935 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1449] Alpha-amylase/alpha-mannosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTCTACG TGAGGGCTTT ACTTGATGTT GATAGCCCAG TGCATAAGGT GGGGGATGAG 
GTTGGGGTTA AGGTTAGGTT GATTAATGAT TCATCATCAT CAATAAACGT TAACATAACC
CTCGACTACC TACTGGAGGG TAGGCACGTT AACTCATGGA CTGGTTCAGC CCTAGCACTA
CCCGGTGAGG TGACTACGGT TAACGCATCA TTTACCGTGG GTGAGGCTGG GCTTTGGGTT
ATTAGGCTTA ATGGTGACGC GGGGACTAGT AAATTCACTG AGTCAATTAA GGTGAGGGTT
ATTGAGGGGA GGAGGCCGGT TAAATTAGCC TTAGTGTACC ACATGCATCA ACCCCCATGG
TACATGAGTG ACGGCAGGTA TTACGCTGAT TGGGCATTCA GGTACGTTCA TGCCCCGGTT
ATGGCACCCT TCTTCAACGG TGGCCCATAC TTATTCCACG CATTCCTCAA TGACAAGTAC
AGTGGAGTTA AGGTGAATAT TCACTTATCC CCAAGCCTAC TTAAGCAGTG GGTTGATGCC
ATTGAGAAGG GTTATACCCT CATTAATGGT GAAGTCCACG CAAAGGGTAG TGGTGAGGTT
AATGCGGTGG CTAAGGTCCT TGATATGTAT AGGGTTCAGG CTAATAGGGG GCAGTTGGAT
GTATTATCAA GCGTATACGC CCACACCATA CTGGGTTACT TAGCATCAAG GTACGAGATT
ATTGACGTTA TTGATGAGGA ACTGGGCGTG GGTATGGAGG TTACTAAAAG TACCCTGGGT
GTTAATCCCG TTGGTGTTTG GACTCCTGAA ATGGCGTGGA GTATGGAGTT GCTTGACATA
TATGAGAAGC ATAAGGTCGG CTACACTGTG CTTGATGGTG GTAATCACTT CCCTGGGGTT
CAGGGGGATA AGGGGAGTAT TTATGAACCC TATAGCCTGG GTGGTAGGTT AACAGTATTC
TTTAGGGATG AGAGGTTAAG TAACATTTTA TCCTTCCAGA ATAATATCCC TGACGAGAGG
TCTGCGGTGA AGCTTGCCGC AATGCTCAGT AGATCCATTA TTGAGACTAA TGGTGAATTA
GTGGTCATTG CCCTTGATGG TGAGAACTTC ATAGCCATGT CCAAGACCCC GGCCATGGTT
GGTTTAATGC TTGATAAATT CTACTCATAC CTCAGTAGAA TGCAGGAGTT AGGCATTATT
GAGACTGTTA GGCTTAGTCA AGTTAACATG AGTAGGAGAA GCATAACCTA CATACCCACA
ACCTCCTGGT TAGGGGGCTT CACTAAGTGG GATGGAGAGA GGAGGGAGCA TGCAGAGTAC
TGGGTTAAGG TCATTGACTC ATACAGGTAC TTGAGGGGTC TTGAGGATGC ATTGGGTGGT
AAGATTAATG AGGCTAGGTA CGCCCTATGG CATGCCCTAG ACAGTGACTT CTGGTGGGCT
GAGTTCTGGA ATCCTGATTT AATTAACCAT TGGGTTGAGG AGTTCCGCAA TATCCTGGAT
TCAAGGTTCA AGATAGCCAT GAGGCCCCTA AGGGAGGTTT ACAGGGGGCT TGTTAATAGG
CCTATTGATG TGGAATTAGA GTTTGATAAT GACATGGGGG TTAACGTTAA GTTCAAGTTA
ATTTGCCTAG ATACTCAACT GGATGTTGTT ATTCAGCCTG GTTCCTCAAG GATTAAGTGC
AGTATAATAC CTAGGTTAGC CGGTTCCTAT AGGGTACCCA TATTCGTAAC CTCAGGTAAC
TACATTTACC TACAATCCTA CGTAACCCTA AACGTCACCT ACGGTAATAG GGATCCACCT
AATGAGGATT CAGCGGGATA G
 
Protein sequence
MVYVRALLDV DSPVHKVGDE VGVKVRLIND SSSSINVNIT LDYLLEGRHV NSWTGSALAL 
PGEVTTVNAS FTVGEAGLWV IRLNGDAGTS KFTESIKVRV IEGRRPVKLA LVYHMHQPPW
YMSDGRYYAD WAFRYVHAPV MAPFFNGGPY LFHAFLNDKY SGVKVNIHLS PSLLKQWVDA
IEKGYTLING EVHAKGSGEV NAVAKVLDMY RVQANRGQLD VLSSVYAHTI LGYLASRYEI
IDVIDEELGV GMEVTKSTLG VNPVGVWTPE MAWSMELLDI YEKHKVGYTV LDGGNHFPGV
QGDKGSIYEP YSLGGRLTVF FRDERLSNIL SFQNNIPDER SAVKLAAMLS RSIIETNGEL
VVIALDGENF IAMSKTPAMV GLMLDKFYSY LSRMQELGII ETVRLSQVNM SRRSITYIPT
TSWLGGFTKW DGERREHAEY WVKVIDSYRY LRGLEDALGG KINEARYALW HALDSDFWWA
EFWNPDLINH WVEEFRNILD SRFKIAMRPL REVYRGLVNR PIDVELEFDN DMGVNVKFKL
ICLDTQLDVV IQPGSSRIKC SIIPRLAGSY RVPIFVTSGN YIYLQSYVTL NVTYGNRDPP
NEDSAG