Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_1011 |
Symbol | |
ID | 5709407 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 1061375 |
End bp | 1063120 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641275512 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001540832 |
Protein GI | 159041580 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1626] Neutral trehalase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.956563 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.837929 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATTA GTAATAAGCT TGAAATCTCA AGTAACATAA CATCAATAAT AATGAATAAC ACGGGTATAG GGTTAAGATT ATTCCACGGG GAGTACGTTG AGGGTGAGTT AAGAGGTAGT GGTGCTGCAT TACTTAGGAG TTATTGGGAT TATGGGGGAA TTAGGAGGTA TTGGGGTAAT GGTGCCTTCG AATTCACCTC AATATACGGT GACTTAGCAG TCTACGTGAT AAGAGGCTAT AGGGGTGTTA TTCAAGGGGA GTTTAAGGTA AGGGGCTTCA GTGGGGTTGA TGAATTAGGG GATGGTTCAA TTAACGTTAA GCATGAGGGT GGTTTAATGA GAATTAAGGC TACTCAACCA ATTAACCTAA GGGTAATGAA TAATGGATTC TCATTAACAA TTAACGTTAA TGATGAATTA AAGGTGGCTG CAGCTGGAGG TGGACAGGTG AATGATGTGG ATAAGGTGCT TAACGATGAA TCAATCATTG AGTCTAAGAG GAGACTATGG TTAAGCACCC TAATGAGCAA TATCAATGGG AGTGATTTAA TTAAACTATG CTGGTACGTG ATATTAACTA ATAGGTGCAG TGTACCTAAT CACCCGGCTT TAAGGAAGCC GTTCAACATG CCCAGTAAGT ACGTTTTCAG GCATCAATGG CTCTGGGACT CCTCATTCCA CTCAATAGTC CTAAGGCATT ACGACGTTAA CATGGCTATG GAGGAGTTGG AGAACCTAAT CCTGAATCAG AAGCCTGACG GTAGGATTCC GCACGAAATA TTCATGTCGA AGGAGAGCTG TAAATCCTTC TGGGGTATTG ATGACTACTC ACCGTGGACA ACCCAACCAC CTGTATTAGC GGTGGCCATT GATAAGGTTC TATCAGTGAG GTGGAATGAT GAATTCGCCG AGAAGGCCTT TAATGCATTA ACCAAGTATG ATGAATGGTT TAGGAGTCAA AGGGACAGGG ATTCAGATCA CTTATACGCC TACTTCGATC CACTGGAGAG TGGGTGGGAT AATAGTCCCA GGTGGGATGA GGCCATTAGG AGGTTTAGGG AGAATCCGCA GCGTTACGAG GTGTATGGGA AATTAACCAT GACTCCAGTT GAGGCTGTTG ACTTAAATAG CCTAATTTAC CTTCAGAGGA GGGTTATCGC TAAGTTGGCT GAAAGGCTTG GTGAGGTTAA TGTTGCCGAA CACTATGATG AGATGGCTGA TGAGACCGCT AAGGCGGTTA GGAGGATTAT GTGGAGTGAG AAGGATGGTT TCTTCTATGA CGTGTATGAG GAGGGCCATG AATTAATCAA GGTTAAGACA CCTGCAGCCT TCCTAACAAT GTTCACTGGA ATAGCTACGG GTGAGCAGGC TGAGAGACTG GTGGCGCATT TGCTTAATCC AAGGGAATTC TGGACTACAT TCCCGCTACC AAGCGTGAGC GCTGATGAAT CAACCTATGA TCCAACAGGC TACTGGAGGG GTAGGTCATG GATTAACCTA GTGTGGTTCA CGTACCATGG GTTGAGGAAT TACGGTTACT ATGAGGAGGC CTCCAGGTTA CTTAATAAAG TCCTTGAAGT AATGGGTAGG TCAATGACCT GTAATGAGAA TTACAATAGT AGCACCGGGG AACCAATGGG TGCCCCTGAC TTCGGCTGGA CCACACTGAT AATAGACATG GTTGCGAGTG AACTGGGTAA GGAGTCCCCT GGGGCTGCGT TTCATTATGG TACTTTACTT AGTTAG
|
Protein sequence | MRISNKLEIS SNITSIIMNN TGIGLRLFHG EYVEGELRGS GAALLRSYWD YGGIRRYWGN GAFEFTSIYG DLAVYVIRGY RGVIQGEFKV RGFSGVDELG DGSINVKHEG GLMRIKATQP INLRVMNNGF SLTINVNDEL KVAAAGGGQV NDVDKVLNDE SIIESKRRLW LSTLMSNING SDLIKLCWYV ILTNRCSVPN HPALRKPFNM PSKYVFRHQW LWDSSFHSIV LRHYDVNMAM EELENLILNQ KPDGRIPHEI FMSKESCKSF WGIDDYSPWT TQPPVLAVAI DKVLSVRWND EFAEKAFNAL TKYDEWFRSQ RDRDSDHLYA YFDPLESGWD NSPRWDEAIR RFRENPQRYE VYGKLTMTPV EAVDLNSLIY LQRRVIAKLA ERLGEVNVAE HYDEMADETA KAVRRIMWSE KDGFFYDVYE EGHELIKVKT PAAFLTMFTG IATGEQAERL VAHLLNPREF WTTFPLPSVS ADESTYDPTG YWRGRSWINL VWFTYHGLRN YGYYEEASRL LNKVLEVMGR SMTCNENYNS STGEPMGAPD FGWTTLIIDM VASELGKESP GAAFHYGTLL S
|
| |