Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_1638 |
Symbol | |
ID | 5709339 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 1713235 |
End bp | 1715220 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641276146 |
Product | hypothetical protein |
Protein accession | YP_001541451 |
Protein GI | 159042199 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4354] Predicted bile acid beta-glucosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0737981 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATAGTGA GCGGTAGGAG TTTCAACAGT GGATTACCCC TAGGTGGAAT TGGTGCTGGT GCAATAGAAT TCTTCCCAGA TCTCACAATA GGCAATGTGA CTATAATGAA TAATTGGTTA AACCCACTTA AGGTGGTTAG GGGATTCCAC GTAGTACTAC TTAATGAGGA ACCACTCTTC CTTCAAACCA ACCCAGGTAA GAATATTGAG GTTAAGCCGC AGTATAAGCA TGTTGACACC ATTGAGGTTG ATGCCCAGTA TCCTAGAATC AAGTACAGGT TCAGTAATTT ACCGATTAAG GAGATCGAGT TCTATACACC CATTGTTAAG GGTAATCTTA AGGACTCATC ATTACCATTA ATAATAATTA GAGTTAAGGG AAATGGCACT ATTGCATTCT CATTCCCAAA CATAGTGGGT AGTAGGAGGT GGGGTCGCGT CAACTACTCC ATAACAGGTA AGGTTAATGG TGTATTATTC AGGAATCTAA GGTCACTGCA AAGTGACCCA GCTTACGGCG AAGTCTTCAT AGGTTGTGAA AAATGCCACA CATACTCCGG CTACTCATAC TGGGTACCCA CTAGGGGTGG TATGACTGAG GATATCTCAA TATTCAGTAA ACTCAGTGAA GTAGCTGATG AGGGTAGGTA CTCCATAAGG CCATATGCCA GGGAGGAGAT TGCAGGGATA GTGTGGAGGA GGGTTGATGA TGAGGCGTTA TTCTTCATTA CCTGGTTTTT TAATTCAAGA CCATACCATT ACCCATACGG CCACTACTAC GAGAATTTCT TCAACTCAGC AGTTGAGGTG GCTGAATACG CCTTAGAGAA TACCGGTTCA TTAAGTCCCC TTAATATTGA TGCTGATGGT TGGTTAAGGG ATGCTGTATT AAATAGCCTA TACGTGTTGA CTTCATCAAC GTGGTTAACT AAGGACGGTA GATTAGCTGT TTATGAATCA TTATCAATAG CACCATTAAT GAGTACCATA GGGTCAATGA CCTGGGATGG ATTATCCTTC GCCCTACTTG ACCTTTTCCC CGACTTAACT GTTAAAATGG ATGAATTACT GGGATTCTAC ATACATAATG GTGAGGTTCC TCATGATCTT GGGGAGGAGA GTATTGAGGA TCCAATATAC GGTGCCTCAT ACTTATATCC GTGGAATGAT TTAGGGAGTA CTTGGATTCT AATGATTTAT AGGGATTATC TCCTAACGGG TAATGTTGAG GTTTTAAGGA GGAATATTGA TAAGATGCGT GAGGTTATTG ACTGGCTCAT CAGTAGGGAT TATGACGGTG ACTGCATACC TGACTCAAGG GGTGGGTTTG ATAATTCCTA TGATGGAACA AACATGTATG GGGCATCCTC ATACATAGCC TCCCTATTCC TATGCTCACT TCAAGCATTT ATTAAGTCGG CTGAGGTACT TGGTGTAAGG CTCAGTGACC GTTATGAGTC ATGTTTAAGT AAAGGCAGGG AGACGTTGAA TTCACTGTGG AATGGCCGCT ACTTCATGGC ATGGAAATCA AGTGGCAATA GTAATGAGTC ATGCATGAAT AGTCAACTCC TTGGGCAATT CTGGTGCGAT TTCCTTAAAC TACCACCGGT GGTTGATGAG GATAAGATTA AGGTAGCCTT AAGGTCAATA TACGAGCTTA ACCACAAGTC ATCACCCCAC TGTCTCCCCA ATTCAGTTAA GCCAAGTGGG GAGATTGATA CTTCATCAGG GCAAATGAGG TCCTGCTGGC CTAGGGTAAG CTTCGTTGTG ACTGCCCACA TGGTGCTTAG GGGTATGGTT AATGAGGGTC TTGAGATTGC TAAGAAGGAG TGGGATACTA TATCAAGGTT AGAGCCTTGG AATCAAAGCT CAAGAATAGA CGCCATCGAG GGCAGGAACG TGGGCTTAGA CCACTACATA GGTAGCGCCT CCCCATACAT ACTATACTTA GCCCTTAGAA GTAGTAAGGT AGAACACTAC TCTTAA
|
Protein sequence | MIVSGRSFNS GLPLGGIGAG AIEFFPDLTI GNVTIMNNWL NPLKVVRGFH VVLLNEEPLF LQTNPGKNIE VKPQYKHVDT IEVDAQYPRI KYRFSNLPIK EIEFYTPIVK GNLKDSSLPL IIIRVKGNGT IAFSFPNIVG SRRWGRVNYS ITGKVNGVLF RNLRSLQSDP AYGEVFIGCE KCHTYSGYSY WVPTRGGMTE DISIFSKLSE VADEGRYSIR PYAREEIAGI VWRRVDDEAL FFITWFFNSR PYHYPYGHYY ENFFNSAVEV AEYALENTGS LSPLNIDADG WLRDAVLNSL YVLTSSTWLT KDGRLAVYES LSIAPLMSTI GSMTWDGLSF ALLDLFPDLT VKMDELLGFY IHNGEVPHDL GEESIEDPIY GASYLYPWND LGSTWILMIY RDYLLTGNVE VLRRNIDKMR EVIDWLISRD YDGDCIPDSR GGFDNSYDGT NMYGASSYIA SLFLCSLQAF IKSAEVLGVR LSDRYESCLS KGRETLNSLW NGRYFMAWKS SGNSNESCMN SQLLGQFWCD FLKLPPVVDE DKIKVALRSI YELNHKSSPH CLPNSVKPSG EIDTSSGQMR SCWPRVSFVV TAHMVLRGMV NEGLEIAKKE WDTISRLEPW NQSSRIDAIE GRNVGLDHYI GSASPYILYL ALRSSKVEHY S
|
| |