Gene Information |
Locus tag | Cmaq_1499 |
Symbol | |
ID | 5709140 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 1578317 |
End bp | 1580131 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641276008 |
Product | X-Pro dipeptidyl-peptidase domain-containing protein |
Protein accession | YP_001541313 |
Protein GI | 159042061 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
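The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record above.
start_bp = 1578317
end_bp = 1580131

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length (1815 bp) and Protein Length (604 aa) fields.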
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000124334 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.885769 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGATTG TGTCTGTTTT CAAGGCTGAT GAAATAAAAA TTGCTCCGAA TCAATTCAGC CTAGAACCCG AAAAGTTGCC GCCTAACGCG CAGATAAAGG GTAGACTTTT CGATGGTTAC CCCGTGATAG TTAAAGAGGA GGTTTCTCCG CCAAAATACA GGATGAAGAT TCTGAAGGAT ATTATGGTAA AGATGCGGGA TGGGGTGCAT CTTGCTGTCG ATATCTATCT GCCGGACGCG GAGGGCGAGA AGTTTCCTTG CTTAGTTGCA TGGGGAATGT GGGGTAAGGA CAACCAGGAA ACAGTGCTAT GGCTTAAAGA CCTTCCTCAA CCATATTACA CGAGTCCTTG GTGGGATGGA AGCCTTGAGG CAGGAGACAT AGAGTACTTC GTCTCCAGGG GGTATGTTTA CGTGATACCG GATCCAAGAG GGATAGGAAA ATCTGAAGGG GGGCCTCCGC GTACTCTGAT AGACCTCCAC AAGCCTGAGG ATATTTACGA CCTCATAGAA TGGGTTGCAC AGCAGCCTTG GTGCAACGGG AAGGTGGGGA TGATTGGCCC CAGCTCCTAC TCCCTTTCAC AATACATGAT TGCAACCAAT AATCCCCCAC CGCATTTAGT TGCACTGTTC CCGATTGGTT CATTCTATCC TCCTGCGGAT CCATTTACAG GGATGATAGA CCTTGCTCTA GCGGGCATAT TCCATGGTGG CCATATACAC GATAGCTCGT TGCCGGTACG CCAATGGGGT CCACCGATGT CTCCAGAAAT ATTGCCCAAG GATGAATTTG AAAAGAGACT TAAGGAGTTA CTGGAGCACC CTGACATTAA GTTCCATCCC AAGGTTAGAT CATCACTGGT TTATCCGAGA GAACCCATCT TGTTTGATTA CCTGATGTCA GCTTTTCATC CAACGCCTGT TAATGATAAT CTTGATAAGG TAACACTCCC AATATACATC GGTGTTCCTG CCCCAGGGGG TGGAGGGGCA CGTGTGTATT GGTCTGGATT TGAGGCTTAC AATAAGGTCC GTTCAAAATA CAAGAAATTC CTCATATTCA TCCCTGGTGA GTTCCCAAGA CCGTTCGTAC ATATGCAATA CGAGATGATA AGGTGGTTTG ACTACTGGTT GAAGGGAATA GACAACGGCA TCATGGATGA GCCACCGGTG AAGATCTTCA TGGGTGGGGT GAATAAGTGG AAGTTTGAGG ATGATTGGCC ACCGAAGGAT ATTAAGTGGA TTAACCTCTA CTTAAGGAAG GGGAATAAAT TATCCACTAT CCCTGAGAGT GATTCAAGAC CTGACGTGTT GTATCAACCT ATGCCCCTCA AGGACCCCAC AGTCTACTCA CTAAACTACT ACACGGATCC ATTCACCGAG GACACTGAGA TAGTAGGACC AACCGCCTTA CATTTAGAGG CGACTATTGA TCAAGATGAC GTAAACTGGA TGATAACGGT AGTGGATGTA AGCCCAGACG GTAGCAAGCA ATTAATGACA GAGGGCTGGC TCAGGGGTTC CTTTAGAGCT ATTGATGAAA ATAAGTCAAA GCCATGGGCT CCAGTGCACA AGGTCCAGGA TCCAGTCCCT GTACCGAAGG GAGAGAAGGT GAAGTACGAC ATCAACTTAA TGCCGATAAC ATGGGTCATC CAGAAGGGGC ACAGGATAGG TGTCATAATA AGGACCCAGG ATGATATGTA TAGCCGTCTT GCAATTGGTG GCGTATACTT CCTACCAAGA ATGGTGGATA CGGTAGTCAA TCTGCATCTG GGACCCAATA GCTACATCGT CCTACCTGTA 
AGGAGCAAGG AATAA
|
Protein sequence | MSIVSVFKAD EIKIAPNQFS LEPEKLPPNA QIKGRLFDGY PVIVKEEVSP PKYRMKILKD IMVKMRDGVH LAVDIYLPDA EGEKFPCLVA WGMWGKDNQE TVLWLKDLPQ PYYTSPWWDG SLEAGDIEYF VSRGYVYVIP DPRGIGKSEG GPPRTLIDLH KPEDIYDLIE WVAQQPWCNG KVGMIGPSSY SLSQYMIATN NPPPHLVALF PIGSFYPPAD PFTGMIDLAL AGIFHGGHIH DSSLPVRQWG PPMSPEILPK DEFEKRLKEL LEHPDIKFHP KVRSSLVYPR EPILFDYLMS AFHPTPVNDN LDKVTLPIYI GVPAPGGGGA RVYWSGFEAY NKVRSKYKKF LIFIPGEFPR PFVHMQYEMI RWFDYWLKGI DNGIMDEPPV KIFMGGVNKW KFEDDWPPKD IKWINLYLRK GNKLSTIPES DSRPDVLYQP MPLKDPTVYS LNYYTDPFTE DTEIVGPTAL HLEATIDQDD VNWMITVVDV SPDGSKQLMT EGWLRGSFRA IDENKSKPWA PVHKVQDPVP VPKGEKVKYD INLMPITWVI QKGHRIGVII RTQDDMYSRL AIGGVYFLPR MVDTVVNLHL GPNSYIVLPV RSKE
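The relationship between the gene sequence and the protein sequence can be checked with a small translation sketch. It assumes that the gene sequence is listed in coding orientation (it begins with ATG even though the Strand field is "-"), and that the amino acid assignments of translation table 11 match the standard genetic code, with the two tables differing only in which codons may act as starts. The function `translate` and the constant names are hypothetical helpers written for this illustration. Only the first three blocks of the gene sequence and its final fifteen bases are used.

```python
from itertools import product

# Compact codon table: bases in TCAG order, third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 15 bases of the gene sequence above.
first = translate("ATGTCGATTG TGTCTGTTTT CAAGGCTGAT")
last = translate("AGGAGCAAGG AATAA")
print(first, last)
```

The two fragments translate to MSIVSVFKAD and RSKE, which match the first ten and last four residues of the protein sequence listed above.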