Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_1274 |
Symbol | |
ID | 5708674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | - |
Start bp | 1341634 |
End bp | 1344018 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641275780 |
Product | extracellular solute-binding protein |
Protein accession | YP_001541091 |
Protein GI | 159041839 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000000437479 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGATCTGT ATACCTCAAG GAGTGCAACA GCAGTCACCT CAATAATAGT TGCCTTAGTT GCCGCTGTGG TTGGGTTATA TGCCTCTCAT GTTGTATATG CTCAAGAATC CCAAGTAACG TACACGTTCA CCACCGCCTA TTTCGGGTGG ACGTGGTCCC CTGCAGCCCA CTACTGGAAT CCATTCGCCC CCATTAATTA CATAGATTGG CCAGCCTTTG TAGCAATGCC TCTTGCCGCC TATGATGACC CTACTGGTCA GTGGTGGCCT ATTCTAGCCA GTAATTGGAC TGCGTTTCCT CAAAATAAGA CTGTAATCAT CTACCTTAGA CATAACATCT ATTGGTTTAA TGGTTCAGCG GTAATGCCCT TCACTGCTTG GGATGTTTAC GCTGAACTCT ACATTGGTGT TAAGGCATTC AGCTGGTACT ACCCATACTT AACACCCCAG AATGCGAGTG AGGAGATTAG GGTGCTTAAC AATTATACGT TGGAGATAGT TTTCAACATA TGGTCACCCA CAGAGTACTA TTGGATACTC ATGCAGACCA TAGCAACTCC ATGGCCCGTG TGGAAGCCTA TTGTTGAGAA GCTTCAAACA ATGAATGCCA GTCAAGCCTA CACCTTTGGT CAAGTTAACA TAACCGAGTT CAACCCACCC ATGTGGAGTA ATGGGCCATA CTACGTGGCA TCAATTGGAC CAACCTACAT AACTCAGAAC CTTGACCCAA TGTACTTTGA TGGTAAGCCG TTGCTGGCTG AGTGGGATAA GATACTACCA TTCCACACAT GGCAGTACTA CCCAACATTC ATTGCATGGA ACAACCCAGG TGCATCAACT ATCTTAGCCG CAATAGCAGC CCAGAAGCCA GTTTACATTG AGTGGATAGC CTTCAGTCTT CTTAAGGATT TACAGATAAT AAATAGTACA CCTGGCTTCA AATACTATGT AATGCCCGAC TTATCAATAT TCGGCATAGG TATACCAACA TATTACCCGT TTAATATTCC TCAGGTTAGG CAAGCATTCC TATACATCAT TAATAGGAGT GAGGCCGCTG CCGCCTGGGG ACCACCGTGG TTAACGTACC CAGTTTACAT TAATGTTCCA GCACCAGCAC CAACAGCTGC CTCAGGGCTT TGGTTAACAT TCCCGAAGGA TCTTAGAAGC ATAGCCGTTA ACTTCACTGA ACCCAATTGG ACTAAGGCAG CTCAACTACT GGAATCAGCC GGCTTAAAGT ACAAGAATGG TCAATGGTAC CTACCCAACG GAACACCACT AACACTAACA ATATACGCAT CAGCACCAAT GGTTAACTGG ATCACGCAGG CTCAAGTCGC TTTTAACCAA CTTGAAGAAT TTGGAATACC CGTTAAATTA ATTACACTGG AATCATCAAC ATACTCCACT GAAGTATCTC AATGCCAACT ACCTGCTGTG GTTGACTGGA TGTTCGCCGG TTCTAATAAG GGTGGTTACT CAACACTATG GATATCATAC GATGTTGCAT TCACAATGAC ACATCCAGCG CTTTTGCCAT CTGGCTGGTG TATTCCTGGC CACACTACGC CTTTCGCCTA CCCGATTGTT CAGAATAATG AGATTACCGG CTGGTATTGT AAACCATTAA CCACTAACCT GCCTATACCG AATAATACAA TTATCTGGTG CATTAACTCA ACCTATGGCT ACATTAACTT GAGTAATTGG CAGAATGCCA TAATAGCGGC TGAGCCAGGC AGCAGTACCT ATGAGGAGCT TCTTAAGGCC TACTTCTCAT GGTTTGAGTA CTGGGTACCT GGAGTGGAAA TATCAACAGC CACAATAACT GCAGCATTCC CAGTAAAGAT AACTAACCCA ATGTGGGCTT ATGAATGCAT GAACTTCAAG AATCCTAAGT ATACTAAGGC AGCATATTCA CTATTTCACC AATATGCAGT TGCTGGATTA CCTCCCGAAT TCAATACAGT ATTATCACTA GGCGCCTACG CACCGCAGGG TGTTATTCCT CCTTTGGCTG AGGCTATTAT TAATGGTAGT CTTTGGACTA ATCCGTATCT GCATCAGTAC GCTGTCTTCA TTGGTTTGCC TAATCCTGAT CCTCAGTTGC AGGCTTGTGT AGCATCATAC TTCCACACAA CGTACACACC AGTTACTACA ACCACTACTT CAACTGTTAC CTCAACCACT ACGGCTGTTA GCACTGTTAC AAGCACAGTC ACAACAACTG CAGTTAGCAC TGTGACTAGC ACAGCAACAA CCACAGCCAT ATCAACCGTA ACCGTGACTA AACCAGTAAT ATCAACAGCA CTAATAGCAG GAATAGTAAT AATAGTAGTG GTTATTGCCA TTGTTGCAGC AATAATAGCG TTGAGGAGAA GATAA
|
Protein sequence | MDLYTSRSAT AVTSIIVALV AAVVGLYASH VVYAQESQVT YTFTTAYFGW TWSPAAHYWN PFAPINYIDW PAFVAMPLAA YDDPTGQWWP ILASNWTAFP QNKTVIIYLR HNIYWFNGSA VMPFTAWDVY AELYIGVKAF SWYYPYLTPQ NASEEIRVLN NYTLEIVFNI WSPTEYYWIL MQTIATPWPV WKPIVEKLQT MNASQAYTFG QVNITEFNPP MWSNGPYYVA SIGPTYITQN LDPMYFDGKP LLAEWDKILP FHTWQYYPTF IAWNNPGAST ILAAIAAQKP VYIEWIAFSL LKDLQIINST PGFKYYVMPD LSIFGIGIPT YYPFNIPQVR QAFLYIINRS EAAAAWGPPW LTYPVYINVP APAPTAASGL WLTFPKDLRS IAVNFTEPNW TKAAQLLESA GLKYKNGQWY LPNGTPLTLT IYASAPMVNW ITQAQVAFNQ LEEFGIPVKL ITLESSTYST EVSQCQLPAV VDWMFAGSNK GGYSTLWISY DVAFTMTHPA LLPSGWCIPG HTTPFAYPIV QNNEITGWYC KPLTTNLPIP NNTIIWCINS TYGYINLSNW QNAIIAAEPG SSTYEELLKA YFSWFEYWVP GVEISTATIT AAFPVKITNP MWAYECMNFK NPKYTKAAYS LFHQYAVAGL PPEFNTVLSL GAYAPQGVIP PLAEAIINGS LWTNPYLHQY AVFIGLPNPD PQLQACVASY FHTTYTPVTT TTTSTVTSTT TAVSTVTSTV TTTAVSTVTS TATTTAISTV TVTKPVISTA LIAGIVIIVV VIAIVAAIIA LRRR
|
| |