Gene Information |
Locus tag | Cmaq_1494 |
Symbol | |
ID | 5709123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 1571708 |
End bp | 1573858 |
Gene Length | 2151 bp |
Protein Length | 716 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641276003 |
Product | extracellular solute-binding protein |
Protein accession | YP_001541308 |
Protein GI | 159042056 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
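The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(1571708, 1573858)
print(length, protein_length(length))  # prints "2151 716"
```

Under those assumptions the results agree with the Gene Length (2151 bp) and Protein Length (716 aa) values listed in the record.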
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.580792 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAAGG GTTTAATCAC TAAGTTACTG ATCCTTAGTA TATCAGTACT GGCATTGGTT ACAGTATTCT CAAGCTACGT ACCATCATTA ACCATGGCTC AAACATACAA CATCACACCA TATAATACAA TATACATGTT CGTTACTAGG AGTCCACCGG CAACTGGTTG GAGTACTTAT AATCCTAATA TTTTTCAAGG TGGCATATGG TGGTATGGTG TTTACCAGCA GTTTTTGGCT GCGGTTAATA TAACTACTGG TGAATTTGTA CCTCTTCTTG CTGATAATTG GACGATGACA GTGCTACCTA ATGGTACCTT AGAAGTCTTA GTGCATCTTA GGCATAGTGG GTGGGATAAT GGTATTCCAG TAACCTGCTG GGATGTTTGG GCAACTAATA TGCTTCTTGG CCTTGTATCA GCAATGGTTG GTAATGTTAC CGTGTTTAAT AATTACACGT GCGCATTCAT GATACCTAAG GGTTACTATG TACCATTAAC CCTACCTGGC GCCCATGAGG CTAACTCATT CATAGCCCTT GAGTGGGGTC AAGGTATTGC CCTTGATTGG CAGGACAGCT ACGTATGGGA ACCAATAATA AAGACCGCGG CTGCAAACTA TAGCTGGATT TGGTTGTACA TGTTCGGTAA CTCAACCCAG AAGGGTGAGG CTGCTAAGGT CCTCTCAACA TTAATCCATG ATTACTTAAC CTACAAGCCA CCCTACACCA CTGGTTACTC AAATGGACCA TACTACCTAT GCGATATTAC GCCTGAATAC TTCCTACTCT GCAAGAACCC ATACTACTAT ACTGTTAATG AATTCAAGCC CGATTACATT GTTGAATGGC AGTACTCCTC AATGTCTCAA GTTTACGCAG CCTTAGCCAC AGGTAAGATT AGCCTATGGT CGACTTGGAT CGGCTCAGTA TCACCAACAG TAATACCAAC AATAACCAGT AACCCGTACA TTAAGGCATT ATCATTCCCA GCCTTCGGTG GTGATGCCTT GTATTTCAAC TTCCTTAATC CTTGGTTGGC TATGCCTCAG GTTAGGCAGG CTATTTACTA TGCTGTTAAT TGGACTCAAT TAGCCCAAGC AGCATACGGT GTAGGCTACA CGTACCCATC ACCAATGCCT CAAGATGGAT TAATGCCCTA CTACACTAAT TGGCAAAGCA TGATAACCAG TTACTTCGCA TCACTAGGCC CACAATGGAC TATGGTAAAT TACACTTATA ATGCTTCATT GGCTACTAGC CTACTTGAGA GTGTTGGCTT CACTAAGAAG AATGGTGTAT GGTATACACC CAACGGCTCA GAATTCACAT TAACACTATA CATCGGCTCA AACGCACCAC CAGCGCAGTT AACACTGGCA ACAGAAATAG CTAATGCATT AACAAGCTTC GGAATACCAA CAATAGTAAT AACATATCCC TCAGCTGAGG GGCTTACAAT AATTAAGCAG GGTAAGTACG ACTTATTATT CTCATACTAC AGTGACATAT ATAGGCCAGG CGTACCATAC TACTTCCCTG AGGCATTCTA CTTCCTAGGT TACCCATTCA ACTACAGTCA CTGGAATGGT ATCGTAACTC TACCTAATGG AACCTCATTC AGCGCACCTG AATGTGGTAC CATGTTATCG TTGAATTGCA TACTCAGGGT TGCCTGGGCC ATAAATCATG ATCCATGGTA TATTCAGATT GATTGGAATA GTGGTATAGT ATTCCTCAAT ACCCAGTACA TCAACTGGCC CATTAATGAC ACGTCAATTT GGATCGGCAC ACTACAACAC 
ACTACACCAG CATGGACAGT GTTACTGACT CACATATCCT TTAAGCCACC AGTTACTACT ACTTCGACTA CTACTCCTGT TTCCACTGTT ACTTCTACTG CTGTGGTTAC TTCTACGACT ACTGTTGTTA CTTCGACTAC TGTGGTTAGT GGTACTACTA CGACTTACAC AACCACAAGC ACTGTACCAG TAACAGCAAC AGTAACATCA ACAATACCGG TAACAACCAC GGCAGTATCA ACAGTAACAG TCACTAAACC AGTAATAAGC ACAACACTAA TAATAGGCAT AATAATCATA GTAATCATCA TAGTAGCAGC AGTAGCAGCA ATAGCATTAA GAAGAAGATA A
|
Protein sequence | MSKGLITKLL ILSISVLALV TVFSSYVPSL TMAQTYNITP YNTIYMFVTR SPPATGWSTY NPNIFQGGIW WYGVYQQFLA AVNITTGEFV PLLADNWTMT VLPNGTLEVL VHLRHSGWDN GIPVTCWDVW ATNMLLGLVS AMVGNVTVFN NYTCAFMIPK GYYVPLTLPG AHEANSFIAL EWGQGIALDW QDSYVWEPII KTAAANYSWI WLYMFGNSTQ KGEAAKVLST LIHDYLTYKP PYTTGYSNGP YYLCDITPEY FLLCKNPYYY TVNEFKPDYI VEWQYSSMSQ VYAALATGKI SLWSTWIGSV SPTVIPTITS NPYIKALSFP AFGGDALYFN FLNPWLAMPQ VRQAIYYAVN WTQLAQAAYG VGYTYPSPMP QDGLMPYYTN WQSMITSYFA SLGPQWTMVN YTYNASLATS LLESVGFTKK NGVWYTPNGS EFTLTLYIGS NAPPAQLTLA TEIANALTSF GIPTIVITYP SAEGLTIIKQ GKYDLLFSYY SDIYRPGVPY YFPEAFYFLG YPFNYSHWNG IVTLPNGTSF SAPECGTMLS LNCILRVAWA INHDPWYIQI DWNSGIVFLN TQYINWPIND TSIWIGTLQH TTPAWTVLLT HISFKPPVTT TSTTTPVSTV TSTAVVTSTT TVVTSTTVVS GTTTTYTTTS TVPVTATVTS TIPVTTTAVS TVTVTKPVIS TTLIIGIIII VIIIVAAVAA IALRRR
|
| |
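The sequences above are shown in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the displayed gene sequence could be normalized and its GC fraction computed for comparison with the GC content field. The helper names are hypothetical, and how the database rounds the reported percentage is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    seq = clean_sequence(raw)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same block layout as the record, not the real gene sequence.
example = "ATGC GGCC"
print(clean_sequence(example))  # prints "ATGCGGCC"
print(gc_fraction(example))     # prints 0.75
```

Passing the full Gene sequence field (with its line break) to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the listed GC content.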