Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_0875 |
Symbol | |
ID | 5708730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 918858 |
End bp | 921182 |
Gene Length | 2325 bp |
Protein Length | 774 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 641275378 |
Product | extracellular solute-binding protein |
Protein accession | YP_001540700 |
Protein GI | 159041448 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACT TAAATATACA TTGCAATAAA TGTTATGCTA TGGGTAAAAA CACACCCCTA ACCCTAGTAG TAGCAGCAGT AATAGCCATA AGCATCGCCG TAGTAAACAT AGCCTACGCA GCTTCAAATA CGGTAACAAT AGTTACTACA GAGGGTACGC TCATTTACAC TCCTGGTCAA CCTATCTGGA ATCCATACGC ACCCAACAAC TTCATAGGCG TTGTGGGGAC ATACATGTCT CTCGCATTCT ACAATCCACT TACAGCTAAA TTCTATCCTG TTTTGGCTGA GAATTGGACT GTTGAGGTTT TGCCTAATGG TAGTGGTATT TTAACTATCT ATCTTAGGCA TAACTTGTAT TGGTTTAATG GTTCAGCGGT AATGCCCTTC ACTGCTTGGG ATGTTTACGC TGAATTCTAC ATTGGTGTTA AGGTGTTCAG CTGGTACTAC CCATTCATGC AACCGCAATA CGCCGATGAA GACATTAGAG TATTGAATAA TTACACAATA CAATTCCTAT TCCAAGAGTG GAGCCCACAG CAAATATACT ACGTGTTAAC TACCTGGATT GATACACCAT ACGCCGTATG GAAACCAATA GTGGATAAAC TCAAAACAAT GAGCGTCAGT CAAGCAGCAG CATACGGTAA TAATGTCACT AAGTTTGCTC CGCCATACTG GGGTTTAAGC CCATATTATG TTCAATTATC AACATTGAGC GCAACCTACC AGACACTGTT ACTTGAACCA ATGTACTTCA ATGGTGTCCC ATTACTGGCA TCATGGGATA AAATATTCCC ATTCAATGAT TGGAGCATCT ACCCGAAGTA CGTTATATGG GCGGTTGGTG GTAATACTCA AGCAATGACT GCCCTTCTGG CTCGTAAGGT TAATTTAGCG TTCATTGGCT TGTCACTGCA GCAGGAGGCT ACGATAAATG CTAGTGGCCT CGGTGAATAT AATGGGCCGA ATTATGCGAC TAATGGCTAC ACGCTTAATC CAAACATTTA CCCATTCAAT ATTCCTCAGG TTAGGCAGGC ATTCTGCTAC ATTATTAATA GAACGGCTGA ATCATTGGCC TGGGGTGGAT TATATGCACC TGACCCATAC CCAGTCCCTG TAGCCAACTT CCAGCCATCA ATACAATCAT ACCCATCAAG CGTGTGGAGT ATTGTAACTG TGCATTGTAC AACAAATTGG ACTAAGGCAG CTCAATTACT GGAATCAGCC GGTTTAACCT ATAAGAATGG TCAATGGTAT CTGCCTAATG GGACTCCATT TAAGTTAACT TTAATAGTGC CATCAGGTTT CACTGACTGG GCTACATTCT CATCAGCTGC CGCTGTCTCA ATAAGCCAGT TCGGTATACC TACAACATTA TTAGCCTTGG ATACCTCAAC ATACTGGAGT ACAATATTCC CTAATGGTGA GTATGAAATG GCTATGACAT GGATGACGTG GTCAAGGGGT TACGGTGACT TCAGTTTCCT AGCATCACCA TGGTGGACTT TCCCAGCATT CAATCTCTCC AAGGCTTGGC CATTCCAGTG GCCTAATGGG ACATGCACAC CAGTAACAGC ACCTAGCATA CCAGGCTTTA CTCCACCTAA TAGTACGATA GTATGGTGTG TTAATTCAAC ATTCGGCTAC ATTAACTTAA CCAACTGGGG CGTAGTATTT GCGGCCACTG TCCCAGGGAT GCCTCAGTAT GATGAGCTTC TGAAGGTATT ATTCGCCTGG TATGAATACT ACATGCCAGT GATACCAAAC TTAACCCAGA ACATGCAGTT CAACTATAAT CCAGCTGACT TTGATATTAT ATGGTGGATT AAAGGCTTAC CACCATCCTC ATGGTTACTC TTTACACAGG GAGCATGGGG TGAAACACTA TCCGTAATGC TTTGGAATAT TGGATTCGGC GCATTAGCAC CACCAGGTGT AGTACCACCG TTGGCGCAGG CTGCCGTCAA TGGTTCACTA TGGAGAATTG ACCCGCAGTT AGCTGCATTC GCAGGATGGA GCCCAAGTAC AATGAACCTA GAAGCCATAG CCTCATACTT CCATATACCG TATACTCCAG TGACTACTAC TTCAACTACT ACTACCACTT CAACTACAAC TGTTACTTCA ACTGCCGTTG CTACTGTGAC TTCAACAGTC ACAACCACTG CAGTTAGCAC TGTGACTAGC ACAGCAACAA CCACAGCAGT AAGCACAGTA ACCGTGACCA AACCAGTGGT AAGCACAGCA TTAATAGCAG GAATAGTAAT CATAGTAGTA GTTATTGCCA TTGTTGCAGC AATAATAGCA TTAAGAAGAA GATGA
|
Protein sequence | MKNLNIHCNK CYAMGKNTPL TLVVAAVIAI SIAVVNIAYA ASNTVTIVTT EGTLIYTPGQ PIWNPYAPNN FIGVVGTYMS LAFYNPLTAK FYPVLAENWT VEVLPNGSGI LTIYLRHNLY WFNGSAVMPF TAWDVYAEFY IGVKVFSWYY PFMQPQYADE DIRVLNNYTI QFLFQEWSPQ QIYYVLTTWI DTPYAVWKPI VDKLKTMSVS QAAAYGNNVT KFAPPYWGLS PYYVQLSTLS ATYQTLLLEP MYFNGVPLLA SWDKIFPFND WSIYPKYVIW AVGGNTQAMT ALLARKVNLA FIGLSLQQEA TINASGLGEY NGPNYATNGY TLNPNIYPFN IPQVRQAFCY IINRTAESLA WGGLYAPDPY PVPVANFQPS IQSYPSSVWS IVTVHCTTNW TKAAQLLESA GLTYKNGQWY LPNGTPFKLT LIVPSGFTDW ATFSSAAAVS ISQFGIPTTL LALDTSTYWS TIFPNGEYEM AMTWMTWSRG YGDFSFLASP WWTFPAFNLS KAWPFQWPNG TCTPVTAPSI PGFTPPNSTI VWCVNSTFGY INLTNWGVVF AATVPGMPQY DELLKVLFAW YEYYMPVIPN LTQNMQFNYN PADFDIIWWI KGLPPSSWLL FTQGAWGETL SVMLWNIGFG ALAPPGVVPP LAQAAVNGSL WRIDPQLAAF AGWSPSTMNL EAIASYFHIP YTPVTTTSTT TTTSTTTVTS TAVATVTSTV TTTAVSTVTS TATTTAVSTV TVTKPVVSTA LIAGIVIIVV VIAIVAAIIA LRRR
|
| |