Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cmaq_0033 |
Symbol | |
ID | 5708893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caldivirga maquilingensis IC-167 |
Kingdom | Archaea |
Replicon accession | NC_009954 |
Strand | + |
Start bp | 47839 |
End bp | 50184 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 641274536 |
Product | extracellular solute-binding protein |
Protein accession | YP_001539877 |
Protein GI | 159040625 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.649183 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTATA TAAGTAAATC AACAATACTA ATATCAGTAA TGGTATTAGC CATCATTACG ATTATTGCAC TACCCCAACA AACTCAGAAG CCTCAAATAA TATGGGCTGA TTACGGTACA ATAACCACAC TAACCGGCCC AATCTATAAT CCATTTTACC CAAATACATT AGCCACAGAT ACAGTAACCA GTATAATATC CTATGCGCCA CTGGCGTTAT ATAATCCATT CAATAATGTA TTTTACCCGG TCTTAGCCAG TAACTGGACT ATTCAAGTTC TCCCTAATGG TAGTGGTATT TTAACTGTTT ACCTTAGGAA GGGGCTGTAT TGGTTTAATG GTTCAGCTAC AATGCTCTTC ACTGCATGGG ATGTTTACGC ATACTTCTAC ATTGAGGATA AGGCCTTTGA GGCATATGCC CCATTCATGC AGCCACAATA CGCTGATGAG AGTATAAGGG TACTTGATAA TTATACTATT CAATTCCTAT TCCAAATATG GAGCCCAACC GAGTGGATAT ACTTCATTAC CTCAAGCATT GCGACTCCTT GGCCTGTTTG GAAACCCATT GTTAATGAGT TGAAGACTAT GAATGCCTCC CAAGCCTTAG CCTTCTCAAC CAATGTAACT AGGTACGTGG TGCCTTACTG GGGTATTTTC CCATATTACT TAACCTACAT AAGTTCATCA AGCATAGAAC TAACCCTGGA GCCAAGCCCA TTGCTCAACC AGTGGTATAC CGTATTTCCG CTTGCAGACT GGAATTACTA TGACCCAACC TTCGAGGAAT TCTTTACTGG GAGCCAGTAC ATAGCATCAC TGGTGTCGCA TAAGGCCACT TGGGCTGGCG GCGCGGCAGG CATTAAGCAG GTAGCCTTAC TTAATAGCAG TGGCTTCAGT GCATACTTCG CCCCAGACTT ATCAGGCTGG GGAATCACGT TCAATCCACA TGTGTATCCA TTCAACATAA CGCTGATTAG GGAGGCCCTA TGCCTCATAT TTAATAGAAC AGCCGTTGTA GCTGCCTGGG GACTTAACTA CCCTAACTAC TACTCTCAGC CAATAGCCCC TGAAACTATT AGTTCTTATC CGCCAAGTGT TAGGCAGTTC ATTATACCGT GCTCCTATGA TCCAGCTAAG GCGGCTCAAA TGCTGCAGAG CCTAGGATTC AAGAAGATTA ACGGATACTG GTATTTGCCT AATGGCTCCA TGTTCTCAAT ATACGTACTG GCACCCAGTG GTTGGATTGA TTGGGATACC ATGGCTTCTG AGGCTATTGA GGAGATGCAG GCATTCGGCA TAAACGCTAA GTTAATCACA ATGGATGCTG GAGCCTACTG GGGTACCATG ATACCTGATG GTGATTACGT CGCTGCATTA ACATGGACCA CCGCATTCAC TCCAGCCTAC TATAGTGCGT GGGAGGCGTT GAGTAATCCA TGGTGGGCCT TCGGTAGTGC AATATCTGCT TATACTCCTG GTAGTGAGGT TTGGCCATTC CAGTGGCCTA ATGGGACATG CACACCAGTC ACTGCACCAG CATCATTGAA ACTACCCAAT GGCACAATAG TATGGTGCAT CAACTCAACA TACGGCTACA TAAACCTAAG CAACTGGCAA ACATTCTTCA ATGATGCAAC TCCAGGAACA GCGGACTACA ATCTAGCACT AGACACAATA TTCGCATGGT ATAGTTACTT CACTCCAATA GTGCCGTTAG CAGCCAAGAT AGACCCATTC ACGTACTTAA CACCAATAGC TGATCCGAAT TGGCTATACC TCTGTCTACC TAATGAGACA ACGTGGTTCC TTGTATCTGC AAACTGGTAC ACCTACGGCT CATTAATAAT GCTAATGTTT GGTGCTGTTG CACCAAGAGG CGTAGTACCA CCATTAGCCC AGGTAATTGC TAACGGTAGC CTTTGGGTAA AATACCCACA AATAGCAAAT CTACTTGCCT TACCTAGTCC TGATCCTTCG TTGCAGGCTT GTGTGGCATC GTACTTCCAT ATACCGTATA CTCCGGTGAC TACTTCAACC TCAACAACCA CGACTACATC AACAACTACT ACATCAACAA CCACAACCAC TACTACACCG GTTACTACTA CGGCTACAAG CACTACTACG ACTACTTCAA CAGTAACCAC TACTGCCGTT AGCACTGTTA CAAGTACTGT TACAACCACT GCTGTAAGCA CGGTTACAAG TACAGCAACC ACTACGGCAG TAAGCACAGT AACAGTAACC AAACCAGTAG TATCAACAGC ATTAATAGCA GGAATAATTA TTATTGTAAT CGTAATAGCA GCAGTAGCAG CAATAATAAC ATTAAGAAGA AGATAA
|
Protein sequence | MRYISKSTIL ISVMVLAIIT IIALPQQTQK PQIIWADYGT ITTLTGPIYN PFYPNTLATD TVTSIISYAP LALYNPFNNV FYPVLASNWT IQVLPNGSGI LTVYLRKGLY WFNGSATMLF TAWDVYAYFY IEDKAFEAYA PFMQPQYADE SIRVLDNYTI QFLFQIWSPT EWIYFITSSI ATPWPVWKPI VNELKTMNAS QALAFSTNVT RYVVPYWGIF PYYLTYISSS SIELTLEPSP LLNQWYTVFP LADWNYYDPT FEEFFTGSQY IASLVSHKAT WAGGAAGIKQ VALLNSSGFS AYFAPDLSGW GITFNPHVYP FNITLIREAL CLIFNRTAVV AAWGLNYPNY YSQPIAPETI SSYPPSVRQF IIPCSYDPAK AAQMLQSLGF KKINGYWYLP NGSMFSIYVL APSGWIDWDT MASEAIEEMQ AFGINAKLIT MDAGAYWGTM IPDGDYVAAL TWTTAFTPAY YSAWEALSNP WWAFGSAISA YTPGSEVWPF QWPNGTCTPV TAPASLKLPN GTIVWCINST YGYINLSNWQ TFFNDATPGT ADYNLALDTI FAWYSYFTPI VPLAAKIDPF TYLTPIADPN WLYLCLPNET TWFLVSANWY TYGSLIMLMF GAVAPRGVVP PLAQVIANGS LWVKYPQIAN LLALPSPDPS LQACVASYFH IPYTPVTTST STTTTTSTTT TSTTTTTTTP VTTTATSTTT TTSTVTTTAV STVTSTVTTT AVSTVTSTAT TTAVSTVTVT KPVVSTALIA GIIIIVIVIA AVAAIITLRR R
|
| |