Gene Information |
Locus tag | Pars_2342 |
Symbol | |
ID | 5054894 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2094008 |
End bp | 2095441 |
Gene Length | 1434 bp |
Protein Length | 477 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469894 |
Product | extracellular solute-binding protein |
Protein accession | YP_001154538 |
Protein GI | 145592536 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
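The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the start and end positions are inclusive, 1-based coordinates; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues in an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Pars_2342.
length = gene_length_bp(2094008, 2095441)
print(length, protein_length_aa(length))  # 1434 477
```

Both results agree with the Gene Length (1434 bp) and Protein Length (477 aa) fields listed in the record.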
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.936887 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0125511 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAT CTCAAGTAGT AGGGATTGTT GTCGCGTTGG TTGTTGTGGC GTTGTTGGGG TATTACCTCG GTTCTATGTC GCAACAGCCG CAAACTACGG CGCCGCCTTC ACAGACTACG GCATCGCCTC AGCCGACCAC ACCGTCGGCC CCTAAGAAGA TTACGTTTTA CACCTGGTGG GCTGGTCTTG AGAGGTTTGC CATCGATGCC GTTATTGGAA ACTTTACTAA AAAGACCGGC ATCGCTGTAG AGAAGACGGC GGTGCCTGGC GGCGCCGGTG TGAATGCCAA GTTTGCAATC TTGGCGCTGA TGCAGGCGGG GAATCCGCCG GCTGCATTCC AAGTACACTG CGGCCCCGAG ATGTTGAGCT ATATCTACGT TGCACCGAAG GGCGAGGCCA GCTTTATGGA GCTGAGCAAC GTGGCCAAGG AAATAGATAT GTCTACCCAA GCGCCTGACG TTCTGTCTGC GTGTTCCCTC AACGGTAAGG TTTTCGCGCT TCCCGTCAAT ATCCACAGGG CTAACCTCAT CTTCATAAAC AAACAAGTTC TCGACAAACT CGGCGGCTCT GTCCCCACAA CCCTAGACGA TTTGGTTGCT CTCTGCAAGA AGGCCTCTGC CGCCGGGATG CCATGCCTAA TCCAAGCGGG CGCAGATCAG TTCACAGTGC TACACCTCTG GGAGCAAGTT TTCCTTGCGG TGGCGGGGCC CCAGAAGTTT ATAATGTTCA TGTACGGCAC TCTGTCACCC GATGACCCCT CGCTAAAACA AGCCACGGAG AAGTTCTTAG AACTAGCTAA GTCATTCCCA GCTAACTGGC CAGCCCTAGA CTGGACAGGC GCCGTCGCTG ACTTAGTTGC GGGCAAAGGG CTGGCTCACG TAGACGGGGA TTGGGTAGTG GGCCTAATAT ACAACGTGTA TCCCCAGGTG AAGATGTGCC CCTACACCTC GATAACTCCA GACTGTAACA TAGTTGTCGC GCCGTTCCCC GGGACCCAGG GAGTTTACAA TCTGGTGATA GACTCAGTCG CGGTGCCGGC CGGCGGTCCG ACCACGCAAT TGGGCATAGA GTTTGTCAAG TTCTTCGCGG GGCCAGAGGG GCAGTCTATC TTCAACCCAT TAAAGGGCTC AATAGCAGTG TACAAGAACA TAGATCCCGC AATATACCCC ACCCCAATAC AGAGGTGGGA AGTACAGGAG TATAGAGACG CCAGATCCTA CGTGTTCAGC CTTACCCACG GCGCGTTGTT CTCAGACGTC TGGCAGATGT TGTTGCAACA AGCCATAGTC CTTGTGCAGA CGGGGAGGGC AGATCTGTGG TACGACACCT TGTCAAAAGC GCTGTCCACA GAGCGCTCCC TGTGGAAGGA CACCTGGTAC CTCGGCGCGC CGGGTAAGCC CTTCGGCGGC TACCAACCGC CGTGGGTTAA ATGA
|
Protein sequence | MKLSQVVGIV VALVVVALLG YYLGSMSQQP QTTAPPSQTT ASPQPTTPSA PKKITFYTWW AGLERFAIDA VIGNFTKKTG IAVEKTAVPG GAGVNAKFAI LALMQAGNPP AAFQVHCGPE MLSYIYVAPK GEASFMELSN VAKEIDMSTQ APDVLSACSL NGKVFALPVN IHRANLIFIN KQVLDKLGGS VPTTLDDLVA LCKKASAAGM PCLIQAGADQ FTVLHLWEQV FLAVAGPQKF IMFMYGTLSP DDPSLKQATE KFLELAKSFP ANWPALDWTG AVADLVAGKG LAHVDGDWVV GLIYNVYPQV KMCPYTSITP DCNIVVAPFP GTQGVYNLVI DSVAVPAGGP TTQLGIEFVK FFAGPEGQSI FNPLKGSIAV YKNIDPAIYP TPIQRWEVQE YRDARSYVFS LTHGALFSDV WQMLLQQAIV LVQTGRADLW YDTLSKALST ERSLWKDTWY LGAPGKPFGG YQPPWVK
|
| |
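The gene sequence above is listed in coding orientation (it begins with ATG and ends with the stop codon TGA) even though the gene lies on the minus strand. As a rough sketch of how the GC content and protein sequence fields could be recomputed from it, the helpers below strip the 10-base block spacing and translate codon by codon. The function names are hypothetical, and the codon assignments are those shared by the standard code and translation table 11; alternative start codons are not handled.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# are the same for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, dropping a terminal stop codon if present."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, len(s), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence from the record.
print(translate_cds("ATGAAGTTAT CTCAAGTAGT AGGGATTGTT"))  # MKLSQVVGIV
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate_cds` would allow the same comparison against the GC content and Protein sequence fields.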