Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_0422 |
Symbol | |
ID | 4908502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | - |
Start bp | 404242 |
End bp | 405819 |
Gene Length | 1578 bp |
Protein Length | 525 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640124174 |
Product | extracellular solute-binding protein |
Protein accession | YP_001055322 |
Protein GI | 126459044 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTCTAG CTCAACAACC AGCGCAGCCC ACAACCACTG CCGCCCCCAC CACGGCACGA CCTACCACGG CTCCCACCCC CCAGCCCACT ACCCCCGCCA CGACTACTAC GCCGCAGGCA ACGCCTTTGA CCATCACAAT AGGCGTCACA GACAAGGTGA CGGACCTCGA CCCGGCAAAC GCCTACGACT TCTTCACTTG GGAAGTCTTG TACAACACCA TGGCGGGCCT CGTCAGGTAT AAGCCAGGCA CTACGGAGAT AGAGCCAGAC CTCGCCGAGA GTTGGACTGT GCTCGAGGGG GGCAAGGTGT GGGTCTTTAA GCTAAGGCCG AATCTCAAGT TCTGCGACGG GACGCCGCTG ACCGCGGCCG ACGTCAAGAG GTCTATTGAG CGCGTGATGA AGATAAACGG CGACCCCGCG TGGCTTGTCA CAGACTTCGT AGAAAAGGTG GAGGCCCCTA ACGCTACAAC GGTGGTATTC TACCTACAAA AGCCAGTCTC CTACTTCCTA GCCCTAGCGG CTACGCCGCC CTACTTCCCG GTGCATCCCA AATACGCTCC AGACAAGATT GACTCAGATC AAACGGCGGG CGGCGCGGGG CCTTACTGTA TTAAGAGCTT TGTGAGAGAC CAGCAGATAG TCCTTGAGGC AAACCCCTAC TACTACGGCC CCAAGCCCCA GGTTGGCCGG GTAGTGATTC GGTTCTATAA AGATGCCACC ACTCTAAGAC TTGCCCTTGA GAGAGGGGAG GTGGACATTG CCTGGAGGAC TTTAAATCCG CCCGACGTAG AGGCGCTGAG GGCCTCGGGC AAGTTCAACA TAGTGGAAAT ACCGGGCTCC TTCATTAGGT ACATAGTGCT CAACCTCAAT ATGCCAGAGT TAAAAGACGT CAGAGTGAGA CAAGCCCTCG CCGCGGCCGT GTGCAGAAGG GACATAGTCA ACGTGGTTTA CCGCGGCACA GTTACGCCGC TGTACACGTT GATACCAGAG GGCATGTGGA GCTCTTACCC AGTCTTCAAA GAGAAGTACG GCGATTGCAA CATCACGCTT GCAAAGACGT TGCTACAACA GGCTGGTTAC AGCGAGTCCA AGAAGTTGAA CATTGAGCTG TGGTACACGC CTACTCACTA CGGCGACACT GAGAAAGACC TCGCGGCGAT GTTGAAGCAA CAGTGGGAGG CCACGGGGAT GATCGCTGTC ACAGTTAAAT CTGCCGAGTG GGCCACATAT GTGCAACAGC TCAGAAGCGG CGCATTGATG GTCTCACTGC TCGGCTGGTA CCCCGACTAC ATAGACCCCG ACGACTACAC AACGCCGTTT TTAAAGACTG GCGCAAATAA GTGGCTTGGA AACGGGTACA GCAACCCAGA GATGGACCAG ATCTTAGACA AGGCGTCGGT GGAAATATCT CAGACTGCCA GAGAACAGCT TTACCTACAG GCACAGCGCA TACTGGCCCA AGACGTGCCC ATAATACCGC TTATACAAGG CAAGTTGTAC ATGGCGACGA GGCCGGGCAT ACAGGTAGTG GCAGACCCCA CAATGATATT CAGGTACTGG ACCATCAAAG TCGGGTAG
|
Protein sequence | MFLAQQPAQP TTTAAPTTAR PTTAPTPQPT TPATTTTPQA TPLTITIGVT DKVTDLDPAN AYDFFTWEVL YNTMAGLVRY KPGTTEIEPD LAESWTVLEG GKVWVFKLRP NLKFCDGTPL TAADVKRSIE RVMKINGDPA WLVTDFVEKV EAPNATTVVF YLQKPVSYFL ALAATPPYFP VHPKYAPDKI DSDQTAGGAG PYCIKSFVRD QQIVLEANPY YYGPKPQVGR VVIRFYKDAT TLRLALERGE VDIAWRTLNP PDVEALRASG KFNIVEIPGS FIRYIVLNLN MPELKDVRVR QALAAAVCRR DIVNVVYRGT VTPLYTLIPE GMWSSYPVFK EKYGDCNITL AKTLLQQAGY SESKKLNIEL WYTPTHYGDT EKDLAAMLKQ QWEATGMIAV TVKSAEWATY VQQLRSGALM VSLLGWYPDY IDPDDYTTPF LKTGANKWLG NGYSNPEMDQ ILDKASVEIS QTAREQLYLQ AQRILAQDVP IIPLIQGKLY MATRPGIQVV ADPTMIFRYW TIKVG
|
| |