Gene Pcal_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_0422 
Symbol 
ID4908502 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp404242 
End bp405819 
Gene Length1578 bp 
Protein Length525 aa 
Translation table11 
GC content56% 
IMG OID640124174 
Productextracellular solute-binding protein 
Protein accessionYP_001055322 
Protein GI126459044 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTCTAG CTCAACAACC AGCGCAGCCC ACAACCACTG CCGCCCCCAC CACGGCACGA 
CCTACCACGG CTCCCACCCC CCAGCCCACT ACCCCCGCCA CGACTACTAC GCCGCAGGCA
ACGCCTTTGA CCATCACAAT AGGCGTCACA GACAAGGTGA CGGACCTCGA CCCGGCAAAC
GCCTACGACT TCTTCACTTG GGAAGTCTTG TACAACACCA TGGCGGGCCT CGTCAGGTAT
AAGCCAGGCA CTACGGAGAT AGAGCCAGAC CTCGCCGAGA GTTGGACTGT GCTCGAGGGG
GGCAAGGTGT GGGTCTTTAA GCTAAGGCCG AATCTCAAGT TCTGCGACGG GACGCCGCTG
ACCGCGGCCG ACGTCAAGAG GTCTATTGAG CGCGTGATGA AGATAAACGG CGACCCCGCG
TGGCTTGTCA CAGACTTCGT AGAAAAGGTG GAGGCCCCTA ACGCTACAAC GGTGGTATTC
TACCTACAAA AGCCAGTCTC CTACTTCCTA GCCCTAGCGG CTACGCCGCC CTACTTCCCG
GTGCATCCCA AATACGCTCC AGACAAGATT GACTCAGATC AAACGGCGGG CGGCGCGGGG
CCTTACTGTA TTAAGAGCTT TGTGAGAGAC CAGCAGATAG TCCTTGAGGC AAACCCCTAC
TACTACGGCC CCAAGCCCCA GGTTGGCCGG GTAGTGATTC GGTTCTATAA AGATGCCACC
ACTCTAAGAC TTGCCCTTGA GAGAGGGGAG GTGGACATTG CCTGGAGGAC TTTAAATCCG
CCCGACGTAG AGGCGCTGAG GGCCTCGGGC AAGTTCAACA TAGTGGAAAT ACCGGGCTCC
TTCATTAGGT ACATAGTGCT CAACCTCAAT ATGCCAGAGT TAAAAGACGT CAGAGTGAGA
CAAGCCCTCG CCGCGGCCGT GTGCAGAAGG GACATAGTCA ACGTGGTTTA CCGCGGCACA
GTTACGCCGC TGTACACGTT GATACCAGAG GGCATGTGGA GCTCTTACCC AGTCTTCAAA
GAGAAGTACG GCGATTGCAA CATCACGCTT GCAAAGACGT TGCTACAACA GGCTGGTTAC
AGCGAGTCCA AGAAGTTGAA CATTGAGCTG TGGTACACGC CTACTCACTA CGGCGACACT
GAGAAAGACC TCGCGGCGAT GTTGAAGCAA CAGTGGGAGG CCACGGGGAT GATCGCTGTC
ACAGTTAAAT CTGCCGAGTG GGCCACATAT GTGCAACAGC TCAGAAGCGG CGCATTGATG
GTCTCACTGC TCGGCTGGTA CCCCGACTAC ATAGACCCCG ACGACTACAC AACGCCGTTT
TTAAAGACTG GCGCAAATAA GTGGCTTGGA AACGGGTACA GCAACCCAGA GATGGACCAG
ATCTTAGACA AGGCGTCGGT GGAAATATCT CAGACTGCCA GAGAACAGCT TTACCTACAG
GCACAGCGCA TACTGGCCCA AGACGTGCCC ATAATACCGC TTATACAAGG CAAGTTGTAC
ATGGCGACGA GGCCGGGCAT ACAGGTAGTG GCAGACCCCA CAATGATATT CAGGTACTGG
ACCATCAAAG TCGGGTAG
 
Protein sequence
MFLAQQPAQP TTTAAPTTAR PTTAPTPQPT TPATTTTPQA TPLTITIGVT DKVTDLDPAN 
AYDFFTWEVL YNTMAGLVRY KPGTTEIEPD LAESWTVLEG GKVWVFKLRP NLKFCDGTPL
TAADVKRSIE RVMKINGDPA WLVTDFVEKV EAPNATTVVF YLQKPVSYFL ALAATPPYFP
VHPKYAPDKI DSDQTAGGAG PYCIKSFVRD QQIVLEANPY YYGPKPQVGR VVIRFYKDAT
TLRLALERGE VDIAWRTLNP PDVEALRASG KFNIVEIPGS FIRYIVLNLN MPELKDVRVR
QALAAAVCRR DIVNVVYRGT VTPLYTLIPE GMWSSYPVFK EKYGDCNITL AKTLLQQAGY
SESKKLNIEL WYTPTHYGDT EKDLAAMLKQ QWEATGMIAV TVKSAEWATY VQQLRSGALM
VSLLGWYPDY IDPDDYTTPF LKTGANKWLG NGYSNPEMDQ ILDKASVEIS QTAREQLYLQ
AQRILAQDVP IIPLIQGKLY MATRPGIQVV ADPTMIFRYW TIKVG