Gene Pcal_0033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_0033 
Symbol 
ID4909466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp29959 
End bp31899 
Gene Length1941 bp 
Protein Length646 aa 
Translation table11 
GC content54% 
IMG OID640123786 
Producthypothetical protein 
Protein accessionYP_001054939 
Protein GI126458661 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACACCA AACCCTGGCT ACTACTAGCG TTAGCAGCCG CGTTAGTTGT AATATTCTAC 
TTGATGAATA CATCACAGAC TCCCACGGTG CCTACGCCGA CACCGACAAC TACTGCAACG
GTGACCCCAA GCCCGACCAC CTCCACTACG CTGACGGCCA CGCCCACTGA GACGCCGAAG
CCCACAGCCA CTGCTACTAC GACGACGGCA CCCAAGCCCA GCGCCGCGCC CGTGTATATC
CCACGCCTCG AGGTAGAGCT GTCGGCGCCC CAGGCGGTGA ACACCACGAA GTTGCCCACG
GCGGTGAATT ACACCGTGAC GTTGAGAAAC GTTGGAAACG GCACGGCGGT GGTGTACGTC
TTTGGAAAAT ACGTGGAAGT CAAGCCGGGC GAGGTGGTTA AGTTAAACGC CACAGCCACG
GCACAGGCGG CGGGCATACT CAAAATAGCA GTCGAAGTAA ACGGCACAGA GTACGCAAGG
GAGGTCTACA TCTACTACTA CACACCGATC TTGGCGGCAG AACCCGCCTA CGTCGAAGTG
AGAAAGTTAC CCACAAACGT CACCCTAAGC GTGGTGGTAA AAAACGTGGG CAACTGGACC
GGGAGGCTCG GCCCCATAGA GATACCGCCC GGAGGCACAG CCGCAATAAA TATAACAGCG
GCGGTCAACG CCACAGGCAC CTACTCTCTG CAAATAGGCG GCGTAGAGGT GCCCATAACC
GTCGTGTACA AGGCGCCGAG CTTTGAAATA AAGACGGGCG GCCCCACGGA GACGGAGGCC
CTCCCCGGGG AGAAGTACCC CGCCTGGCTG TGGATAAAAA ACGTGGGCAA CGCCACAGCC
AAACTCTCCA TAGACGGCGA AGAAAGAGAG CTAGGGCCAG GCGACGCTGT CAATATCACA
AAGTGGATAC AAGTAGACAA GGTAGGCATC TACAAAGCTG TGTTTAAGGT GGAGGGCGAC
TTAAACACGA CGGCCGTGCA CCAGCTGTCC GCCAAGATAG TGGCCGTGAA AGTGGAGATG
GTGTTGTGGA AGCCCGAGCT CAGGAGGGGC TGGCCGCCGC CCAACGGGGA GGATAGAACG
TCACTGTTGC TTGAATCCAA GACTGCCGAA GTGCAGTGGG GGTACATAAT AACTTCAAAC
GCCAGCAAGC GAACTGTCGT AGTATATGTT GAGGACGTTC AAGGCCGCGA CTATTTCACA
ATACCGCCCA AAGGCTCCGT AGGCAGAAAT CTAACTGCCA CAGTCCAAGC GCCGGGGAGC
ATAGCCGTCT GGGTAATCGT GAACGGCACT AAGTACACAT ACGTCATAAG TACACAACTT
GTCCCGCCTA AGGTCACTAT AAGAGATGTC TCAAAAATAG AGTTTAGAGA CTCCAGAGAA
ATTCTTGGTC TAGGGATAAA GTGTAGCGGA ATACCGATAG TGGGTACAAT ACAGCGTACA
ATAGACATAG TGGAAGTTTC TGGAGTGTTG GCTTATACCA CAGACGGCAA GACCATAGAG
GGTACTGTAA AGATCAGAAG CGTCGACGTA TACACAGGTA GCTACAGAGG TATCATCACT
GGCACAAGCG GACGAGTAGA CATCGATGTG GACTTCATGG GAGGACACCA CGTAATTACT
ACAAAGTTCC GGACGTCTCC GTTTGAAATA ACTGAGGTCC TAATAGACGG TGTCCCCGGG
AAATGCGATA TACCAACTCA GCTGATACCT AGCATATTCC TCAGCGGAAA ACCCGCGGCT
GACAATGAAT TGGCGACGCA GTACGCCTTT AGGCTAGTGT CTGCTTTTAA GAAGGGGGAC
AGCGACGTGC CTCAGCGGGT CGAGTGGAAT GGAGAATACG TAGAGGTAGT AGACAAGGGA
GGCAACGTGC TGAGGGTCTA CTTTGGACAG GGAGAAGTGG TCATAGAGGG GCCCCTCTCG
GCCAGGCTGG TGATATCTTA G
 
Protein sequence
MDTKPWLLLA LAAALVVIFY LMNTSQTPTV PTPTPTTTAT VTPSPTTSTT LTATPTETPK 
PTATATTTTA PKPSAAPVYI PRLEVELSAP QAVNTTKLPT AVNYTVTLRN VGNGTAVVYV
FGKYVEVKPG EVVKLNATAT AQAAGILKIA VEVNGTEYAR EVYIYYYTPI LAAEPAYVEV
RKLPTNVTLS VVVKNVGNWT GRLGPIEIPP GGTAAINITA AVNATGTYSL QIGGVEVPIT
VVYKAPSFEI KTGGPTETEA LPGEKYPAWL WIKNVGNATA KLSIDGEERE LGPGDAVNIT
KWIQVDKVGI YKAVFKVEGD LNTTAVHQLS AKIVAVKVEM VLWKPELRRG WPPPNGEDRT
SLLLESKTAE VQWGYIITSN ASKRTVVVYV EDVQGRDYFT IPPKGSVGRN LTATVQAPGS
IAVWVIVNGT KYTYVISTQL VPPKVTIRDV SKIEFRDSRE ILGLGIKCSG IPIVGTIQRT
IDIVEVSGVL AYTTDGKTIE GTVKIRSVDV YTGSYRGIIT GTSGRVDIDV DFMGGHHVIT
TKFRTSPFEI TEVLIDGVPG KCDIPTQLIP SIFLSGKPAA DNELATQYAF RLVSAFKKGD
SDVPQRVEWN GEYVEVVDKG GNVLRVYFGQ GEVVIEGPLS ARLVIS