Gene Pcal_0953 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPcal_0953 
Symbol 
ID4908124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum calidifontis JCM 11548 
KingdomArchaea 
Replicon accessionNC_009073 
Strand
Start bp900225 
End bp901295 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content54% 
IMG OID640124701 
Productvon Willebrand factor, type A 
Protein accessionYP_001055844 
Protein GI126459566 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1240] Mg-chelatase subunit ChlD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGACT TAGTAGAGCT CTTAGCGGCT GTATACTCCT GCCTCGGGGG CCTCTCCATA 
CGCAGTCTTA TATACGGCAT TGAGGACGTC TATGTACGGG CAGAGCTCGG CGACGTGGAC
TGGGAAAAAG TTCTTGAAAT ACTTGCACAG AATTTGGCGG GCACGCTTAA GATGAGCCCC
TCTGCGGCGA AGGAGGTTAT AAGAGAAGCT ATAACGTGTC GCCCGAAGTT GCGCGAAGGC
GCGCCCACGC TAGCCATAGG CTCGGTGGGC GACGAGAAGG CGCCTACGCT TGCCCACTTG
GTTAATAGAC ACGTCCCAGT GGACGCCACG CCCAGGGTAA AGCTTGAGGT AGTTAGGAGG
CTGGGCCTTC CTAGGGACAG AATTTTGCGC TCGTATAGCA GAGTCGTGGG TAGAGGCGAG
GGGTGGCACG TGCGGGGCGC CGTGAAGTCT CTGCGAGGCT ATATACCCGG CACGCCTTTC
GCCGATGTAG ATCTGATAAG GACGGCCACG GCTTTTAGAA GAAAGCTCGT CATGAATATG
CCGATTTCCG ACTTCGACAT ATTCGTAAGG GAGTATTCAA GGACGGCGGA TAAGCCGGTG
TACATAGCGC TTGACGTCTC GGGGAGCATG AAGGAGTACA TGTGGGGCGA CGTGAAGCTT
AGAGTCGCCA AGAACGCCGT GGCGAGGTAC TTGCGTCAGA TGGCAAGTCT CAGAGGCCGC
GTCTCGTTGT TGCTCTTCAA CGTCGACGCC GACTTTATGT GGACTCCCTA CGAGGTTCAT
AAGTATCTTA GGGAGATGCT CGAAATTCTC GAGTACGTAT ACGCCGGGGG CGGCACCGAG
CTTGCGTCTG CCCTAGAGGT GCTCTACAGC TACGGCGTTA GAGAGGCGGT GTTGATAACT
GATGGGAGAA CCGCCGACGT TGAAAAAACT TGGAGTCTCG TGAAAAAGTT CAAGAGACTC
CACGCCGTGG CGGTTGAGAA AAGCGACTTG TTGAAACAGA TTGCGAAAGC CACAGGCGGG
AAATACCAAG AGCTTAGCCC CAAGTTAGAC ATGTCGGTAA TACATGACTA G
 
Protein sequence
MNDLVELLAA VYSCLGGLSI RSLIYGIEDV YVRAELGDVD WEKVLEILAQ NLAGTLKMSP 
SAAKEVIREA ITCRPKLREG APTLAIGSVG DEKAPTLAHL VNRHVPVDAT PRVKLEVVRR
LGLPRDRILR SYSRVVGRGE GWHVRGAVKS LRGYIPGTPF ADVDLIRTAT AFRRKLVMNM
PISDFDIFVR EYSRTADKPV YIALDVSGSM KEYMWGDVKL RVAKNAVARY LRQMASLRGR
VSLLLFNVDA DFMWTPYEVH KYLREMLEIL EYVYAGGGTE LASALEVLYS YGVREAVLIT
DGRTADVEKT WSLVKKFKRL HAVAVEKSDL LKQIAKATGG KYQELSPKLD MSVIHD