Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_1921 |
Symbol | |
ID | 4909301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | + |
Start bp | 1788428 |
End bp | 1790053 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640125672 |
Product | cytochrome d1, heme region |
Protein accession | YP_001056803 |
Protein GI | 126460525 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCAGT CCCAGGAGAA GGGTGCTGGG AGACGCGACG TGTTAAAGGC CTTTGCCGCC GCTGGCCTGG GCTTCGCTGT TGGTAGCTGG GCCCTTGCCT TGAGCCGCGG CGCGCCTGTG CAGAAGGTGG TGTACGAGAA GGCTGAGGAG GTGAGGGTGA AGCCCGTGGT TCAAGTACAG CAACAAACCG CGGCGCCGCA GGTGTCCGCG CCTCCCCACG ACGCTTTTCA GAAGAGGGGG CTCGCCTACT TGAGCGCCGA TTATATACAA AGCACTCTGA GGGTGTTGGT GCCGGAGGAC CAGCTCCCCA GCAAGCCCAC CGCCTACAAC ATCAACGACT TGGACTGGAT AGCCATACTG ATTGAGAGCA GATACCACGA GCCGGGCGTG GAGATGGTGG GGGCCTACAC CTTCCTCGAT ATGAAGAATT TTAAGGTGCT GAAGCGGGTG AAAAACGCGG GGGATAGAGT CCACGTGGTG AGGTTCGGCC GCGAGGAGTG GCCTGAGCAC AAGAGGAGGT TTGCCCTCGG GATGTCAAGG GACTGCTGGC TCTCCAAGAT CGACCTCTAC TCCATGCAGG TGGTTAGGCA GATTAAAATC GGCGTTGACT GCAGAAGCGC GGCGTACGAC AAGGACGGGA AGTACGTAAT AGCTGGCTCC AAGGACCCGG GCCACGTGGT AATCCTTGAC GCGGAGACCT TCAAGGTGTT GAAGGTTATC CCATTCCTTG GCGTGAGCAA GTTCTTCCCC ACGCCCATGA TGGGCCGGCA GGGCGCCATA TTGACAACCG ACTTGGGCTA CTGGCTAGTC AACGTAAAAG ACGCCGAGAT GGTGCTTGTG ATAGACTACA GAGACCCCTC CTTCCCCATT GTACACGTCT TCACGAGCTA TGACACTAAC AAGAAGGGGA GAAGCGTCAA AGTCCAAATC GGCGACAAGA CCTACGAGAC CACAGGCATT GGCAAAAGCC CCCACGAGCT CAACAAGTTG GACAAAGTGG GGAGGTATGT GGCTGTCACA GGGCAGGAGA GTAGCACAAT TACAATACTC GACGTAGAGA ACTTCGAGGT TGTCAACGTT ATCCCATGTG GAAAGAAGCC TCACCCAGGC CCCGGCACCC TAGTCCCCAA CAAGTACTTC CTAACCAACG CCATAGCCGA GGGCAAAATC ACAGTGATAA ACCTACAAAC GATGGACCTA GAGAAGTACA TAACTTACCC GCAGGAGTTC CCCGCCGACA CAGGCGGCGG CCTCTACTCC ACGCCGCCGC TCCCTGATGG CACAATACCC AAGGGCTTGG CGTGGTTTGA CACCACGTTT AATATAAACA AGGGCATATT CGCCGTAGAT ATATACATGC TTGACGTGGC GACGCGGCCG CCGAAGCCCG CGATATTTGC CGCAAACAAG CCGGGCAAGT GGGCCATGCA CCCCGGCTAT ACCCCAGACG GGAAGTATGT AATAAGCGCG CTGGAGCGCA CCGACACGGT GTACAAGGTA GACGCCGAGA CTGGGGAAAT TGTCGGCACC ATAAACCTGA AAAACGTAGA GCCAGTCCAG CTACTAGAGG AGCCAAGCCC AACGGGCATA TTCCCAGCTT GGCGCATAAA GGCGCCCTGG TTCTAA
|
Protein sequence | MSQSQEKGAG RRDVLKAFAA AGLGFAVGSW ALALSRGAPV QKVVYEKAEE VRVKPVVQVQ QQTAAPQVSA PPHDAFQKRG LAYLSADYIQ STLRVLVPED QLPSKPTAYN INDLDWIAIL IESRYHEPGV EMVGAYTFLD MKNFKVLKRV KNAGDRVHVV RFGREEWPEH KRRFALGMSR DCWLSKIDLY SMQVVRQIKI GVDCRSAAYD KDGKYVIAGS KDPGHVVILD AETFKVLKVI PFLGVSKFFP TPMMGRQGAI LTTDLGYWLV NVKDAEMVLV IDYRDPSFPI VHVFTSYDTN KKGRSVKVQI GDKTYETTGI GKSPHELNKL DKVGRYVAVT GQESSTITIL DVENFEVVNV IPCGKKPHPG PGTLVPNKYF LTNAIAEGKI TVINLQTMDL EKYITYPQEF PADTGGGLYS TPPLPDGTIP KGLAWFDTTF NINKGIFAVD IYMLDVATRP PKPAIFAANK PGKWAMHPGY TPDGKYVISA LERTDTVYKV DAETGEIVGT INLKNVEPVQ LLEEPSPTGI FPAWRIKAPW F
|
| |