Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_1780 |
Symbol | |
ID | 4910233 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | - |
Start bp | 1650067 |
End bp | 1651917 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640125529 |
Product | hypothetical protein |
Protein accession | YP_001056663 |
Protein GI | 126460385 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000000813258 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | TTGGTTGTGA AGACTGTATT AACCGCGCTT GACTTATTGG CCTCGTCTGT GGAGATGTCG AGGCTTGTTG GAGCACATGT GGAAAACGTC TACAGGACGG CGGCGGGATT CCTCTTTAAG TTTGGCCAGG GTTTTGTGAC TATCACTAGG CATAGGGTCT CTTTGACTGG GCTAATTCCT GAGAAGACGC ATGAGGGGGC GGAGACTTTG AGGGGGCTTT TCCGCGGCGA GCGGCTTACG TGGGTTGGCC TCCCGCGGTT TGACCGCATT GTGGAGTTCC ACTTTGAGAC TGGGACGCTC GTAGTTGAAC TATTGGAACC TTTCAACGTT GTCGCGGTTA GGGAGGGCAG AGTTGTGTGG CTTCTCCACA GCTATAGGGG GAAGGACAGG GAGCTCAAGG CGGGGGCTCC CTACACCTAT CCGCCCGCTG TGTTTGTAGA CGTCTTGAGG GCCGGGGTGG AGGAGGTGGC AAAGGCGATA GATCCGTCAG ATGTGAGGCG TAGCTTGATT AGGCGAGTTG GGACTGGACC TGAGTTCGCC GATGAGCTCA TTGCTAGAGC CGGCGTTGAC CCCCACGCGC TAGCCGCCGC GTTGAAGAAA ATCGTGGACG ACGTGGCGGC GGGGAGGCTT GAGCCGTCTG TGTGCATAAG GGGCGGCGCC GCTGTGACCG TCCTCCCCAT TGTGCCTGTG GCCGCCAAGT GCGACGAGGT GAGGCGGTTT GACTCATTCT GGGTTGCCCT CGACTTCTAC TTTGGCCCAC TGGAGCTGGA GGCGGCTAAG GCGTCGGCTA CGCGGGAGCT AGAGCAGAGG CGGAGGCGGC TGGAGGCGAG CATACAGGAG TTGGAGAGGA AAATTCCTGA GTACAGAGGC GAGGCGGCGA GGCTCAAAGC TCTTGCCCAT AGGCTGTTAG TGTATAAGTA CGAGGTAGAG CAAGCCTTGG CTGGCTCCGA ATCAAGTATA CGTGTAGTAT ACGTAGACGG AACTAGAGTA AAGATAATTC TGCCAGAGGG CGACGAGGTG GAGATTAGGC GAGATGTACA GTTGGGGCGC CAAATCTCTG CCTTGTTTGA GAAGGCCAAG GAGTTGGAGG AGAAGGCGGC CAAGGCGCAG GCAGTTCTAG ACAAGATGAG GGGGGAGTTG GCTAAGCTGG TGGAGGAGCA GAGGAAGGCG GAGGAGAAGG TGAAGTCGTC TGTCAAGGCT GTGGTTGAGA GGGAGTGGTT TGAGAAGTAC CACTGGACTG TGACCACTGG GAAGAGGCCG GTGTTGGGCG GCCGCGATGC CTCTCAGAAC GAGTCCATTG TCCGGAAGTA CTTAAAGGAT CACTACCTGT TCTTCCACGC GGATATCCCC GGCGCCTCTG TGGTAATAGC GCCACCCATA GAGGACCCCC TTGAGGTGCA CCAAGTGGCC CAGTTCGCCG CGGCGTATAG CAGGGCGTGG AAGATAGGCA TTCACGCGAT CGACGTCTAC TACGCCAGGG GGGAGCAAGT GTCTAAGCAG CCGCCGGCCG GCCAGTACTT GGCGAGGGGG TCGTTCATGG TGTATGGAAA GAGGGAGTAT GTGAGAAACG TGAGACTGGA ACTCGCCGTG GGCTGTAGAC GCGACGGCGA GGCGGCGAGA GTGGTCGCCG CGCCGCCTAA GTCGGCTCCC TTACTCGCGG AGAAATACGT CGTGGTGACC CCTGGGAACG TTGAGAAGAG TAGGCTGGCG AAGGAGTTGG CGGAGAAGTG GGGCGGTTGC AACGTTGATG AGATAGTGGC GGCCTTGCCC GGCCCCTCGA GGGTGTCTGA GGTGGGCAAG GGCTCGCCGC TGTCTTGGGA GGAGATAAGG GAGATATTTA AGTCGTGGTA G
|
Protein sequence | MVVKTVLTAL DLLASSVEMS RLVGAHVENV YRTAAGFLFK FGQGFVTITR HRVSLTGLIP EKTHEGAETL RGLFRGERLT WVGLPRFDRI VEFHFETGTL VVELLEPFNV VAVREGRVVW LLHSYRGKDR ELKAGAPYTY PPAVFVDVLR AGVEEVAKAI DPSDVRRSLI RRVGTGPEFA DELIARAGVD PHALAAALKK IVDDVAAGRL EPSVCIRGGA AVTVLPIVPV AAKCDEVRRF DSFWVALDFY FGPLELEAAK ASATRELEQR RRRLEASIQE LERKIPEYRG EAARLKALAH RLLVYKYEVE QALAGSESSI RVVYVDGTRV KIILPEGDEV EIRRDVQLGR QISALFEKAK ELEEKAAKAQ AVLDKMRGEL AKLVEEQRKA EEKVKSSVKA VVEREWFEKY HWTVTTGKRP VLGGRDASQN ESIVRKYLKD HYLFFHADIP GASVVIAPPI EDPLEVHQVA QFAAAYSRAW KIGIHAIDVY YARGEQVSKQ PPAGQYLARG SFMVYGKREY VRNVRLELAV GCRRDGEAAR VVAAPPKSAP LLAEKYVVVT PGNVEKSRLA KELAEKWGGC NVDEIVAALP GPSRVSEVGK GSPLSWEEIR EIFKSW
|
| |