Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pcal_0033 |
Symbol | |
ID | 4909466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum calidifontis JCM 11548 |
Kingdom | Archaea |
Replicon accession | NC_009073 |
Strand | + |
Start bp | 29959 |
End bp | 31899 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640123786 |
Product | hypothetical protein |
Protein accession | YP_001054939 |
Protein GI | 126458661 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCA AACCCTGGCT ACTACTAGCG TTAGCAGCCG CGTTAGTTGT AATATTCTAC TTGATGAATA CATCACAGAC TCCCACGGTG CCTACGCCGA CACCGACAAC TACTGCAACG GTGACCCCAA GCCCGACCAC CTCCACTACG CTGACGGCCA CGCCCACTGA GACGCCGAAG CCCACAGCCA CTGCTACTAC GACGACGGCA CCCAAGCCCA GCGCCGCGCC CGTGTATATC CCACGCCTCG AGGTAGAGCT GTCGGCGCCC CAGGCGGTGA ACACCACGAA GTTGCCCACG GCGGTGAATT ACACCGTGAC GTTGAGAAAC GTTGGAAACG GCACGGCGGT GGTGTACGTC TTTGGAAAAT ACGTGGAAGT CAAGCCGGGC GAGGTGGTTA AGTTAAACGC CACAGCCACG GCACAGGCGG CGGGCATACT CAAAATAGCA GTCGAAGTAA ACGGCACAGA GTACGCAAGG GAGGTCTACA TCTACTACTA CACACCGATC TTGGCGGCAG AACCCGCCTA CGTCGAAGTG AGAAAGTTAC CCACAAACGT CACCCTAAGC GTGGTGGTAA AAAACGTGGG CAACTGGACC GGGAGGCTCG GCCCCATAGA GATACCGCCC GGAGGCACAG CCGCAATAAA TATAACAGCG GCGGTCAACG CCACAGGCAC CTACTCTCTG CAAATAGGCG GCGTAGAGGT GCCCATAACC GTCGTGTACA AGGCGCCGAG CTTTGAAATA AAGACGGGCG GCCCCACGGA GACGGAGGCC CTCCCCGGGG AGAAGTACCC CGCCTGGCTG TGGATAAAAA ACGTGGGCAA CGCCACAGCC AAACTCTCCA TAGACGGCGA AGAAAGAGAG CTAGGGCCAG GCGACGCTGT CAATATCACA AAGTGGATAC AAGTAGACAA GGTAGGCATC TACAAAGCTG TGTTTAAGGT GGAGGGCGAC TTAAACACGA CGGCCGTGCA CCAGCTGTCC GCCAAGATAG TGGCCGTGAA AGTGGAGATG GTGTTGTGGA AGCCCGAGCT CAGGAGGGGC TGGCCGCCGC CCAACGGGGA GGATAGAACG TCACTGTTGC TTGAATCCAA GACTGCCGAA GTGCAGTGGG GGTACATAAT AACTTCAAAC GCCAGCAAGC GAACTGTCGT AGTATATGTT GAGGACGTTC AAGGCCGCGA CTATTTCACA ATACCGCCCA AAGGCTCCGT AGGCAGAAAT CTAACTGCCA CAGTCCAAGC GCCGGGGAGC ATAGCCGTCT GGGTAATCGT GAACGGCACT AAGTACACAT ACGTCATAAG TACACAACTT GTCCCGCCTA AGGTCACTAT AAGAGATGTC TCAAAAATAG AGTTTAGAGA CTCCAGAGAA ATTCTTGGTC TAGGGATAAA GTGTAGCGGA ATACCGATAG TGGGTACAAT ACAGCGTACA ATAGACATAG TGGAAGTTTC TGGAGTGTTG GCTTATACCA CAGACGGCAA GACCATAGAG GGTACTGTAA AGATCAGAAG CGTCGACGTA TACACAGGTA GCTACAGAGG TATCATCACT GGCACAAGCG GACGAGTAGA CATCGATGTG GACTTCATGG GAGGACACCA CGTAATTACT ACAAAGTTCC GGACGTCTCC GTTTGAAATA ACTGAGGTCC TAATAGACGG TGTCCCCGGG AAATGCGATA TACCAACTCA GCTGATACCT AGCATATTCC TCAGCGGAAA ACCCGCGGCT GACAATGAAT TGGCGACGCA GTACGCCTTT AGGCTAGTGT CTGCTTTTAA GAAGGGGGAC AGCGACGTGC CTCAGCGGGT CGAGTGGAAT GGAGAATACG TAGAGGTAGT AGACAAGGGA GGCAACGTGC TGAGGGTCTA CTTTGGACAG GGAGAAGTGG TCATAGAGGG GCCCCTCTCG GCCAGGCTGG TGATATCTTA G
|
Protein sequence | MDTKPWLLLA LAAALVVIFY LMNTSQTPTV PTPTPTTTAT VTPSPTTSTT LTATPTETPK PTATATTTTA PKPSAAPVYI PRLEVELSAP QAVNTTKLPT AVNYTVTLRN VGNGTAVVYV FGKYVEVKPG EVVKLNATAT AQAAGILKIA VEVNGTEYAR EVYIYYYTPI LAAEPAYVEV RKLPTNVTLS VVVKNVGNWT GRLGPIEIPP GGTAAINITA AVNATGTYSL QIGGVEVPIT VVYKAPSFEI KTGGPTETEA LPGEKYPAWL WIKNVGNATA KLSIDGEERE LGPGDAVNIT KWIQVDKVGI YKAVFKVEGD LNTTAVHQLS AKIVAVKVEM VLWKPELRRG WPPPNGEDRT SLLLESKTAE VQWGYIITSN ASKRTVVVYV EDVQGRDYFT IPPKGSVGRN LTATVQAPGS IAVWVIVNGT KYTYVISTQL VPPKVTIRDV SKIEFRDSRE ILGLGIKCSG IPIVGTIQRT IDIVEVSGVL AYTTDGKTIE GTVKIRSVDV YTGSYRGIIT GTSGRVDIDV DFMGGHHVIT TKFRTSPFEI TEVLIDGVPG KCDIPTQLIP SIFLSGKPAA DNELATQYAF RLVSAFKKGD SDVPQRVEWN GEYVEVVDKG GNVLRVYFGQ GEVVIEGPLS ARLVIS
|
| |