Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpen_1227 |
Symbol | rbcL |
ID | 4601724 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermofilum pendens Hrk 5 |
Kingdom | Archaea |
Replicon accession | NC_008698 |
Strand | + |
Start bp | 1164846 |
End bp | 1166177 |
Gene Length | 1332 bp |
Protein Length | 443 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 639774003 |
Product | ribulose bisophosphate carboxylase |
Protein accession | YP_920628 |
Protein GI | 119720133 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1850] Ribulose 1,5-bisphosphate carboxylase, large subunit |
TIGRFAM ID | [TIGR03326] ribulose bisphosphate carboxylase, type III |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGAAG AGTTTGAACC TTACGGCGAA TTTGTGGTCA AGAGCTACCT GCCGGATCCC GATAAAGACG TCATTGTCAC ATTTAGGGTT ACTCCGAGCG AGGGCTTCAC AATAGAGGAC GCAGCGGGAG GCGTAGCCGC GGAGAGCAGC GTCGGCACGT GGACTACCCT GTACCAGTGG TACGATAAGA GCAGGATCGA CAGGCTCAAG GGGAAGGCTT ACTACATGGA GAGCCTGGGC GACGGGTCAT ACATCCTCCG GGTCGCGTAC CCCGTCGAGC TGTTCGAAGA GGGGAACATG CCGGCCTTCC TGGCGTCCGT TGCCGGCAAC ATTTTCGGCA TGAGGAGGGT CCGCTCCCTG AGGGTTGAGG ACATATACTT GCCCGAAGCC TTCCTGAAGC ACTTCAAGGG CCCCTCGCAG GGCGTCGAGG GTGTACGCGG GAAGCTGAAG ATCTGGGGGA GGCCCATAAT AGGCACCGTG CCGAAGCCGA AGGTCGGGTA CTCCCCCGAG GAGGTCGAGA AGCTGGCCTA CGAGATACTG GTCGGGGGCA TGGACTTCGT GAAGGACGAC GAGAACCTGG CGGGCCCGAG CTACTGCAGG TTCGAGGAGA GGGCTAAGGC GATAATGAAG GCGATAGACA GGGCCGAAAA GGAGACGGGC GAGAGGAAGG CCTGGCTCGC CAACATAACG GCGGACGTCA GGGAGATGGA GCGCAGGCTT AAGCTTGTAG CGGAGCTCGG CAACACGCAC GTCATGGTCG ACGTGGTGAT AGCGGGCTGG TCCTCCCTGA CGTACGTAAG GGATCTAGCC GCGGACTACA AGCTGGCGAT ACACGGGCAC AGGGCTTTCC ACGCCGCCTT CACCCGGAAC CCCTACCACG GTGTATCGAT GTTCACCCTC GCGAAGCTGT ACAGGATAAT CGGCGTCGAC CAGCTACACG TAGGGACACC GGAGGTCGGC AAGCTGGAGG CGAAAGCCGT AGACGTGATC AGGATGGCGC GCCTACTCAG GGAGCAGACG TACAAGCCAG ACATTGAGGA CGGGCTCCAC ATGCAGCAAC CATTCCCCGG GATAAAGCCG GCTTTCCCCG TCTCCAGCGG AGGCCTCCAC CCGGGCACGC TACCCGCTGT CATCAAGGCT ATGGGCGTAG ACACCGTCAT CCAGGTTGGA GGGGGCGTTG TAGGGCACCC CGACGGACCG AGGGCGGGCG CCGCCGCGGC TAGGCAGGCT GTAGAAGCGT ACCTCGAGGG AGTCCCGCTA CAGGAGTACG CGAAGACGCA CAGAGAGCTT GCAAGAGCCC TCGAGAAATG GGGGCAAGTG ATACCCGTCT AG
|
Protein sequence | MPEEFEPYGE FVVKSYLPDP DKDVIVTFRV TPSEGFTIED AAGGVAAESS VGTWTTLYQW YDKSRIDRLK GKAYYMESLG DGSYILRVAY PVELFEEGNM PAFLASVAGN IFGMRRVRSL RVEDIYLPEA FLKHFKGPSQ GVEGVRGKLK IWGRPIIGTV PKPKVGYSPE EVEKLAYEIL VGGMDFVKDD ENLAGPSYCR FEERAKAIMK AIDRAEKETG ERKAWLANIT ADVREMERRL KLVAELGNTH VMVDVVIAGW SSLTYVRDLA ADYKLAIHGH RAFHAAFTRN PYHGVSMFTL AKLYRIIGVD QLHVGTPEVG KLEAKAVDVI RMARLLREQT YKPDIEDGLH MQQPFPGIKP AFPVSSGGLH PGTLPAVIKA MGVDTVIQVG GGVVGHPDGP RAGAAAARQA VEAYLEGVPL QEYAKTHREL ARALEKWGQV IPV
|
| |