Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_14457 |
Symbol | RPA1 |
ID | 7203231 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 631178 |
End bp | 633225 |
Gene Length | 2048 bp |
Protein Length | 606 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | replication protein a large 70 kD subunit |
Protein accession | XP_002182265 |
Protein GI | 219123923 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGGCA ACGGCGGAGG ACCTTCCTTC AGTCCCATCC TCCAAGTCCT CGATACCAAG CAAGTTCCGG GACCCCAAGG AAGTGTGCGC TATCGGGTAC GTCGAGAGTG TGTGGGGTGG TTGAGTTGAT GTAATGGTGT TCCTAGTTGC GGGTATCACC AACGGGGTCG CCTCGAGATC TCTGCTTGTA CTCACAATAC CACAACACAC ACTCTCACTT GCAACACACT CACTCACTTA CCATATATTT TCTACAGATT GTCCTGTCCG ACGGCAAGCA CTACATTCAA GGCATGCTCG CGACCCAACT GAACCACATG ATTGCTACTA ATCTCATTGG TGCCAACACG ATAGTGCAAG TCGAGCAATT CATGTCCAAT CGCGTCAAGG ACCGCACCGT CATTATTCTC CTCAACGTGC ACGCCCTCCG CAACGAACCC TCCCGCATTG GGGACCCGCA GGACATTGAA AAGGCGCAAA TTGCAACCCC GGGAGTCGCC CCGTCCCTCA CGAGCGCACC AGCAGCAGCA GCAGTACCGG CACAACCTCT CTACAACAAG ACCAATTACG CCACCGGAAG CTCCGTCCCA CCCGTCAAGA CCGGGAGTCC CAATCCCTAC GGAGGTCCTT CCCGAAACCC ACACCACGCT TCCCCCGTGC GGTCCTCCCA CGCACCGATT GTACGCGATA CCGTGAACGG AACACCCATC ACCAACATTA GCAATCTCAA CATGTACGCC AACAAGTGGA CCATTCAAGC CCGCGTTACT TCCAAATCAG ATATTCGCAC CTGGTCCAAC GCCAAAGGCG AAGGTAGCCT CTTTTCCGTC GAACTACTCG ATCAAACGCA GGATGTCCGG GCTACCTTCT TCAAGGAAGC CGTCGACAAG TTCTACAGCT TTTTGCAGAT CGGATCCGTC TACACATTTT CCGGTGGACG TCTCAAGGTC GCCAACGCAC AGTACAATAC CTGCCAGAGC AATTTCGAAA TCACTTTTGA TCAGAACTCC GAAATACACC TCGCCAACGA CGACGGAAAT ATACAACGGC AGCACTACGA ATTCATCGAC AGCATCGCCG GTCTGGAGCG TACCGAACCC AACACCATGG TGGATCTGCT CGCCGTCGTA CAGGCGGTCG GAGAAGTCGC CACGATTGTC AGCAAAAAGT CGGGCCAGGA GCTCACCAAG TGCGATTTGA CCCTCATCGA CACCTCCGCC ACGCAAATCA CCCTCACTCT CTGGGGAGAC AAGGCTGCCT CGGCACTCAC CGACTACAAC CAACAACCCG TCGTCGCGAT CCGGCGCGCA CGCGTCAGCG ACTACGGCGG ACGTAGTCTC TCCCTCTCCG GATCCATCGA AACCAATCCG GACATTCCCC AAACCGCGCC CTTGCAAACG TGGTGGCGGA CCCAAACCGC CAACGGCGGT GTCACCGGGA AATCTCTATC CGCCACCCGC GGCGGAGCCG GCGTGATGGA AAGTTTGGAA CAACGACAAA CCATTGCCGA CATCAAGAAT AACAATCTGG GTTACGCCGG CGACAAGCCC GATTGGCTGA CGTTCAAGGC CACCGTTTCC TTTTTGAAAA AAGACAAGGA GGGGGGTTCG TGGTACCCAG CCTGTGCCAA CGCCGGGGAA CCCTGCAAAA ACCGGTACAA GGTCACGCAA ACGACCGACG GCAACTGGTA CTGCGACAAG TGTCAAGGCT CGTTTCCCAC CTGCGTCCGG CGCTGGATCT TTTCCGGGGT CGTCGAAGAC GACACCAGCT CTACCTGGGT CAGTTTCTTC AACGAACAGG CCGAAACACT GCTCGCCGGC GCCACGGCCG ACCAAGTCTA CGCCGAGACC TATCAGGATC AGCAAGATCA GGACGCCTAC GACTCGTACT TTGCCAAGGC CAACCACACG GAATGGATCT TCAAATGCAA GGTCAAGAAC GAAATGGTCA ACGAAGAATC CCGCGTCAAA ACCAGCGTCG TGGCCATGCA ACCCGTCGAC TTTGCCAAGG AATCCCGCGA TCTCCTGTCG GCCCTGGCCA AGTTTTAA
|
Protein sequence | MTGNGGGPSF SPILQVLDTK QVPGPQGSVR YRIVLSDGKH YIQGMLATQL NHMIATNLIG ANTIVQVEQF MSNRVKDRTV IILLNVHALR NEPSRIGDPQ DIEKAQIATP GTNYATGSSV PPVKTGSPNP YGGPSRNPHH ASPVRSSHAP IVRDTVNGTP ITNISNLNMY ANKWTIQARV TSKSDIRTWS NAKGEGSLFS VELLDQTQDV RATFFKEAVD KFYSFLQIGS VYTFSGGRLK VANAQYNTCQ SNFEITFDQN SEIHLANDDG NIQRQHYEFI DSIAGLERTE PNTMVDLLAV VQAVGEVATI VSKKSGQELT KCDLTLIDTS ATQITLTLWG DKAASALTDY NQQPVVAIRR ARVSDYGGRS LSLSGSIETN PDIPQTAPLQ TWWRTQTANG GVTGKSLSAT RGGAGVMESL EQRQTIADIK NNNLGYAGDK PDWLTFKATV SFLKKDKEGG SWYPACANAG EPCKNRYKVT QTTDGNWYCD KCQGSFPTCV RRWIFSGVVE DDTSSTWVSF FNEQAETLLA GATADQVYAE TYQDQQDQDA YDSYFAKANH TEWIFKCKVK NEMVNEESRV KTSVVAMQPV DFAKESRDLL SALAKF
|
| |