Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48114 |
Symbol | |
ID | 7203471 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 241834 |
End bp | 244829 |
Gene Length | 2996 bp |
Protein Length | 213 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182496 |
Protein GI | 219124408 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGAAA TCGATATTAA AAAGATGAAA GGTAGGTACC GGCATTGCTT TTTCATTTGC GAGAATCGGA TAGTTTTGGT CGAACCCACA TGTACGCATT TGCGGATTGC AGTCCTCGAT TTTTCAACCT GTGACACATA TTGACTCACA AAACAGCACT GTACGTTGCA CAGTTGCGGA ATTAAGGACA GAGTTGGAAA GACGCGGTCT TCCCGTGTCG GGGAACAAAC CAGATCTTAT CAATCGTTTA CAGGCGAGGT TGGACGAGGA GGAATTCGGT TTGATTGATG CGCCCGCTAC CACCATCAAC ACTACCATGA CGAATTTAAA CGAGACGGGA GAAGCAACGG GCGACAACCC CAGCGATTCA GCGACCGAGC GAGAAAAGAA GGGGCTTGTG CCAACAAAAA CCGAAGAAAT GGGGGAAGAA TCCAAAACAG ACAAATTCAT CAGCGAGGCT CACGATGCAA ACAACGTGAA AGTGTCGGCG CAGCCCACTT TTGACGAAAT TAAGCGTCGC CGCGCGAATC GTTTCGATAT ACCGGTCGTC GAAACGAAAG TTCAGTCGCC GCAGAAGCGT GGCCGTGACA AAAAGAAAAG TGCAAATCTA CCCATTCTAC AAAAGGGCAG CAAGCGCGCC AGGCAGCAAG CAAGTGAAAA AGATGATGTA CTGTTGCCGA AAGAAGAAAT CGAAAAGCGA CTGAAACGGG CAGAAAAGTA TGGGACTGGC AACGAAGAAA ACATTCTTAA ATTGAAGGCT ATGCTTCGCA AGTATAGATT TTGACTGGAG CTAGGGAACA CCTTCAGATT GTACAACCTT GGCTTCGGTC TCGAAATTAA GTTGTAGAAT CTACCCTGAT CAGGAAATGG TAAATAAAAT ATCTTCGAAG TTTCACAGTT TCAAAGTGTC TTTGCTCCGA TACCAGCCAG GACTTGTCAT TCAACATTTG AGTCAAAGCC ATTTCCTTTT TGGAAGTCTC AAAAATAGAC CGAGGGTACT CTGTATCGAG ATAAAACCAG AGGATTCGAT CCGTTGAAGC TAAATCTGGA AGTTTTTTAG GTGAAAGCAG GCGAAAAGGA AAAAAAGCAT AGCAGGATGC TTCTAAGCCT CGGATATCTT TCCAGTGGCT ATAAGAGACT GAGACTCTCA CTTCGGAGTC TGGGCCCGTC GGTTTACCGA CTTCTTAAAC TTTAAATCAT GCTCCGACTG GCATTTGTTT CTCATTGTAG GTCTATGATC CAAGCTTAAG GCAGCAACGG ATAGGGCTTC AAGAAGACCT TTTTCAGATT GGTACTTTTT GCGGAGAAGT GGCCGAGAGG AACGTCCGTC ACATCTTGAC GGAGTCTTAG ACCGATGCTG ATCATGCGTG CGTACGAACG CACGCTCCGC TTTACTTTTA CCTGGCTCCC TACACACCAC TCGGGTTTCA CACAAGTTTT GTTCTCCTGC GTGCTGTTCA AGTGGCGCAT TGGGAAGCCC GGCGCAAATT TTGTGTGGAG CCTTGCCGTG CGCCTGGATG TCAGGAGGCT TGCAAATAGA TTCATCTGCC GCATAAGAAG AACACCCTTT CAACACGATC TCGTCGTGCT TTTTCGGCGA ATAGGAAAGA TGTGACTTTG ATTTCTTCGA CGTAGGATGA TGAGTCTCGT TTCTATCAAA TGTGGCGTTG CTTCTGACTC CAAGGTTAGT CACCTGAACC ATTCTTGCAT GTCGGCTGTG GCTATCCGGT TGATGACGAG AAGCAGCGGT TGCGCAGACC ATTGACTGCC TAGTGGAAAG GGACCGCTGT TTCATCGACG TCACAGCCAA TTTCGAAGGA CGACGAGCGC GAATGGGGTT GTCTGGGACT GCAAGTATTT CCGGTTTGGA ATATTGCCCA CTCTTGTTGC TCCGTCGAGG GCGCGATGTC AAATCTTTAG ATTTCGTTGA CGAAGACGTC TCGCCCACGA GAAATGAGCT TGTTTGGGTA CCAGCCGTCG AAATAATTTC ACTCACAAAG CGACGCTCGG ACTTCTGAGC ACGTACTTTT CTCAGTTTTG CTTGCTTGAT TTCAACTGAG CTCTCCGATT TGTGTATTTT TTTCAATTGC GGACGCTTTA TTACCTTGGT AGCGGTCGAA GATCTGCAGG GTAAAGATGG AACTTGATCC CGGATTTCTG GGGCTAAAAG ATCATTCGAC GAGGATGCTC GAATATATTT TTTGAGCTGA CCGCTGTTTT GAGCTTTCGC CGCGTGTTGC GATCTTCTTA TTTTCCCTTC TGGATTCAAC GATTTGACTT CTGTTTTAGT GAGAAAAAGA GAAGTCCATA GCGATCGTGA GGAGATAGAT CCATTCGGAG CAGACAGAAT AGACAATAGT GGCTGAGAAA GAGAACTTTG AGTAGGGCAG AGCACATGAC TCGAGTTGAA TGAGTCATCG CGAACGGTGG AGGAAGATTC GAATTTCTTG ACATCGAGAT TTGGATACAA GTTCGATGTA TGCGATGAGT CAATGGCAAT TAGTTTTCGC AACACGGTAC GGCACGGTGG ATCCGCATGC TGTATCCCGT TAAAGTCCAT CTCGGGAGGC TCTGAATCAA TGGAATTCGC TTCATCAAAC GTAATAGAGG AGTCCGATTG TGCAATGATA CTCGGTTTAG TATTTTGACG TCTTTCCTTG AAAGTGTGTC CAGATTCGGC GATATCGTAC TCATCAAATC CGAAAGGTTT TTCTTCGTTT CTCTCACTGC TAACAGTAAG GCTATCATTC ATGTACCAAG AGGGTTCGCC CATACTCACG GCTACCACCA CATCCATCTT ATTTTGCTGT TCACTATGAA TTCCGCGGAA ATTTTCTCGT TCTTCAAATT ATAGGGAGCC AAAAATTGGA TTCACTTTGG ATGCAAGATT CTACGAATTC AGACCCAAAG AGAGACAGGA TGATCAATAT TTACACGTAC AGTCCTAGTA TGAATCGACA TATTTCCTAC GATGAGACGA GCCAAAGATG CGCATT
|
Protein sequence | MGEIDIKKMK VAELRTELER RGLPVSGNKP DLINRLQARL DEEEFGLIDA PATTINTTMT NLNETGEATG DNPSDSATER EKKGLVPTKT EEMGEESKTD KFISEAHDAN NVKVSAQPTF DEIKRRRANR FDIPVVETKV QSPQKRGRDK KKSANLPILQ KGSKRARQQA SEKDDVLLPK EEIEKRLKRA EKYGTGNEEN ILKLKAMLRK YRF
|
| |