Gene Information |
Locus tag | PHATRDRAFT_48357 |
Symbol | |
ID | 7203568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | + |
Start bp | 220664 |
End bp | 222148 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182795 |
Protein GI | 219125036 |
COG category | |
COG ID | |
TIGRFAM ID | |
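
The coordinates and lengths listed above can be cross-checked with a short calculation. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state its convention) and that the unspliced CDS includes a terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length(220664, 222148)
print(length, protein_length(length))  # prints: 1485 494
```

Under these assumptions the result agrees with the Gene Length (1485 bp) and Protein Length (494 aa) fields.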
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTCTT CGACTCTTTC TAGAATTGAT TTGGCGAAAA GTAGTAACTA CCACCACGAA ATGCGTGATC CTCTAAAACC AAGGCGTTTG ATATACCCAG AGGGAACTTC TGCTGCTTCT ACGCCGAGTC GCACGAGGAG ACTAATACGG TACTGTAGAG CGATCTCAGT AACGCTCGGC ATGTTGTCTA CGGCAAGCTT TGGACTAAAT GCGCTATTGC TATACAATTC AAATACACCG GGATCGACTG AGCCCTCTTT CGAGAATATT TTGATATCGC TTCTAGAAAA TTCCAAGACC ACTGTGGTCG GTGGTCGAGA AAGGGACAAA GCTGTCGCTA AAGGGGCCCT GGAGTTTTCT TCCCATCGAG CAAGTAGTCA GATACCGCTT TGGAGGCAAA CAGGCAACAG CTCCACAGAA GCTCTGTTGA CAACAACATT ACCAGTCTGG ATGCAAACTT ATGTGGATTG GCATGTGGAG ACACGTGCGA ATTTAACCTC ACAGAACTGG AACTCTACCC GCTACATCTT CATTAGCTGT CTTGCTAGCG ACACAAAATG CGGTGGAGCC AGTGACCGGC TACAGCTGTT ACCTTGGGCT GTACTCATGG CAGCTCGCGG CAATCGCCTT TTACTGATAC GCTGGGAACG TCCTTGCGCA CTCGAAGAAT TTTTGGTACC GAAAGATGTC GACTGGACTG TGCCTGAATG GCTGTGGCAA AGTGTGCAAT TGTACAGTCC GCATCCCAAG CTTTTGATGT CGGGCGGCAA GCCTTCATTG CGACATGCCC AAGCGGCCGA CCTCATTGTC GCTATTAGGC AGCAAGCCCA TGATCACGGC AAAACATTTT ACGATGAATT GAAAGAAGAC AATGAAGCGG GGTTCTATGA AGTGTTTCAT GACGTGTGGA AGGCCTTCTT TCAACCTTCC CCGATAGTTC AAATACAAAT CGAAAAGACC ATGGACGACC TCGGTCTGAG ACCGCGACGG TATATTGCGG CACATGTTCG CCAGAAGTAC CACCGAGACA AGACACACGA CACCGACCAC GTGGACAATG CCGTGAGGTG TGCCTACCAA TCGCGACAAG GCGTTTCGAA CACTATATAC TTTGCCTCCG ATTCAACCGT GGCCACAAAG CGGGCCGTGG ACTTTGGGCG ATACATTACG GCGTTAGCCA ATGATACCGT GCCGTCGAGC ATCAATGTTG TTGCACGCAT CAATGTGTCG GAGCCGCTGC ACCTGGACCG CGGATCTGCG TATTTGCAAA ACACGGATAG CTGGCAATCT TTTAAACCAG ATGACTTCTA CGATGTGTTT GTTGATTTGT ATCTTTTGGC GTCGAGCACT TGTGTGGTAT ACGGTGTTGG CGGCTATGGT CTTTGGGCAA GTCTATTGAC CACGAAACGG TGTTCGTTCC GGCATTCAAG TCGCCACTGC GGCTGGGAAG TTCCGTCGAA CGGGAGCATT GCCCATATGT TGTAG
|
Protein sequence | MASSTLSRID LAKSSNYHHE MRDPLKPRRL IYPEGTSAAS TPSRTRRLIR YCRAISVTLG MLSTASFGLN ALLLYNSNTP GSTEPSFENI LISLLENSKT TVVGGRERDK AVAKGALEFS SHRASSQIPL WRQTGNSSTE ALLTTTLPVW MQTYVDWHVE TRANLTSQNW NSTRYIFISC LASDTKCGGA SDRLQLLPWA VLMAARGNRL LLIRWERPCA LEEFLVPKDV DWTVPEWLWQ SVQLYSPHPK LLMSGGKPSL RHAQAADLIV AIRQQAHDHG KTFYDELKED NEAGFYEVFH DVWKAFFQPS PIVQIQIEKT MDDLGLRPRR YIAAHVRQKY HRDKTHDTDH VDNAVRCAYQ SRQGVSNTIY FASDSTVATK RAVDFGRYIT ALANDTVPSS INVVARINVS EPLHLDRGSA YLQNTDSWQS FKPDDFYDVF VDLYLLASST CVVYGVGGYG LWASLLTTKR CSFRHSSRHC GWEVPSNGSI AHML
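
The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that clean-up, together with a GC-content calculation and a stop-codon check. It assumes the standard genetic code for the stop codons, since the Translation table field is empty in this record. The helper names are hypothetical.

```python
# Stop codons of the standard genetic code (assumed; the record
# leaves the Translation table field blank).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is in frame and its last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks of the Gene sequence field, used as a short example.
example = clean_sequence("ATGGCGTCTT CGACTCTTTC")
print(len(example), gc_percent(example))
```

Applied to the full Gene sequence field, `gc_percent` gives a value to compare with the GC content field, and `ends_with_stop` checks the final codon, which is TAG in this record.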