Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45586 |
Symbol | |
ID | 7200641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | - |
Start bp | 622608 |
End bp | 624858 |
Gene Length | 2251 bp |
Protein Length | 706 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179881 |
Protein GI | 219118203 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCCTT CACAAGTAAC ACGAGTCTCT ACAGGAATAC TTGAAGCAGA AGCACATACC TGCGGGCGAA TGAAGCGCAA GCGGGAAACC CGCGAGGGGA TTCCGGCGGA GCGCAAAAAG ACTTTGCGGA TGGCACACAC CGCAGCAGAA TGCGTTGTAG TTCCTCCAAT AAGGACCCGT AGATCTCAAC GCCTATTTTG GAAACGTCGA CGTCGTTCTG TATGGGACGA AACGCGGCGT AATCTGCTTC TCCTACTGGC CGGTCTGTTG ACTGCAACGC GTGGACAGTT GACCCGCGCT CCCGTGTCCG TGGCTGTCCC ATCCGTGCCT CCCGCGCCGA CTTTACCGCC GACCTCTTTT CCTACAGTGA GTCAGGCGCC ATCTGTGTCT GCCGCTCCGA CAGATCTCCC GACCGTGTCA CCACAGCCGT CGACACTACC TTCGTCGATT CCTTCGCCGC TGCCGACTAT TTCTCCTAGA CCCACGGTTA GCGAAGCTCC TACGGGAGAA CCGACCGTGA CACCTTCCAA CAGTCCGTCG TCAGCCCCTT CGATTGCGCG TCAGCTTTTT GTTCGACAAG AATACAAGCA AATCTATATC ATTCCAACGG AGCGATTTTT TGAACCCGAA GAAATTGCTG CTTTCAACGA AATTTACACA GGGTACGCTC TTTCATTACC AGCGGCAGCC GAGCGCGTGA ACGCTACCTG CGAAATCGGA GCGCCGAGTA TCTCCGCTTG CGTTCCGGGG GTTGATGCAT GTGACACCTT CCCGGATGGC TTTTTGAACG ATTTTCAATT TGCTTGCAAC TGGTCCTCTG ACTTGACGGA GGTTGGCAGG TTTCCTAGCG AGCTGGAAAC ATTTATCAAC TCTGATCTAG CGATGGTAAC CGATGATCTT CAGGCGGCGA ACATCATTTT GACCGAAGCC TTGGCGGCTC GGCTGATTCT TACACAGACG CCGGCTCCAT CTAGTTCTAT GAGGCCGACA GCTTCACCGT CAGCGCGTCC TACGATCAGC CCCCCTCCGT CACTGCGACC AAGTATCGCC CCCTCGGCTT TTCCAACCGC TGTTGTATCC ATGGCACCGT CTTCTCTTCC ACCTTTTACT ATGCCACCGT CTCCGCCCCC CTTGCCCGCA GATAGCAATG GGTTGAGCGT TGGAGCTATT TCGGGAATCG TAGTTGTAAT CGGTCTGGCA GCTCTAGCCG GCCTCGCTTT CTACTATTTT CGTCGCCGCA AGAAGCGCCG TGAACAACGC CTACCCGATG CTGCTAATCA ACGGCGTGAA AAGTATCGAC CCGACCCCTT TGATAGTGTC GCAGCGCTCG CGGATGATCG AACAGATAGC AATTTTGCGC CAATTCTGCA AAGTGAGTCC GTCGTGTCCA ACAAATCGCT ACTTTCGGTC GGCGAATCGA ATATAGAGGA TGAGTCTGAG CACGAAACGG ATGGCACCAA GAACCTCCAA GATGAATTCG ATCTATACAA GAACCCAACG CTTGAAAAAT TGCGGTCTGG CGTGGAAGAT AACGTGTCAG GGTTTGAAGG AATTATGAGT GCCGCAGTCA CCAACGCTCT CATGGGTGTG GAAGAAGCTC AGGTTGATTC CGCCGAGCTC ACATGGGGTT GCGGAAGTAA ATATACGGGC GCCGAACTCG AAGCCAGCGC TTTGTGTGAA GTTGACTATT GGCTTCGACG AAACGAGAAT GCAAGCGTGG AAGGAAAACG CGCTTTTATG CAAGACATTC TCAACCGTGT GGTGGCCAGC GTCCGTTTCG GAAAATTGGG CGCCGACGAT GCTTCTAGGA CCATCCACGA ATCGGCGGCC CTCCTAGAGT TGCCGCTCGC AAACAAGCTT CCTATGGCAA CCGTCATTAT TTCCGGAATG CGGAAAACTG TAACCTCTTT TGACATTACC AAGGCTCTTC GAGAATTCGG AGAAATCGAT GTCGCTGCGG TGGCCTCTGG ACAGCGGGGG TTCGGGATTT TGCGATTTCG GCATCTCAAA TCGGTTGATC GCGCCATGAA CCGCTATCGC AAAGGCGAGA TAGTGGTACA AGATGTAGCG GTACAAATGA AGGCCCTGAT GCCCAGTGGA GCTTTGGAAA GCCGTGCATA GAAGCAAAAA ACGAGAATTT GTCATGTCAC TACGATGAAA TAATTTGTAC GATCGGTGTC GACGACTATA CAAAGCATGG CATTCTGTTT ACCTTGAGAT ACGCATACAG TAGACATCCT GATAGCAAAC T
|
Protein sequence | MDPSQVTRVS TGILEAEAHT CGRMKRKRET REGIPAERKK TLRMAHTAAE CVVVPPIRTR RSQRLFWKRR RRSVWDETRR NLLLLLAGLL TATRGQLTRA PVSVAVPSVP PAPTLPPTSF PTVSQAPSVS AAPTDLPTVS PQPSTLPSSI PSPLPTISPR PTVSEAPTGE PTVTPSNSPS SAPSIARQLF VRQEYKQIYI IPTERFFEPE EIAAFNEIYT GYALSLPAAA ERVNATCEIG APSISACVPG VDACDTFPDG FLNDFQFACN WSSDLTEVGR FPSELETFIN SDLAMVTDDL QAANIILTEA LAARLILTQT PAPSSSMRPT ASPSARPTIS PPPSLRPSIA PSAFPTAVVS MAPSSLPPFT MPPSPPPLPA DSNGLSVGAI SGIVVVIGLA ALAGLAFYYF RRRKKRREQR LPDAANQRRE KYRPDPFDSV AALADDRTDS NFAPILQSES VVSNKSLLSV GESNIEDESE HETDGTKNLQ DEFDLYKNPT LEKLRSGVED NVSGFEGIMS AAVTNALMGV EEAQVDSAEL TWGCGSKYTG AELEASALCE VDYWLRRNEN ASVEGKRAFM QDILNRVVAS VRFGKLGADD ASRTIHESAA LLELPLANKL PMATVIISGM RKTVTSFDIT KALREFGEID VAAVASGQRG FGILRFRHLK SVDRAMNRYR KGEIVVQDVA VQMKALMPSG ALESRA
|
| |