Gene Information |
Locus tag | PHATRDRAFT_44961 |
Symbol | |
ID | 7199488 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 802536 |
End bp | 804620 |
Gene Length | 2085 bp |
Protein Length | 682 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179067 |
Protein GI | 219116544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.114443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAACG ATGACAGCGA TAAGCTAATT CATCCCATGA CGGCGGTTCC CGACGCCATC CGCATCGTAC TGACCGAGTG CGCCCGCGTG CTCCTGCAAA AATCGGATTT TGCTCCACCA GTTTTGTCGA TTGATGCCCA TGTCGATGGT GGGTCCTTGC TAGGTCAGGT ACTTGCCGAG CCAGTTGGCA TGTCTGAACC AGGGTATCCG CCCTACCGAG CAAGCATTAT GGACGGATTT GCCATTCGAA CGACCGATCG GTTTCCATCG ACTTTTCCAA ATGGATCTTC CCCGCCGACA ACGAAGCAAT GGACCCATAC AATTGCTGGA AAAGTGTTCG CTGGTAATAC TTCCCAAACA AAAGATGACA CCGGTACCTC ACAGACAGTG GACGCGTGTC AGTTACCCAC AGCATACTAC CTCACCACAG GAGCCGTCGT TCCTAATGAC TTTGACTGCG TGGTTCCTGT CGAAAAATGC ACCGTCAACA ACAAGACAAA TCCGACTTTT GTGAATATAC AGGCAGTTCC GGCCGACATC CAATCAGGTA AATGGATCCG CGATGTAGGT TGCGATATAC CAGCCGGTAT GGAAATGTTG CCACGGGGTC ACGTTCTGGA TGCCGTCTCC CTGGGTTTGA TACGGCAATC TGGTTGTGAG CGCGTAAGTA TTCGGCGTCG ACCAGTCGTG GGCGTCCTTT CTACGGGCAA TGAGCTACTA GGTGACAATG TACGCGGGGA ACAATCTCGG CACGGTATGA TTCCGGATGT GAATCGACCC GTTCTGCTTG CCACGTTGAA ATCTATGGCA AACTGTACGA CTGTGGATTT GGGGTTGGCT CGAGACGATA GCGTCGACGA TATGGCATCC CACCTTCAGT CTGCATTGGA GCGCTGTGAT GTAGTGATTA CAACTGGCGG AATTTCCATG GGAGAAACGG ACATTATTGA AGAAGTACTA GTGGAACGAC TCCAGGGTAA AGTACACTTT GGTCGACTCC TCATGAAACC TGGTAAGCCC ACAACCTTTG CTACGGTTGT GACTGCACCG TCCTCTGGGA CCAAGCTAAT CTTCGCCATG CCAGGAAATC CTGTGAGTGC AGTCGTCTGT ACACACTTGC TGGTACAGCC GTGCCTCGAC TTGCTCCATC ACGGACCGGA TAGTACAGCC GATACTTACG GAGAGAGTGT GGAAGAACAA ATACACCGCG TTGTGTTGAA TGCCGCAGTC CATCCAGAAG TTCAGGCAAT TCTCAGTCAG GACATCAGAC TGGATCATGA ACGACCCGAA TATCATCGCG TTCAACTGCG AGAACAAATG CCGGGAGGAA GCGTTTTCGC GTTTAGCACG GGGGTCCAAC AGTCTTCCCG ACTCATGAGT ATGCTTGGCG CCGATGCCCT TCTCATTTTA CCGCAAGGAA CGACGTCCAA ATCAACTGCC AAAAAAGGTG AAACGTACAC GGCACTCCTA CTTCGTCATC GCAGTCGCTA CCCACAAAAA CTCGTGACCG AGGCTCAGCA CTTGAATCCA ATCTCTTCAA CGGGAAACAA TATCCGGATC GGCGTTGTCT TTGCCGCCCC AAGTGTGCAG TTGTTACTCA CCCCTACGCT AGAAGAGATC ACCGAGTCGG TCCAAGCTGC AATGGCTGGA TCCAAAAAGA GCCACAGCAT AGAAATATCC TCCACACAAC TGTACACAGG CAGCGCAACC AAAATTGAGA ATTTTCTTAA CTCCATACCG CTGGATGTCG ATATTCTCAT CATTACTTAT TCAAAACGAC AATTTCGGTA TCAGCTGGCC 
TTGGCCAATT CGCTGCGCCA TGCGCTGATC AAACGTGCCG ACTGGATAGC GTTGCAGGCT CGTCAAGGTT GTGCCGCATA CGATCCTACC ACTGCCGCTT CTGAAATGGT GGTCGGTTTT TGGGAACGAA CGAAAGAAAC CTCCTTATCC GATGCGATAG TGGTATGTTT GCCGGCCGAA GGGGTCGGAG GATTGTCCCA TGTGCGGGGG GTTCTGCGAC ATGCGCTCCG CGTGGCACGC GGGGCCGGTC ACTCTGATGA AGGCTATCCT CGGAAGGAAA GCTAG
|
Protein sequence | MTNDDSDKLI HPMTAVPDAI RIVLTECARV LLQKSDFAPP VLSIDAHVDG GSLLGQVLAE PVGMSEPGYP PYRASIMDGF AIRTTDRFPS TFPNGSSPPT TKQWTHTIAG KVFAGNTSQT KDDTGTSQTV DACQLPTAYY LTTGAVVPND FDCVVPVEKC TVNNKTNPTF VNIQAVPADI QSGKWIRDVG CDIPAGMEML PRGHVLDAVS LGLIRQSGCE RVSIRRRPVV GVLSTGNELL GDNVRGEQSR HGMIPDVNRP VLLATLKSMA NCTTVDLGLA RDDSVDDMAS HLQSALERCD VVITTGGISM GETDIIEEVL VERLQGKVHF GRLLMKPGKP TTFATVVTAP SSGTKLIFAM PGNPPCLDLL HHGPDSTADT YGESVEEQIH RVVLNAAVHP EVQAILSQDI RLDHERPEYH RVQLREQMPG GSVFAFSTGV QQSSRLMSML GADALLILPQ GTTSKSTAKK GETYTALLLR HRSRYPQKLV TEAQHLNPIS STGNNIRIGV VFAAPSVQLL LTPTLEEITE SVQAAMAGSK KSHSIEISST QLYTGSATKI ENFLNSIPLD VDILIITYSK RQFRYQLALA NSLRHALIKR ADWIALQARQ GCAAYDPTTA ASEMVVGFWE RTKETSLSDA IVVCLPAEGV GGLSHVRGVL RHALRVARGA GHSDEGYPRK ES
|
| |
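The length fields in this record can be cross-checked against one another. The short Python sketch below does so under two assumptions that the record does not state: that the start and end coordinates are 1-based and inclusive, and that the listed gene sequence includes the terminal stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table of this record.
start_bp = 802536
end_bp = 804620
gene_length_bp = 2085
protein_length_aa = 682

# Assumption: coordinates are 1-based and inclusive,
# so the span on the replicon is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Whatever remains of the gene span is not translated
# (for a spliced gene, this would be intronic sequence).
non_coding_bp = gene_length_bp - coding_bp

print("coordinate span (bp):", span_bp)
print("coding length (bp):", coding_bp)
print("untranslated remainder (bp):", non_coding_bp)
```

Under these assumptions the coordinate span agrees with the reported gene length, and the small untranslated remainder is consistent with the record marking the gene as spliced.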