Gene Information |
Locus tag | PHATRDRAFT_46242 |
Symbol | |
ID | 7201199 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 700578 |
End bp | 702368 |
Gene Length | 1791 bp |
Protein Length | 564 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180693 |
Protein GI | 219119884 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
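The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon. The function names are hypothetical and not part of the source database. With the values in this record, the span is 1791 bp while a 564 aa protein plus a stop codon needs 1695 nt; the 96 nt difference is consistent with the "Is gene spliced: Yes" field.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


# Values copied from the Gene Information fields of this record.
gene_len = span_length(700578, 702368)   # matches "Gene Length | 1791 bp"
cds_len = coding_length(564)             # 564 aa + stop codon
non_coding = gene_len - cds_len          # nucleotides in the span that are not translated

print(gene_len, cds_len, non_coding)     # 1791 1695 96
```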
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAACT ACAAGAAAGG CGAGCAAATA ACAACCCTTC CAGTGTACTG CGTTGTGCCA GTGTTTGTCG AAGACGAGAA CTCTAATTTT TCTGTAGATT TGCCTTTATT TGATGCCGAC TCTTCTGCTG TACCGGCTCT GCCGAAACCC AGCTTTCCTA AAGATTTACG ATCCCGTCTC CGGCGTCATT TTCAGGTTGT ACATACAGAC GATTCGCATC TTGTCTATGC CATAACATCA TCACTATCGC TTCTGTTTAT GGAGGACGAT TTTCAAGTGA ACACAATTCT CGATGGCCTC CCATCATACG CTCACGATAT CTTCGACGAA AACGTTTCTA TCATCCATTT TTGGAAGTGT GTGGCTAATA AAATCAGCTT GGAAAGGTGG AAAAGTCTTG TACAAAAAGT GTATCAAAAT CAGTACATAC TATGGATAGA TCACCCCTTG CAGGACTACG CTTCGCGTCA GATACCACTA TTGAACTCCA CTCAAAGACA ACTTCTTGCG CGTCGACTAG ATCTATCCCC GAAACGATCT TCTGTACTGT CTTTGATCAA GTTGATACAC GACAAGTACA CTCCTCGACC ATACGGAGTT ATTTGTAAAT TACGTAGGGT GCAATTGATG CATTCCTGCG TCCCGTCTGC ACAGCTGGAA ATGGTTTCTG GTGGCAAGGA GATTGGGGTT ATGGCACTTT TTGATACTGT AGAAAACGAT CCCGTATCTA TCTGTGCTAT TGAAAATAAT AGGTCAATCG AGCAGCGAGA TCGCCTGGTG GAACAGCGTT CGGGCGAATC TTGTAGTTGC CTACGATGCC GGTACGAAGT CGACCCTTCT CGACTGCTAA GCGAACTCAA CACATCGCAG CAAAAGATCT TGGCTCGCTT CTATATGTTT TCTGGTATGC CACACGAGGC AAAACTGCTG TACGATAAGG CTTTGAAGGA TGAACCGGAA AATCCCGAGT TGTGGCACGC CCTCGGCGCT GTGGCACTTT CGACTTGTAG CTTTCTCGAG GCGCAACGGA TTTGGAAGCG AGCAGTGGAC ATGTATCCTG AGGCGTGCAG CAAGCATGCG GGCATTTCCC TCCAAAATAA AAAAATGCGA GCCTACCAAT ACTGGGGACA GGCAAGCGAG ACGAAGCTCC CTCCAGTTCC GACCACGTGG AAGCCACTTT TACCCCAAGC CTTCCTTGCG AAAGCACTTG ACGAAACGAC GTGCAAGCAG ATTATAGTCT GGGCCGAAAG CGCCCGCTGG ACTCAGCAAC GCCATTATGC AGTGCCCACG TACGATGTCC CTGTTCATCA AGTTGAACCT CTGTTGCAGT GGTTTCGCCA CTGGTTTCAA ACCGCAATGG CTCCAACACT TGCAGACCAG TTTGGTACTT CCTCCAACTA TTATGTGCAT GATGCATTTT GTGTTCGGTA CGAGGCTGGC CAGTCCTCCA ATCATTTGCC AGTCCACACC GACGAGTCTA CCCACTCATT CGTACTGGCG TTGAACGAGG ACTACGAAGG AGGAGGTACG TACTTCTACG ATCAGGATCT AATTGTCAAA CTAAGGATTG GTGAGGTCGT TAGTTTTCGA GGGGAGCACT TGCTGCATGG CGGCGAAACA GTGACAAACA AGCGACGCTA CGTAATAGCT GCATTCCTTT ACCACGATGA TGGAAGCAGC AGTCCACTGG CCCGTCTTCG GAAGAGAAAG GCAATTATAC TCAAAAACAC GATTCGTGAG TCGAAGCAAC AACGAACGGC CTTTTCATTT TGTTTCGACA CATCTCGCTG A
|
Protein sequence | MGNYKKGEQI TTLPVYCVVP VFVEDENSNF SVDLPLFDAD SSAVPALPKP SFPKDLRSRL RRHFQVVHTD DSHLVYAITS SLSLLFMEDD FQVNTILDGL PSYAHDIFDE NVSIIHFWKC VANKISLERW KSLVQKVYQN QYILWIDHPL QDYASRQIPL LNSTQRQLLA RRLDLSPKRS SVLSLIKLIH DKYTPRPYGV ICKLRRVQLM HSCVPSAQLE MVSGGKEIGV MALFDTVEND PVSICAIENN RSIEQRDRLV EQRSGESCSC LRCRYEVDPS RLLSELNTSQ QKILARFYMF SGMPHEAKLL YDKALKDEPE NPELWHALGA VALSTCSFLE AQRIWKRAVD MYPEACSKHA GISLQNKKMR AYQYWGQASE TKLPPVPTTW KPLLPQAFLA KALDETTCKQ IIVWAESARW TQQRHYAVPT YDVPVHQVEP LLQWFRHWFQ TAMAPTLADQ FGTSSNYYVH DAFCVRYEAG QSSNHLPVHT DESTHSFVLA LNEDYEGGVT NKRRYVIAAF LYHDDGSSSP LARLRKRKAI ILKNTIRESK QQRTAFSFCF DTSR
|
| |
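The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up together with a GC-percentage calculation. The helper names are hypothetical, and how the database derived its own "GC content" value (rounding, which region was counted) is not stated in the record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between ten-character blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Demonstration on the first two blocks of the gene sequence only.
fragment = clean_sequence("ATGGGGAACT ACAAGAAAGG")
print(len(fragment), gc_percent(fragment))  # 20 45.0
```

Passing the full "Gene sequence" value through the same two helpers would give a figure to compare with the reported gene length and GC content.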