Gene Information |
Locus tag | PHATRDRAFT_30461 |
Symbol | |
ID | 7198241 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 107365 |
End bp | 110252 |
Gene Length | 2888 bp |
Protein Length | 463 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184305 |
Protein GI | 219128198 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTACCGTAT TATATGCTAC GCTTTTTCAG CGGCCAGGAA GGCTAGACTA ACCAATCCAG GATCTCCTCG ATGAAAAGCC TTAAAAGTTA ACTTTTCCGT ACTCTTAGCA TCTCTTTACC TCGTTTGATT GCACCTGACC TGTTGTCACC GGGAGATTAT CCATTGGCCT CCACATGCCT CGATAGGTAT GCCGCTGAAA CTCCCTTTTG AGCATATAGG ACCTTGGGGG TAATGTGAGC GAGATCAAAG TCGATCTACT AGGCTTGTAT GACCGAGTCG ACTACGCTTA TCAGCTTCAA AGCGAGGCAA TACTGTCGAG ACGAAAGATC AAAACACGTC ACTCAATTTC GATATAAGGG ATTCGATGAC GTTTCTGCTA TAGCCTGGGA TCGCCCGCCC CCGTGCCGCT AATTCGAAAA ACTCCTTGTA TTGGGGTACC GCCAGGTTCC CGAGTGGGAC CGTTATTGGA GCTTTTCGTG TGGTTTATAT TACATTAGTT CGTTTTATTG TGACACCCGC GTGCACCCCG TGGGATGGAA ACGCGTAGCT ATGCGTGAGC GCGGTTTTTT TTTCGTATGG GTCTGGCTAT CACTGTCGCC AGAGGACTAG GGAGGGAATT TGCACGGTAG CACCATGACT AATGGAAAGC ACAACGTGGT CGTCAAGCTA GTAGAGGGTG GGAGCGTTGA ACGACGGGTC GGGGCATAGA GTTTTCAAAC TTTGGAATCA GTAAAAGTGC CTCACCGGTT GCTCCCCCAA GGAGTATCCC GATCTAGGAA GGCCTCAAAA CGAGATCACT GCCTCGTCTA AGTAACGACT CGTGAAGAAG TGGTCCCTAT GTCTCGATAG GTAGGCTTCC CGTTGATAGC CTGGAATTCG GATGGGCATG CAGTCTCGGT ATTTCTTTTT GCGAAGTGTG GCATGGGGTG CCCTTTGACC ATCTATACAG AAACCAGGTT GGTTCACGTT GGCGTTTGGC AGTTTTCGAT TCCTTTCGAT GGCAAGAATT TCTTTTTATC TCTGCAGTTA TTGCGTATAT ATATTGAATT TTGCTGGACG GTGTCTGTTC GCTCGCCATC ACCGAAAGGC GTCAATTGTA AGCATCCGCG TCAAACGTCA CAGTCAGTAA CTTTTTTCAA AAATGAAAAA GTCGGACCAA TGTTTCCGGG TCTCTCGTTT TGGAACGGTT CAGCACGGGC GCCACCTCAC AGTCAGCGCA GCATTGCTCC GAATTCCAGC GCGTTTTTAC TCACTCTAAC ACGCTCATAC AGCCACAAGG TAGTACGGAA GAGAGGACAG CCCCGTGGAA TCGTCTCCTC ATTCGTTGTC GTGTATAAAG GCTCGTAGAG TAGCAAGCAC AGTTTTGCCA TTGCCACTAG GATGACCTCC TCTAGCTCCA GCAACGGCGC GAGTACGCGG ACGATGAAGA AGATCGGAAC GCCGACGAAA TCGACGGACC TGGACTGGTC CGGCGGTGAG GGTGTACTGC CAGGACGCGC AGCACTGGGT CCGATCTTTC TCATGAGTGT CACTCCCGTC TTCTCCATTG TCTTTTTTCA CGTGTGCGCC AACATGAAGG GAAATTTCCT CGCCTTTGGA CAGCAGTGTC TGCGGGACGG ATTGGGCGCC ACAATCACGG CCATTTGGCC CGATCCGTGG GACGCAGTCG CGTGGAAAAT GATCCTTTCC TTCATGGCCT TTCAGCTTCT CCTCATGAAG GTGATTCCGG GAGATCGTTT CGAAGCGACA CTCACGCCCA AAGGCAATCG TCCCGTCTAC ATCGCCAACG GAATGGCCTG 
CTACCTCACC ACTCTCGCGG TACTCGTGTT GTTGGATCTC ACCCAGGCCT TTAACCCCGC CACCATTTAC GACAAATTCG GCAACATTTT GTCCTCCATG AATGTCTTTG CCTGGTGTTT TTGCTTGGGA CTGCTTGTCA AGGGTCACGT CGCCCCATCG AGTTCCGATT CCGGTACCAA CGGCTCCTGG ATTACCGACT TTTACTGGGG CATGGAACTC TACCCGCGCG TGCTCGGTTG GGACGTCAAG ATGTTCACCA ATTGCCGCTG CGGAATGATG TTTTGGGCCG TTGCGATTGT TGCCTTTTGC TACAAAAACA TGGAGTTGCA CGGTGGACAC TTGCAGTACG GCATGGCCGT TAGTGTGGCT CTACAACTCA TTTACATTTT CAAGTTCTTT CACTGGGAAA TGGGATACAT GTGCTCCATG GACATTCAGC ACGATCGGGC CGGTTACTAT ATTTGCTGGG GTTGTCTCGT CTGGGTTCCC GCCGTCTACA CCTCACAGGC CTTTTACTTG ACGGCACACG CACCGGAACT GTCCACACTC ACGGCCCTCG CTATTTTCAT CGCCGGCTTT GTCTGCGTCT GGAGCAACTA CGACTCCGAT CACCAGCGCT ACATCTTCCG ACAAACCAAC GGGGATTGCC GTATCTGGGG ACGCCAGCCG GACAAAATTG TGGCCAAATA CGTCGCCAAC GGCGTCGCGA AAGAATCTCT TCTACTGGTG GACGGGTGGT GGAAAATCTC GCGTCACTTT CACTACGTTC CGGAAATTCT GGCCAGCTTC TTTTGGTCCG TCTCGGCGTT GGATACAGGC TTGGTGGGAC CCTACTTTTA TGTCGTCTTC TTGACCATTC TGCTCACCGA TCGGGCCTTT CGGGACGATG ATCGCTGTCG CAAAAAGTAC GGCAAGTACT GGACCGAGTA CTGCGAAGTT GTGCCTTACA AGATTGTACC CGGTGTCGTC TGAGCGACAA GCACTTTACG GCGGACTGTA CTATGGACGA ATCTTTCGAT GGAAAAACTT CCACGCTATA GAGAATAATA GCAACTGCTA ATCCGTTTAA ATGTTTCACG AAATGAAC
|
Protein sequence | MTSSSSSNGA STRTMKKIGT PTKSTDLDWS GGEGVLPGRA ALGPIFLMSV TPVFSIVFFH VCANMKGNFL AFGQQCLRDG LGATITAIWP DPWDAVAWKM ILSFMAFQLL LMKVIPGDRF EATLTPKGNR PVYIANGMAC YLTTLAVLVL LDLTQAFNPA TIYDKFGNIL SSMNVFAWCF CLGLLVKGHV APSSSDSGTN GSWITDFYWG MELYPRVLGW DVKMFTNCRC GMMFWAVAIV AFCYKNMELH GGHLQYGMAV SVALQLIYIF KFFHWEMGYM CSMDIQHDRA GYYICWGCLV WVPAVYTSQA FYLTAHAPEL STLTALAIFI AGFVCVWSNY DSDHQRYIFR QTNGDCRIWG RQPDKIVAKY VANGVAKESL LLVDGWWKIS RHFHYVPEIL ASFFWSVSAL DTGLVGPYFY VVFLTILLTD RAFRDDDRCR KKYGKYWTEY CEVVPYKIVP GVV
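The derived fields in this record (Gene Length, GC content) can be recomputed from the coordinates and the sequence. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (which agrees with the 2888 bp listed above) and that the sequence is given in space-separated blocks of ten bases as shown. The function names `gene_length` and `gc_fraction` are hypothetical helpers introduced for illustration, not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence.

    Whitespace (the spaces between ten-base blocks and any line
    breaks) is stripped before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(107365, 110252))  # 2888
```

Passing the full Gene sequence field to `gc_fraction` and multiplying by 100 should give a value comparable to the GC content field, subject to how the database rounds it.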