Gene Information |
Locus tag | PHATRDRAFT_37419 |
Symbol | |
ID | 7202307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 98438 |
End bp | 99976 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181481 |
Protein GI | 219122291 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
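The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length; neither convention is stated in the record itself. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 98438
END_BP = 99976


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1539, matching "Gene Length"
print(protein_length(length_bp))  # 512, matching "Protein Length"
```

Under these assumptions the computed values agree with the Gene Length (1539 bp) and Protein Length (512 aa) fields.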
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.245451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAGAC GAGCCTTTTT GGCCGTCGCC TTTGTGTTGT CGCTCTTGGC GCAATTTCAT ATCCACCGCT CGGGTATATC AATTGCCGTA TCGCCATTTT TCGATGACCA AGCTGTCATT CCATCTTTAT CGGTAACGGC AAGAGAATGG GAGTATCGAA CGAATCACCA GCCAGCCGCA ACCAATACAG ACCCGACGAC GGAAAGGGGA ATGGATGCTG GTGACTCCGT GTCGAATCTA CTCGAGGGCC TTCCCGATTG GGTGCGCACC TACATTGACT GGCATCAGCT CCAACGCCAA AAATTTCCTG GCACAAAACT GTTTACCGAC CCCGCTGCTC CTCCTGTACT CCTGCGGACG TGTTACCGCA TATGTGGCGG CCTACACGAT CGATTGGGCA AGCTACCCTG GGATCTGTAT CTAGCCAACC AAACGGGTCG CGTCTTGCTC ATGAATTGGT GCCATCCGGC ACCACTTGAA GAGTTTTTGC TGCCTAATTT GCTGGATTGG ACCCTACCAC GGGATCTAAA TGCTGCCTCC GTGGATGACG GAGTCCCCGC TGCCATGAGG TTGTTTCACA ACGAAACAAC GTGCCTATGG ACCGAAAAGA TGATTCCTAC TTTGTTCGAC GAGGCGTCAC ACGAGCATAG ACCTCGGCCC GAGTTTTGGT CACACTACTT GGACGAGGGC ATTCGCCAAG ATAACGTTGC GAGAGGCGAT CAGAAACGCC TTCAAGAGAA AAAGGTTGTG CGGCATCGCA TTCTGGGGGG AGAATTGGAG TTTCAGAAGC GACTGGAAAT GGCTGGGGAA ACGACCGATT TGATCGACTG GACACCGTCC TTTGGCAAGA TATTTGGGGT TTTCTTTCAG CCCAGCTTTG GTGTACAGCG AGAACTGGAC AAAGTGTATC GAGATCTGCG TCTAATACCG GGTCAATATT CGGCCGTCCA CTGCCGGGTT CGTCATCCCA AGGGAATAAA AAAGTTTTCC AAAGGAAAAA CACCCGTACC GGGCGGCCCT GATCGAGTAG GCTTACTGTG GGAAGGTGAG GGCCGGGAAT TTGCAATTGA AACGGCGGTC CACGCCCTGC AATGTGGACA AACCTTGCTC CACCAGGGCG CCAATGAAAT GGAACCCATA TACTTTTACT CTGACTCGGA AGACTTGGTG CGCTACGTGA CGATCGAGCT GCATGACCCC ATGTTTGAGA AAAATTACTC GTCCATTTTA GATTCCAACA AGATCCACGC CACAGCTCGC AAGGTAGTGC AAAGGAGTCG CATTGTGGGG CGGCACGATG GGACAGCTAA TCTACATATT GATCGACAAG GGGGAGCTAA ACCGACAGCG TATTACGGAT CCTTCGTTGA CCTAATGGTT GCGGCACAGT CTCGTTGCGT CACTTACGGA GTCGGCAACT ACGCCATGTT CGCTTCCAAA ATATCTGGCT CCAGTTGTCG CCTTCAACAC CAAGAAGAAT CATGGGGGGT AGACGAAAAC AAGAAGGAGC GGACAAAGCT CTGCTCACTC CCGTCGTAG
|
Protein sequence | MTRRAFLAVA FVLSLLAQFH IHRSGISIAV SPFFDDQAVI PSLSVTAREW EYRTNHQPAA TNTDPTTERG MDAGDSVSNL LEGLPDWVRT YIDWHQLQRQ KFPGTKLFTD PAAPPVLLRT CYRICGGLHD RLGKLPWDLY LANQTGRVLL MNWCHPAPLE EFLLPNLLDW TLPRDLNAAS VDDGVPAAMR LFHNETTCLW TEKMIPTLFD EASHEHRPRP EFWSHYLDEG IRQDNVARGD QKRLQEKKVV RHRILGGELE FQKRLEMAGE TTDLIDWTPS FGKIFGVFFQ PSFGVQRELD KVYRDLRLIP GQYSAVHCRV RHPKGIKKFS KGKTPVPGGP DRVGLLWEGE GREFAIETAV HALQCGQTLL HQGANEMEPI YFYSDSEDLV RYVTIELHDP MFEKNYSSIL DSNKIHATAR KVVQRSRIVG RHDGTANLHI DRQGGAKPTA YYGSFVDLMV AAQSRCVTYG VGNYAMFASK ISGSSCRLQH QEESWGVDEN KKERTKLCSL PS
|
| |
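The sequences above are displayed in space-separated groups of 10 characters, so they need to be joined before any downstream use. The following is a minimal sketch of that cleanup, plus a simple GC-content calculation of the kind that presumably underlies the "GC content" field; how the record's 52% figure was actually computed or rounded is not stated, so treat this as an assumption. The helper names `clean_sequence` and `gc_percent` are hypothetical.

```python
def clean_sequence(grouped: str) -> str:
    """Join a sequence displayed in whitespace-separated groups into one string."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (assumed definition of GC content)."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First three groups of the gene sequence above, used only as a short demo.
demo = clean_sequence("ATGACCAGAC GAGCCTTTTT GGCCGTCGCC")
print(len(demo))                   # 30
print(round(gc_percent(demo), 1))  # 60.0
```

To check the full record, paste the complete "Gene sequence" value into `clean_sequence` and compare the length against the Gene Length field and the result of `gc_percent` against the GC content field.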