Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42116 |
Symbol | |
ID | 7202200 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | + |
Start bp | 848733 |
End bp | 850008 |
Gene Length | 1276 bp |
Protein Length | 417 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181275 |
Protein GI | 219121859 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGGAA TCTTTGTACT CGGCTACATG GGCATTATTT TCGAAGAAGT CTTTGAATTT AATAAAGCCG GCGTCGCGCT CTTGATGAGC ACCGGATTAT GGGTGACCTA CGCGGACTTT TACAACAGTG CCGGTACGGC GTCCACGGCT GTACTGGAGC AACTGGCGGA ACAACTCTCG GAAGTATCCG ATATTTGCTT TTTCCTCCTG GCCGCTTCGA CAATTGTGGA AGTGGTGGAC GCCCATCAAG GGTTCAAAGT TGTCACCAAC CAAATAAAGA CCACTTCCAA AAAGTCTCTG TTTTGGACCA TTGGATTCCT GACCTTCTTT TTGTCGGCCA TTCTTAATAA CTTGACCATC ACAATTGTCA TGGTCAGTCT ACTGCGCAAG CTCGTGCCCA ACGTAGATGA TCGTCGTTTA TTCGGAGCCA TGGTTGTCGT GGCGGCCAAC GCTGGTGGTG TTTGGACGCC AATCGGGGAC GTGACCACGA CCATGCTATG GATTAACAAT CAACTATCAA CGATTCCGAC CGTTCTCGAT CTCTTTCTAC CGTCGCTAGC ATGCTTGGTA GCTTCCTTGG CCTTTTTGGT CAACAAGGTG GAAGAAGACG ACTCTTTAAA GGCATCGACA CTACCGGAAC CGACCCCGTT GTCGCAACGT GGGCAGTTGG TCTTCTACAG TGGAATTGCC GCTCTGTTAT CGGTGCCTGT CTTTAGCGAA CTGACAGGAC TGCCACCGTA TCTGGCCATG TTAACGGGTC TTGGGGCCAT GTGGACCCTG ACCGACATCA TTCACATGGG AGACAAAGAG GAAGGGCTCA AAGTGCCGGC GGCCTTGTCC AAATTAGATA CATCCGGCAT TCTATTTTTC CTCGGAATTC TCATGAGTAT TGGCGCATTG GACAAGAGCG GCTTGCTCAA AAGTCTAGCC GTCTTTCTGT CGGACAACTT GCCCAGTCTC GATATTATTG CTACCGTTAT TGGTATCGCA TCGGCCTTGA TCGATAACGT TCCGTTGGTC GCGGCAACCA TGGGTATGTA TGATCTATCC GAATATGGTA CGGACGATAA ACTCTGGCAG TTGATCGCGT TGTGTGCTGG TACAGGGGGT TCCATTCTAG TAATTGGCTC CGCCAGTGGC GTGGCCCTCA TGGGACTGGA GAAGGTGGAC TTTTTGTGGT ACGCCAAGAA TGTTTCGATC GGAGCCGCGG TAGGGTACTT CGCCGGAATT GCAACATATT TGGCCCAGTA CGCAATCTTT CACGGTGATC TGCTGA
|
Protein sequence | MIGIFVLGYM GIIFEEVFEF NKAGVALLMS TGLWVTYADF YNSAGTASTA VLEQLAEQLS EVSDICFFLL AASTIVEVVD AHQGFKVVTN QIKTTSKKSL FWTIGFLTFF LSAILNNLTI TIVMVSLLRK LVPNVDDRRL FGAMVVVAAN AGGVWTPIGD VTTTMLWINN QLSTIPTVLD LFLPSLACLV ASLAFLVNKV EEDDSLKAST LPEPTPLSQR GQLVFYSGIA ALLSVPVFSE LTGLPPYLAM LTGLGAMWTL TDIIHMGDKE EGLKVPAALS KLDTSGILFF LGILMSIGAL DKSGLLKSLA VFLSDNLPSL DIIATVIGIA SALIDNVPLV AATMGMYDLS EYGTDDKLWQ LIALCAGTGG SILVIGSASG VALMGLEKVD FLWYAKNGTS PELQHIWPST QSFTVIC
|
| |