Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22453 |
Symbol | |
ID | 7203628 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 525396 |
End bp | 528716 |
Gene Length | 3321 bp |
Protein Length | 622 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182977 |
Protein GI | 219125414 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTGCAG CCCTGGCGTC GGATCGTCTC GGAGCGGATT CAATTATACT TGCGGCTCTA ACGGCGCTCA TGGCTGCTCA GGTTATCACG GTGAAAGAGG GCTTTCAAGG TTTCGCCAAC GAAGGAGTAC TAACTGTCCT TGCTCTGTTC GTCGTTGCTG AGGGGATCAC CAAGACCGGA GCGCTCGATT GGTACATTGG AAAGGTTTTA GGAAGCCCCA CCAATGCAAG TTCAGCACAG CTCCGTATCC TCGCACCAGT AACGTTTGCA TCTGCCTTCC TGAACAATAC TCCTATTGTT GTCGTGATGA TCCCAATCAT CCAAAAATGG GCCAGGAACA TACATATTCC GGTTCAACAA CTCATGATTC CGCTCTCGTT TGCGTCGATT TTGGGGGGAA CTTGTACACT GATTGGAACG AGTACCAACT TAGTCGTGGT GGGTCTTTTG GAAGAGCGCT ATCCCAACGA TCCAGATGTC AGCATCGGCC TCTTCGACAT TGGACTTTAC GGCGTCCCAG TCGCTCTCGT GGGGATGGCA TATATTCTTT TTGCTTCCCC TTATTTGCTG CCTGGTGGAG GGGGGCAGTC TCAAGATGCC CTAAATTCTT TGGAGAAAAA TGAAGAGATT TTGCTTGGAG CCCGGCTGAC GTCGTGGTCC CCTGCAGTTT CGAGGACTGT CAAGCGTAGT GGTCTGCGTG ACACTGGAGG CATCTACCTG GTTTCTGTAG TACGGGCAGC AACGGGCCAC GTTCACAGAG CCGTTTCGCA TGATTTCGTA CTGAATGTCG GGGACGTCCT TTACTTTACT GGACTTGTTG AGAGTTTCGG AGAGTTCTGT GAAGAACACG AACTTGAGCT ATTAACAGTC GACCATGATT TTCACGAAAC GAGTATCGAT GGCACAGTTC TTACACTTCC AGAAAAAAAA CAAGTGAACT GGGAGTCATC TGACGATCAA GATTGTGATC TCACAAAGAG ATCTGTGGTC ACTAGATTTT CTGAAGAGGG TGAAAGCCAC CGTCTGATTA ATCGAATGAC AGACATAATC CGAGGTGCAG AGTCTGGGGA AGAAGGGACT TATGCTATCC ACGAATCTCG ATTTATTGAT GACCCTGCGA AAATTGTGGT CACAATCGAC AAGGCCCTGG TGGTGGTTGG AATTGACGTC CAAGATCGAC CCGGGCTCAT GGTGGACATT TCAAAGGGAC TTTTGCGCCT AAATCTGGAG TTGCACCATA CCGAGGCCGC CGTTATTAAA GGGCGTTCCC TTTCCATTTG GCGCTGTGAA GTGATCGGAC CACAAATACA CGACCAAGAA CAAATTTGGT CGGGTATCGA GTCGATTCTT GCCACTTGGT CCGGTATCAG TGCCCTCAAA CACAGAGGTC TCCGTGTGAT TCGAACGCGA GTTGTTAAAG GATCTAGATT GGTCGGTCAC ACAGGAGCAG AAGTCGACTT TCGGCAAACG TACGAGGCTG GTATTGTCGC TCTGCAAAAA AAGGGGAAAA ACGCAGAAAT TCCTCTGTCA AGAGTCCGGT TTGAAGTTGG CGATATTCTT GTTTTGCAAG CAAATGATGC CTCGCCACTG TTGAAGATTC CTCCCGCGGA ATTCTACATG TATCTTTCGG AAGGCTCAGG AACGGATGAG AGCATACCCA GAACTTCTTC TGTAAGAAGC ATGGTAAATA TGGTGACGCG GCGAAAGGCT AGCACTGATG TTACCCCGGA ACTCGCGAGC GCTAGGAAAG TAATCGACAA TTCCCACCTC GCACATCATG GTGACGAGGA AGATCCTGCC GTTGTTGATC TGCCGGAATT AGTGCAACAA ATTGAAGAAC AGGAGGCGGT TTGGAAAGAT CTGCAGCTCC TCTTGCCTGA CGAAGGGATA CATAGCGGTG ATGGAGCAGC TCGCGAGTTT CTTACCGCGA TGCAAGTTGC CCCAAAATCC AAGTTGTCGG GGAAAACCGT TGCAAAAAGT GGCATCGACA GGCTTCCAGC ACTTTTCTTG GTAAGTATCG AACGCCCCAT CCCTGCAGGG ACCTCTTTGC CAACGAAGAA CAAAAGACTA TCAGTGATGT CTGGCGCATC CGATGCGCAT TCTCTGGGAG AGGACAGCAA TCAGCGCCTT GGCTCGATTC AAACAGACAA TCAGGCATAC CAATCCATTG CTCCAGAGGA GCCGCTTCAG CACGGAGATG TTCTATGGTT CTCCGGCTCT GCATCGTCCG TTGGCGATCT GCGCAAGATT CCAGGGTTGA TCTCGTATCA AAACGATGAG GTGGAGAAAA TCAACGAGAA GGTCCATGAT AGACGTCTGG TTCAGGCTGT CATTGCCAGA AAAGGACCAT TGGTCGGGAA GACTGTGAAG GAGGTCCAGT TCCGGAAGCG GTATGGAGCC GCGGTGATTG CTGTACATCG CGAAGGCAAG CGTGTGCACG AGCATCCGGG GAACGTGAAG TTGCAAGCAG GTGATGTGCT GTTACTGGAG GCGGGTCCTT CGTTCATCGC CAAGAGTGGT GAGAACGACA GATCGTTTGC TCTGCTAGCT GAAGTGGAGG ACTCGGCCCC TCCTCGTTTG AGTCTTTTGA TTCCTGCGTT GTTGATCACG GCAGGGATGC TGGTTGTATT TATGGCTGAC TGGACGTCGC TATTGGTTTC TGCACTAGTG GCTTCAATGT TGATGGTAGC TCTTGGTATT TTGTCAGAAC AGGAGGCTCG GGATGCGGTG AATTGGGAAG TATTTATAAC TGTTGCTGCA GCCTTCGGCA TTGGTACAGC TCTTGTCAAC TCAGGGGTGG CAGGAGGGAT TGCTAACTTT TTGGTGGACG TAGGTACTGC ATTAGGTATC GGGGAGGCAG GGTTGCTTGG AGCCGTGTAC TTCGCAACCT TTCTTATTTC AAATGTGGTC ACGAACAATG CAGCGGCGGC TCTGTTGTTC CCTGTCGCGT TGAATGCAGC GGAGCAGACA GGCACTGATC GTGTTTTGAT GAGTTATGCG TTGATGTTGG GCGCGTCAGC CAGCTTTATG TCACCTTTCG GCTACACAAC GAATTTGCTG ATCTACGGTC CTGGAGGATA CAAGTACAAA GACTTCCTTT TGTTTGGAAC CCCAATGCAG ATCGTGTTGT GGGTAGCGTC AATTGCCTTC TTGGCGATCA TTGAGCCTTG GTACATTAGT TGGATCGCTG CAGCGGCCAT TCTTGGGATT ATCATTGCCC TCCGGTTATT CTGTCTTTCG CGCACAGCCC TCAGAGCTGG GGAGGAAAAA TAAGCTCACT ACTCATGCTA CATAATTAAG ATAGGTTTTG TTTTCTTTGC T
|
Protein sequence | MFAALASDRL GADSIILAAL TALMAAQVIT VKEGFQGFAN EGVLTVLALF VVAEGITKTG ALDWYIGKVL GSPTNASSAQ LRILAPVTFA SAFLNNTPIV VVMIPIIQKW ARNIHIPVQQ LMIPLSFASI LGGTCTLIGT STNLVVVGLL EERYPNDPDV SIGLFDIGLY GVPVALVGMA YILFASPYLL PGGGGQSQDA LNSLEKNEEI LLGARLTSWS PAVSRTVKRS GLRDTGGIYL VSVVRAATGH AYQSIAPEEP LQHGDVLWFS GSASSVGDLR KIPGLISYQN DEVEKINEKV HDRRLVQAVI ARKGPLVGKT VKEVQFRKRY GAAVIAVHRE GKRVHEHPGN VKLQAGDVLL LEAGPSFIAK SGENDRSFAL LAEVEDSAPP RLSLLIPALL ITAGMLVVFM ADWTSLLVSA LVASMLMVAL GILSEQEARD AVNWEVFITV AAAFGIGTAL VNSGVAGGIA NFLVDVGTAL GIGEAGLLGA VYFATFLISN VVTNNAAAAL LFPVALNAAE QTGTDRVLMS YALMLGASAS FMSPFGYTTN LLIYGPGGYK YKDFLLFGTP MQIVLWVASI AFLAIIEPWY ISWIAAAAIL GIIIALRLFC LSRTALRAGE EK
|
| |