Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44421 |
Symbol | |
ID | 7197879 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 498864 |
End bp | 503120 |
Gene Length | 4257 bp |
Protein Length | 1206 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178236 |
Protein GI | 219114881 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.130794 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGTC GAAGTACCAG CTCTGTATAC TGTAGTCTAA GCTTTGTCCT GGTACTATCC ACGATACCTG GCCAGTCCGA GCATCAAACA CAACGACGAC AATGCAAGGC TGGGAACGAC GATGATACCT GCGACAATGG ACATGATCCA TTCGTTCGAA AGCCCCTTGA TGGCTTTCCA GCTCGCTGTG GCCTTTACAT GGCGGAGTCG TCCATCCCAC ACGCCGGATG GGGCATGTAC ACGGCTCAAG ATCTGTTCGA AGGTGACGCC ATCCAACCCC TCGATGTGTC CATTCCCGTC TTTGACCTCG AGCACCATCA AAACGTGATT TCCAATAAGT TCAAGCGCGA AGTTCCCGAA TGGCTTATGG GTCAGTACTA CTGGAACGCT GATGTGATTT ACGCGCAGTT CGACGCCATT GACGTTAAAG GTATTTTGCC CGGTTTCGGC ATGCTCGCCA ATTCGCACGT TGGTCTGGTC AACGCCGAAA ATGGTGGCAA TCGAGATCGA TTGGCACTCG GTCGGGAAAC ACCGCACGTG GGAGCTTCGA GTCACTATCA GGATTTGACC TTTACCGTGC TTGATGATTT ACCTGCCGGC CACGAAATCT TCGTTGAATA CGGCGATGAA TGGTTCGAAG ATCGATCACA CGTCTTTGGT GATGATCTAC CTTTATCCCA TCATTTCGAG AGTGCGGACA CAATTCTGGC GCAATGGAAA GAGGTCGTGC CGGGAGATTT GGAATCCAGC CTGGGAACCG ACTTGTACAA TTTGATACTG GACGGCCTGG GACATTTTCT GTCGCCGAGG CTGCGCAGAG CCTTGCCACA AACTGCATCG GATGCAGTAT CTCAGGCCAA TGGTCAAGGA ACTGCATACG CAACCGTACC TAATGTAGTT CGCGATATTG ATTGGCTAGA AGAAAACGGA ATGTGCCTCG ACAATTTAGC ACCCCGGGAA AATGTTGACG CCGGCAACAA GGGAGCTTTT GCGACGCGCT TTCTGTCTCG CGGTTCCCTT GTCGCGCCAG CACCTGTCAT TCAACTATCC CGACAACACT TGGAAATGAT CTTGGTGGAT GCGTACGACG AAGTACTTTG GCAAGGTCAC CAACTGCTAT TGAACTATTG CTACGGACAT GCAGGTTCTT CACTTTTATT TTTTCCGTAC AGCCCCGCTA CCAACCTTAT CAATCACGGT TCCGGAGAAA GGGCCAACGT CGGGGTGCGC TGGTCGGACC GAATGTCAAA CCCTGAAATG CTGCAGTGGA CAGCTGACGA AATCCTAGAA TCCAACGAGA AAGCTGGTCT CATGATGGAA TTTTACGCGC TACGCGACAT CCAACCGGGG GAAGAAATAT TATTGGACTA TGGAGACGAA TGGCAAGACG CGTGGGATCG GCACGTTAAC GACTGGCGAC CACCGATTTA CGAAACGGAT TATACGCCGG CGTACACTTT CGACCGTCAT GATGAAATTT ATACGTTGGA AGATGGAGAT CTGTATCCAC CGCCCTATGT GCAGGTTCGT TGCTACGTGA ACGAAGACGA ACCAGGCGTA CCAGATGAGG ATGGATGGTA CCAGTGGACG CCTGTTGAAA ATGAGGATTT GTCGTACACC GTTCCTTGTA CGGTTCTTTC GAGCGATGCA GTCAACGATA AAGAAAAGGC CTACCGAGTG CAGGTCAACG CAAACGAAAA CCTCAAATTC AAAGCTGCAC CGTGGTCGTC TATTACTTTT ATCGACGTGA GTTACACAGG CAATCAGCAT CTTCGACAAG GGTTCCGACA CGAAATCAAA CTACCTGATG CAATGGTACC CGATTGCTGG CGTGACATTG AAAATCCTCC CGACAACGAA CTGTGTAATT TGTTCATGGC CGAGTCTGCG ATACCCAACG CTGGTCTCGG CATGTTTACG GCGCGACGGT TGGACAAAGG CGAACTGATA TCTAGCGGAG ACGTAGTGCT CCAGATCGAA GACGCCGACT TGAACAAGAA CCTTCGCTTC TGGCGGCAAG GTATTACTGA TATCGACGAA CCCCCGTGGC TATTGGAAAA TTATTTCTGG AATCCATCCA ACACCTTTGG ATCGTTCGAA GCACATGATG TGGAGAGTAT TGTACCAGGC TTGGGCATGC TAGCGAATTC ACATCCAGGA CTGGTGAACT CGAAAATGCT GCCGTCCTCA GCAACAGCTG ACTTGCATCG CGGCCTAGAC CCAGGTGCTG GTGCATCGAC ACACTACCAT AATGTTAGGT TCAGAGCTAC CGACAAAATT GAGGCTGGCA CAGAACTTTT TGTGAAGTAC GGCGACAGCT GGTTTGAAGA GCGAGAAGAT CTTGGGCCTA TTCCATTGTC GGACGACTAC CAACGAGCTG ATCGAACTGT GAAACGTTTT TGGAAAATTA TCGATGGCAA CACGACAAGA GAACTGGCAC GTGATCTGTG GGACTTCATT TTACAAGCAT CATCTATTCC TATGCGCCAC AACATTGCAC TACCACAAAG CCTTGAGAAT ATAGAAAGTA TCTTGAATAA AGGTTCCGCT TATTACTCAG TTCCGGACCG CATTAAATCA ATTGGATGGC TCGAGGAGAA CGGACGCTGC CTAGACAACA TTCGCCCCAG TCAATCAAAG CTTCGTCAGG CTGGAAAAGG GGCATTCGCA ACGCGAGCAA TTCCGAAAGG AAAAATTATA GCTCCGATGC CCGTGGCGCA TGTTCGACGG CATCATATGG ATATCTTCGA CAGTGACGAC CACAGCGACC CTTCAGCGAA CGTTTGGAGC GACGGAACTC AGGTACTCAT GAACTATTGT TACGGCCATC CCAATAGTAC CCTGCTTCTA TTTCCCTACT CACCGGTGGT AAACTACGTC AATCACAATG CAACTTCAGC CAACGCTGAG CTGAGGTGGT CGAAGCTGCC CAATCACCTT GAAAGCTGGT TGGAACGAAC TCCCGATAGT CTTGATTCGG AGGAGCACGC AGGTTTGATA ATGGAGCTAA CCGCAACCAG AGACATAAGC GCTGGCGAAG AAATCTATTT GGATTACGGC ACTTCTTGGG ATGAAGCATG GGCAGATTAC GTTGCAAATA AATTTCGTCC GACAGGCGAG GATTTGTTGT ATAGGTCAAG CGCTGATCTC AACAGAAGAG TCGAATGGAT CAAGACACGA TCCGAACTCG AAACAGAACC GTACTACTTC GAAAGCGCCT TTACCGCTTG CTTTGTGGGG AGAAGTCAGC GAACAGGCAA TCATGATGAG GCTGACGATA GTAAGCTTCA GCAATTCTTG TGGACGCCGG TAGTCGGCAT GTACGATGAC ACATACAACG CATACCCTTG CACTGTTTTG GAGCGGACCG TCAATGACGT CGAAGGGTAT GCACCTCATC GTAGAGACAG CGTGTGGCCA ATGGAAGTTA CTTACAAAAT TCGTCTCCAT CAGCAAAATG AAGGCGATGT CATCATGACG CAAGTACCAC GGCGAGCTAT ACAGTTTTTT GATCGCGCCT ACCAATCGGA TTTGTTCATT CGTAGTGCCT TTCGGCACGA GATTCATTTG CCTGACGCAA TGGTTCCACC AGCCTGGCGA GACTTGATAG AACTAGTGTG AAGGTGAACA ATGTACAGCA AACAGGGATG ATTCACCTAT AAAAAGTCCG CATCGGTCAA CTCTTCGCCG AAATATGCAA ATAGCTTTTC CAATGCCGGT CTTTGTAGTT GTGTGTTGCG CTGAGTAGAC CGCATCAAGT CAACGAAACG TTCATCGTGG TAATTATTCG GTTGCCAGTA CCACAACTGC TGAATGAAGT ATACGTATCG TTTGGCCTCG TGAATCATAT CCCTCGCTAG ACAAAATTCT AAAATCTTGG TATGCGTCAT GTACGGTGGC AAGACGTCGT GTTGATTGAA AAAATCCTGA ACCACCGTCA CGGCAGCCGC CACGTCCCCT ACATTCAAAA GCCCATCGAC AATTGCGCGC ACGGTAAACG GATCCGGACG TAAACCATGG TTCAGCATTT TTCGAAAATA CTCTTGGGCA GTAGCGACGT CGCCTACTGC TGCGAGTCCG CCCAAAATAA GATCAAACGA AATGTTGTTA GGCTCACAAC CGGACGTCGG CATTACGGTG TCCAGAACGT CCATGGCTCT CCAGAGGGCG CCTCGATGTA CGCAAGCCTT GATCAATGCG TTGTAACTGC CAACATCGAG ATCTAGGGGT TGATGCCCAC AGGCACCTTG CTGCAAG
|
Protein sequence | MKGRSTSSVY CSLSFVLVLS TIPGQSEHQT QRRQCKAGND DDTCDNGHDP FVRKPLDGFP ARCGLYMAES SIPHAGWGMY TAQDLFEGDA IQPLDVSIPV FDLEHHQNVI SNKFKREVPE WLMGQYYWNA DVIYAQFDAI DVKGILPGFG MLANSHVGLV NAENGGNRDR LALGRETPHV GASSHYQDLT FTVLDDLPAG HEIFVEYGDE WFEDRSHVFG DDLPLSHHFE SADTILAQWK EVVPGDLESS LGTDLYNLIL DGLGHFLSPR LRRALPQTAS DAVSQANGQG TAYATVPNVV RDIDWLEENG MCLDNLAPRE NVDAGNKGAF ATRFLSRGSL VAPAPVIQLS RQHLEMILVD AYDEVLWQGH QLLLNYCYGH AGSSLLFFPY SPATNLINHG SGERANVGVR WSDRMSNPEM LQWTADEILE SNEKAGLMME FYALRDIQPG EEILLDYGDE WQDAWDRHVN DWRPPIYETD YTPAYTFDRH DEIYTLEDGD LYPPPYVQVR CYVNEDEPGV PDEDGWYQWT PVENEDLSYT VPCTVLSSDA VNDKEKAYRV QVNANENLKF KAAPWSSITF IDVSYTGNQH LRQGFRHEIK LPDAMVPDCW RDIENPPDNE LCNLFMAESA IPNAGLGMFT ARRLDKGELI SSGDVVLQIE DADLNKNLRF WRQGITDIDE PPWLLENYFW NPSNTFGSFE AHDVESIVPG LGMLANSHPG LVNSKMLPSS ATADLHRGLD PGAGASTHYH NVRFRATDKI EAGTELFVKY GDSWFEERED LGPIPLSDDY QRADRTVKRF WKIIDGNTTR ELARDLWDFI LQASSIPMRH NIALPQSLEN IESILNKGSA YYSVPDRIKS IGWLEENGRC LDNIRPSQSK LRQAGKGAFA TRAIPKGKII APMPVAHVRR HHMDIFDSDD HSDPSANVWS DGTQVLMNYC YGHPNSTLLL FPYSPVVNYV NHNATSANAE LRWSKLPNHL ESWLERTPDS LDSEEHAGLI MELTATRDIS AGEEIYLDYG TSWDEAWADY VANKFRPTGE DLLYRSSADL NRRVEWIKTR SELETEPYYF ESAFTACFVG RSQRTGNHDE ADDSKLQQFL WTPVVGMYDD TYNAYPCTVL ERTVNDVEGY APHRRDSVWP MEVTYKIRLH QQNEGDVIMT QVPRRAIQFF DRAYQSDLFI RSAFRHEIHL PDAMVPPAWR DLIELV
|
| |