Gene Information |
Locus tag | PHATRDRAFT_34832 |
Symbol | |
ID | 7200232 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 467164 |
End bp | 469322 |
Gene Length | 2159 bp |
Protein Length | 581 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179437 |
Protein GI | 219117285 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGACTTCCG TATGTAACGC GAATACGGGG AATTTGCGCA ATAAGGCCGA CACAAACTCC ACACGTGACA CGTAAACGCC GCACCCATCG GATTGGCTCA CACTTGCGTT TTCCATACTA ACAGTGGGCG TCACCATAGG ATGATCATTT GGCTATAACT TTTCTAGAAA GGCGCAAATG GCTCTGCTGT CCCTACAGTG GGAAAGGAAT TGATTCGAAC TTTCTAGTAA GTTTTTTTTT TAATTTCTGA CTCCTAGGAG AGATTGAGTT ATCATAGTCC AAAAAAATTT GGGCTAGCTC GTGACTCGAG TCACGAGTGG ATTTCGACGC CAAATCGTTG AAACACAAAA GTCACCATGG CGGACTTGGA TGAAGCTTTC ATTGCTTCAT CAGATGATGA ATCTGATAAA AATTGTAGAG TCACAAGGAA ACGTACAATA GAAGATACCG GGACAAAAAT TTCGTCAAAA TTGTCCCGAT CGAAGTCTCC AATGAAGTCC AAACGGAAGC AGACTCGGAT GCCAACAACC ATGGTGGCGC TAATACCAAT GAAAAGACAA GTTGAATCGC TTCTCCGGGG CTCACACTAC GACCAAAACG TTTCGTTACG TGTTATTACA GCCGCCCTCA GGCTTCAAGA GGACTATATT TCCAAAAAAT CAAAAAGAAA AGGGGAACCA GTACCACCGC CCAAAGTGCA AAACACTGTG TGTCGTCTGC TTGGTGTTTC ACCCAAGAAC TACACCAACA TTGTATCGCA ATACCTCATA GACGGCTCTA TTTATCCAAG TAATGCAGGA TGTGACGGGA GGGGTGGAAA TATGGACCGG AAAGAGACTC GAATCCCTCG TACGAAAAAT GTTACAATTG CCATCCGTGA ATTTGTTCGT ACGGAGCGGA AGAATCGGAA ACGGGTGACT GCTCGGCAGA TATTGGATTT CCTGGCTAGA AAAGGTATCC TACGTATACC AGTTGATCAG AGAACAGGAG TTTACGAGAA AACTGAGTTC CGGGCGGCAT TACGGAATGT GCAACGGTTT GTACAAAGCC AGGGCTACCG GAGGGGTCGT CGGAACAACA TAGCCCCAGA TCCGTCACTG ATCATAAAAC GCCACGAGTA CCTACAAGCA TTCTTTGATA ATGAAGCATT GCCAAAAGAG GAAAGGCTCC GCAATGTTTA CATGGATGAG AGTTACATCC ATGAGCATTA TAATAGGAGT GATGATAGCC TTTGGGATCC GAGCGACAAC TTGGATATTC AATTTGGTAA GTCAAAGCAC AAAGGACGGC GATATTGTTT TGCTGCTGCT ATACAGGGCC CAGACCCATT GGTAGACAAC CCCGGAATTG CATCAGAAAT GGCTGGGCTG GTGCCAGGAA CAGTGTGGGC CTTCTGCCCC CAGCAGAAGC GTAGCCACCA AGGGGACTAC TACAAAGTTT TCAATGGTGA AAACTTTCTG GCCTGGTGGA AAGATCAGCT GCTCCCCAAT TTACACCAGC ATTCTCTTAT CCATATGGAT AATGCGGCAT ATCACAAAGT ATATGGGAGT CATGTGCCCA AGTGGGGAAA GCTACGGAAG CAGGAGTGCA TTGATTTCCT GTCGTCAAAG GGAATTGAGG TTGAGGCAAG ATGCCCTGCT GTTGTGGTCA AAGCCCGGAC AAAAGAGTGG ATCCTTGCCA ATGAAAAGTT TGAGTGTGTC AGGTTAGCAG AGGAACAAGG ACATAAAGTT CTTTTCACAC CACCATATCA TAGCGATCTT CAGCCAATTG AGTTGACATG GGCTCGGATT AAAGGAAACA 
TTGGCAGACA GTACAGTGTT GGTACAACAC TGGCACTAGT ACATGAGCGA TTGCTTCATG AGTTCAAAAA CTTGGAGGAG TCAGGGCATG GTGCCATCCA AGGTATGATC AACAAGTGTG TCAGGATCGC AAAAACATTT TATGACGACA TGCCAGAGGA GGAGTTAGCA GAGGAGGCTT TAGAAGATGA GGAAGACGAC GATTATGGTG ACTACGAAGC GGGGTTTGAC GAGGGTGTAC CGCATGAAAA CACTTTAGAC GAAAGCATAG TCGGAGAAGA GCTAGAGGAT GTAATTTTTG CGGCTAACAA AGACGTTGCC ATAGAGAACG AAGACATTGA AGACGTAGTA TCGGTATAA
|
Protein sequence | MTSVCNANTG NLRNKADTNS TQRRKWLCCP YSGKGIDSNF LSHKETYNRR YRDKNFVKIV PIEVSNEVQT EADSDANNHG GANTNEKTTA LRLQEDYISK KSKRKGEPVP PPKVQNTVCR LLGVSPKNYT NIVSQYLIDG SIYPSNAGCD GRGGNMDRKE TRIPRTKNVT IAIREFVRTE RKNRKRVTAR QILDFLARKG ILRIPVDQRT GVYEKTEFRA ALRNVQRFVQ SQGYRRGRRN NIAPDPSLII KRHEYLQAFF DNEALPKEER LRNVYMDESY IHEHYNRSDD SLWDPSDNLD IQFDPLVDNP GIASEMAGLV PGTVWAFCPQ QKRSHQGDYY KVFNGENFLA WWKDQLLPNL HQHSLIHMDN AAYHKVYGSH VPKWGKLRKQ ECIDFLSSKG IEVEARCPAV VVKARTKEWI LANEKFECVR LAEEQGHKVL FTPPYHSDLQ PIELTWARIK GNIGRQYSVG TTLALVHERL LHEFKNLEES GHGAIQGMIN KCVRIAKTFY DDMPEEELAE EALEDEEDDD YGDYEAGFDE GVPHENTLDE SIVGEELEDV IFAANKDVAI ENEDIEDVVS V
|
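The derived fields in this record (Gene Length, GC content) can be recomputed from the coordinates and the sequence. The following is a minimal sketch, not the database's own method. It assumes that Start bp and End bp are 1-based inclusive coordinates, that GC content is the percentage of G and C among the A/C/G/T bases, and that the gene span includes the stop codon. The function names `gene_length`, `gc_percent`, and `min_noncoding_bases` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces and line breaks (as in the grouped 10-base layout of this record)
    are ignored because only A, C, G and T are counted.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


def min_noncoding_bases(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not accounted for by codons.

    Assumption: the span covers every codon of the protein plus one stop
    codon, so the coding part is 3 * (protein_len + 1) bases.
    """
    return gene_len - 3 * (protein_len + 1)


# Coordinates and lengths taken from the record above.
length = gene_length(467164, 469322)          # 2159, matching "Gene Length"
noncoding = min_noncoding_bases(length, 581)  # 413 bases outside the codons
print(length, noncoding)
```

Under these assumptions, the 413 bases left over are consistent with the record's "Is gene spliced | Yes" entry, since a 581 aa protein and its stop codon account for only 1746 of the 2159 bp. To check the GC content field, pass the full Gene sequence string to `gc_percent`.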