Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49638 |
Symbol | |
ID | 7198271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 274682 |
End bp | 276672 |
Gene Length | 1991 bp |
Protein Length | 611 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184335 |
Protein GI | 219128260 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAGAATACA ACTCCTATTG GTTTCCAAGA CTTCAAGAAA CAAAGCTTCT CCTTTATTTT GTGTAAGCTA CAGGAGGTAG GGAGAACTTC ACATCATTCC ATCAGCAACC GTTTCCGGCC ACAACCAATA TGCGGAATCC GTTTCGACGC AATAACCCTG CGGCGTCGAA CGCGAATCCC CGTCCGATGC AACACGAAGC TTCGAACCCG TTTGAGGTGC TACAACGCGG TGCCCGACAA GCTGCTAGCT CCCTTATCGA TAGTTTAGGG GCTACGCATC AGATGGTCGA AGAGCAATTT GAAGCCGCAA CGCAAGCCGC GATGCATGCT TCCATGCAAG CCCCTGCTTC TTCTCAAGGA CCGCCCGCTG CCTCGGCACA GGTCCTGCAC CACCTTCCGC AGATTCGTAT CACTCGTCAA GATTTGGTCG AACCTACCAA TCGCGAATGT TGCGTCTGCT TCGATCTTCA TCGTTTGAAC GACAAGGTTT TGCGGCTGCC CTGTGCTCAT GTTTTTCATC CACAATGCAT CACCAAGTGG CTACAATCTC ACTGCACCTG TCCGGTTTGC CGATACGAGC TACCAACGGA TGATCCCGAC TACGAACGGG GCCGGATTGA GCGGATGCGG AACCGGAAAC CGAGATTTGC GAGGCATGAA CTGGATCGGA TGACAGTCTC TGAGTTGAAA GCGCTTTTGG CGAAATCGAA AAATTGCCGC CAACGGCCGG TGGACAAGCA TGATTTAATA TCTCTCCTTA TTTCGTCGAA CGCTATTGAT GTCGTGGAAA CTCCGGAACC GGTTACTTAC CGGCTCTCTG CCTTAAAAGA TATGAGTGTG GGAGCATTGA GGCGGTGTAT GAACGACGAA GCTGGGGTCT TTTACGATCC GAACGAGGTG GTAGAAAAAG CAGATATGAT CCAAATTTTT TTGAACAGTG GGCGCTTACT CTTAAACCCT GAAGATAATG CGACCAGTGA AATCAACGAA GACGACTTTG TTTGGAAGGA TCTGTCGCCG ATCAACAGCG ACGAGGATGA AGAAGACAAA TATCCGTCTT CCGCAGTTGC GTCTATCCTT GTGGAGACTG TAGTAGATGA AACTGATGTA TCAATATATG ATCGAAGAGT TAACAAAATG TTACTCATGG AAGAAAACTC GTCCTTTGAC GAAAGTTCGT GCACTCCAGC CATGGAAGAT GTGGAAAGAT TACTAGAGTT CAAGGATGAG ACATCGGGCT CAGCGACTGC CGACGAACAA TTCACTGGAA TAGATGCCGT TGAGGTCGCA TTGACTGATG CACCTATGGA CATGGGAGCT GATTCGCAGA ATGTTAAACG GCGAAAACGT GTAAGAAGTT TAGGTGGGAA CGAATATACC AGGCAGGTGG CAAAAGAAAA AAACTCCGCA AAAGCATCTC TGGACTCTAT TGACGAAGGA AATGTCACGG AAGGATCCAC GGCTCTCCGC CTACCAGTGG ACGATAATGA GGAAAGTATG GATTTCCACG GAGAAGAAAA TATCTCCTCT TGTTTCGACG ACTTGAGCGT TTCCGAACTT CGAGCACGCG GGCGAGAAAT ATCGGTTGAT CTTTCCGATT GTATCGAACG TGCGGAAATA GTCCAGCGCC TTTCGTCCAT TGAAAGTGAT GGACAACGTG CCGGCCGCCT CATGAATTGG GAAAAGTGGC GGGTTTCAGA TCTCCGAGCG GTTGCCGCGT TGACCGGTGT GGACTTATCA GAGTGCCTTA ATCGACAAAG TATGGTTGAA AAAATGCAAC ATGCAGGAGT TGAACGTCCT CATTTAGGAC GATTCTTGCA CTCACTGGCT CCATTAGCAC GTCTTACCAG TCTACAATTA CTGGCCGTAG CACGAGACTG GCAGGTTGAC GTTTCCGACT GTCTCGAAAA AGGCGATATT TTGCGCCGAT TGGTTGAATC GGGGCCAGGT ATACGATTTG AATGAATACA TTCGTCATAT TTTAGTTGGT G
|
Protein sequence | MRNPFRRNNP AASNANPRPM QHEASNPFEV LQRGARQAAS SLIDSLGATH QMVEEQFEAA TQAAMHASMQ APASSQGPPA ASAQVLHHLP QIRITRQDLV EPTNRECCVC FDLHRLNDKV LRLPCAHVFH PQCITKWLQS HCTCPVCRYE LPTDDPDYER GRIERMRNRK PRFARHELDR MTVSELKALL AKSKNCRQRP VDKHDLISLL ISSNAIDVVE TPEPVTYRLS ALKDMSVGAL RRCMNDEAGV FYDPNEVVEK ADMIQIFLNS GRLLLNPEDN ATSEINEDDF VWKDLSPINS DEDEEDKYPS SAVASILVET VVDETDVSIY DRRVNKMLLM EENSSFDESS CTPAMEDVER LLEFKDETSG SATADEQFTG IDAVEVALTD APMDMGADSQ NVKRRKRVRS LGGNEYTRQV AKEKNSAKAS LDSIDEGNVT EGSTALRLPV DDNEESMDFH GEENISSCFD DLSVSELRAR GREISVDLSD CIERAEIVQR LSSIESDGQR AGRLMNWEKW RVSDLRAVAA LTGVDLSECL NRQSMVEKMQ HAGVERPHLG RFLHSLAPLA RLTSLQLLAV ARDWQVDVSD CLEKGDILRR LVESGPGIRF E
|
| |