Gene Information |
Locus tag | PHATRDRAFT_49869 |
Symbol | |
ID | 7198501 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 137612 |
End bp | 139792 |
Gene Length | 2181 bp |
Protein Length | 726 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184655 |
Protein GI | 219128933 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
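The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes 1-based, inclusive coordinates and a CDS that ends in a stop codon; the record states neither convention explicitly, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


length = gene_length(137612, 139792)
print(length)                  # 2181
print(protein_length(length))  # 726
```

Under these assumptions, 139792 - 137612 + 1 gives the listed 2181 bp, and 2181 / 3 = 727 codons, of which one is the stop, gives the listed 726 aa.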
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.37954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGCTT GGCACGTTTC CGGTGCTCTT CAACAGATTG TGGGCAGTCA AAGTGTCCTT TCGCAAGAAC TGGTGCTGCT CCAGGCTGAC GTCGATAACG TTAACGACGA TGACGACAAC AACAGCAACT GCCACGACGA TAACGATAAC GACGACACTC CGGCGGAATC GGCACTCGTA CCGTCCTCCG AGCCATCGAC TTACACCCCG CCGTGTCCGA CTCGATCCCG GACGGTCACC GCCAGTATCA AGACCAAGAC CAGGCATCGA AAGCCACCAC CACCCGCCAT TGGCGTTGGC TCAACCGCAA CGGCAACTGC CCATACAACA ACAACAAGAA CAAAAGTCAC CGACTCAGCC CTCGACGCGA CGTCTCCCAT CAAGGCGGCT CGACGAATCG GTCTCGTCCT GACGCAAGAA GAGGAGGAAC TCTCCGATTC TGGTGACCCG ATGGATGACA ATCATCCACC GCACGGTACC ACCCGTGTCT CCGACCCGAC ACGATCCGCC CGGCTCGGAT CGCAACAGCC CTTTTCTTCT TCCAGGGCAA CCCGCCCGGT ACCAATGCCC TTTCCCGGGT CTCCCGCATC GCCCCCGGCA GCGTCCATTC TACCGACGAC GTACCTCGAA CTAGGACAAG GAAGCTCCGC GTCGCCACAC ACCAGTGGAA AGCTACCAGT CTTGGCGACT TCTCCGGAAG CGTTGTCGTC CCCACTCAAG CCGTGTCGTC TCTCGCAAGA GTCCCGCTCA ACGAAAGTTG AGCGTAGATC GACCGACCGG GAAAATTTCG AATTCTCATT CTCGCAAGAA TCGGCCGCAT CCACCAACAC CACGGATCCC AATCTGGGAC CGCTCCTACA CGCCATTGCC ATTCAGCAAA CCGGTCTTTC CCAGGAACCT TTGTTCTTCA CATCCTCTCA AGATTACGTC TATCCGACCG TGGTCCTACC GTCCCAGCTC TCCGAAGAAG CCGAAGAAGA AGAAGACTCC CCTTCCGACT TTTTGCTCCG TCCCCGCAAG TCGCGTCGGC AGCGAAAACG GAAACACGAA CCGATCGCCA GTGAAAGCGA CAACGAACCG TTACCACCAA CATCCAAAAA TCCAGCCAAA CGAGCTTCCA CGAGCTCCAC CAAGAAGCCA CCAGAAACCA AACGGGCAAT CAGCGAATCC GCCAAAGTTC TCAAGCCTAC CTCCAAGCCT TCCACGACAG CGCTCCCATC TACATCTCGA ACATCCACTA ACCCCGCCGC TACCACACTG CCGCCATCCC CCACACGATC GAACCCACCA TCAGGAAATC ACTCTTCCGA CCTTTCTCGG CTCCAAAGCC AGCCATCCGC TCAAACGCAA CAATCGCCAT GCCCCACGAC ATGGTCCGTA AGCTATTCCA TTCCCAGTCC GAACTCGGCT GGGCAGAAGA GCAAAAACAA TATCAAGAAC AATATCAAGA ACGACGCCCA ACTCAGGCAA AGCAGCACTG CGTCCGCGGG CGCTACCGGA CCGACTCCGG AAGTACAAAA GGCGGCACAA TTGGCCGCAC GGGTCCTACA TGATCCCGAC TTGGCCAAGG CTCTCTTGTT GCGCATGGCG TTGGTCCGGG AATCACCCCG CCCCGCATCT CAGCGCGTCG ATCCGCCTCC CGGAACCATC CTTGCGGAGC ACTTTGTCTG GGCCAAGTTT CCTTCTCTCG AGAACGTTTT GAAACTCCAC ATGCTGGACT ACTACCAACT CTCCATGAGC AGTTGTCAAT CCAACGCGCA ACAAGAGTAC AATAACCGCC TCGTGACGAT TATTCGCGGT 
CACGTGTATC GCAAGGGCTG GACGTTTTGT GATCGATACC GGAACTCGAC GGAGGTGAAA CCACTCCGGG ACCGTATTCG GTGTTACTAC AAGACGCACA TTCAGAACGC CAAGAAGCGG TTGCGAACAA TGTTGCGCAA TCCGACAAAA AAAGCGAGTG CGAAGCACTT GGTCGCGCAC TACGATTTGA TTCAGGAAAC GGTGGAGACG AGCGGCAAGG TCCAAGCCGT GGCGGAAACG TACGGCTCGG CTGCGAGCCA CTCGCCACCA CGGGGGAAAA AGTCCCCGAA GCGACGAATC GAGGCGTCGT CGCGACGGAA TAAGGAAGCG AGTGAGGACT CTCTTAGTGA CGAGGAAGAA AGGGCAACGC TTCTTGTTTG A
|
Protein sequence | MRAWHVSGAL QQIVGSQSVL SQELVLLQAD VDNVNDDDDN NSNCHDDNDN DDTPAESALV PSSEPSTYTP PCPTRSRTVT ASIKTKTRHR KPPPPAIGVG STATATAHTT TTRTKVTDSA LDATSPIKAA RRIGLVLTQE EEELSDSGDP MDDNHPPHGT TRVSDPTRSA RLGSQQPFSS SRATRPVPMP FPGSPASPPA ASILPTTYLE LGQGSSASPH TSGKLPVLAT SPEALSSPLK PCRLSQESRS TKVERRSTDR ENFEFSFSQE SAASTNTTDP NLGPLLHAIA IQQTGLSQEP LFFTSSQDYV YPTVVLPSQL SEEAEEEEDS PSDFLLRPRK SRRQRKRKHE PIASESDNEP LPPTSKNPAK RASTSSTKKP PETKRAISES AKVLKPTSKP STTALPSTSR TSTNPAATTL PPSPTRSNPP SGNHSSDLSR LQSQPSAQTQ QSPCPTTWSV SYSIPSPNSA GQKSKNNIKN NIKNDAQLRQ SSTASAGATG PTPEVQKAAQ LAARVLHDPD LAKALLLRMA LVRESPRPAS QRVDPPPGTI LAEHFVWAKF PSLENVLKLH MLDYYQLSMS SCQSNAQQEY NNRLVTIIRG HVYRKGWTFC DRYRNSTEVK PLRDRIRCYY KTHIQNAKKR LRTMLRNPTK KASAKHLVAH YDLIQETVET SGKVQAVAET YGSAASHSPP RGKKSPKRRI EASSRRNKEA SEDSLSDEEE RATLLV
|
| |
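The relationship between the gene sequence and the protein sequence above can be spot-checked with a short translation sketch. The Translation table field of this record is empty, so the standard genetic code (NCBI translation table 1) is an assumption here; `translate` is an illustrative helper, not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), ASSUMED because the record leaves
# the translation table blank. Codons are ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence: matches the start of the protein.
print(translate("ATGAGAGCTT GGCACGTTTC CGGTGCTCTT"))  # MRAWHVSGAL

# Last 21 bases of the gene sequence, ending in the TGA stop codon.
print(translate("AGGGCAACGC TTCTTGTTTG A"))  # RATLLV
```

Both fragments reproduce the corresponding ends of the listed protein sequence, which is consistent with an unspliced, forward-strand CDS translated with the standard code.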