Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47617 |
Symbol | |
ID | 7202667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 291603 |
End bp | 292809 |
Gene Length | 1207 bp |
Protein Length | 372 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181877 |
Protein GI | 219123117 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.261739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACCGAAACAC ACTTCAAGCT TGCTTGACCA TCAAAGAACG CGACAAGCCC CGTCTCGCTC CTCTTACTAC TTTCCCAAGT GAAGATTCAT GGCGAGATTA CGAATACCTC CGCATATACT GACGGCCCTG GGTGCCGCAT CCCCCGCAAT TTACGGATCT TGGTTTGCGT TGAACAAGTG TGAGGCGTCG GCAAAGACCG ACAGCCGATC AAGGATCGTT TTCTTAGGTA CTGGATCATC AACAGGATGT CCACGCCCCC TTTGTCCGAT GATTTTTCAG CCCAACCAAT CAAAGAAGCA AGAAGAGACG ACGGAAACGA AAGCCATGAA AGAAAGGCTG GCTCCCTTTT GCAAGGTATC CAATCTTGCG ATCGAAGGCG ATCCGAAGAC CAACAAAAAT TATCGAAACA ACCCTTCTCT TGTGATCGCT TTCGACGATA ACGGGGTACA AAAGCACGTT CTTATTGATT GTGGGAAAAC CTTTCGGGAG GGAGCCCTCC GATGGTTTCC AACATTGGGA ATCAACAGTT TGGATGCCGT CGTTTTGACC CACGAACACG CCGACGCCGC CTTTGGCCTG GACGACGTAA GAGGGTTTCA ACGAACCGAG GGAGGATTCG CCGGAAACAG TCAATTTCGT CAGGTGCCCA TGCCTTTATA CGTTTCGCAG CAATGTCTAA ACGAAATTGC GGAACGCTTT CCTTGGCTCT TTCCGGAACT ACAATCCCGT GCCGATATCG CCCTGGTCGA CAAAGCTGTC GTGAAGAGGC ACGTCGCCTC TCTTGATGTG CACGTCATGG AACCCTTTAA AGCAGTGAAC ATAGAAGGAC TAGAAATTAT TCCGCTACCA GTTATGCATG GTGAAGACTT GGTTTCTTTC GGATACGCAT TTACCGTTGG GCAGACGAAC GTCGTTTACT TGTCCGACAT CTCTCGAATG CTGCCGGAAA CATTGGCGTT CATTAGCAAA AGTCGACCTC CCACAGACAT TCTGGTTGTG GACGCTTTGC ATCCCACGCG CGACAATCCG GTTCATTTCA GTTTAAATTA TGCCTTGAAT CTAGTGAACG AAATAAAGCC AAGACGTACG TTTGTAGTGG GGATGAACTG CGATTCTTTC CTACCTCATG ACCAGGCCAA CAAAGACCTT CGAGACAGTT ACGTCAACAT TCAGCTCGCA TACGACGGGC AAGTAGTCGA GTGTTAA
|
Protein sequence | MARLRIPPHI LTALGAASPA IYGSWFALNK CEASAKTDSR SRIVFLGTGS STGCPRPLCP MIFQPNQSKK QEETTETKAM KERLAPFCKV SNLAIEGDPK TNKNYRNNPS LVIAFDDNGV QKHVLIDCGK TFREGALRWF PTLGINSLDA VVLTHEHADA AFGLDDVRGF QRTEGGFAGN SQFRQVPMPL YVSQQCLNEI AERFPWLFPE LQSRADIALV DKAVVKRHVA SLDVHVMEPF KAVNIEGLEI IPLPVMHGED LVSFGYAFTV GQTNVVYLSD ISRMLPETLA FISKSRPPTD ILVVDALHPT RDNPVHFSLN YALNLVNEIK PRRTFVVGMN CDSFLPHDQA NKDLRDSYVN IQLAYDGQVV EC
|
| |