Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39520 |
Symbol | |
ID | 7195350 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 41762 |
End bp | 43217 |
Gene Length | 1456 bp |
Protein Length | 454 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183660 |
Protein GI | 219126848 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTCGT CTTTGCAGGG TGCCAACGCC ACTTCCAACG GTGCCATGGC GCTGTTGCAA AAATTTGCGG CCCTCAATAA TCACATCGAA GAGATTCGTC GGCAACAAAA CAGTGTACTG CGTGAAATAG ACTCGGTCCA GCAGCAACTA GTGGATGTGG GAGAAGACCG AGAAAAGATG TGTGAAAAAA CAGACGAAGC GGGAAAGAGC CTCATCCAAT TGGAACAAAG GACTAAAGAC GCTTTGGATT CTCAACTTCA GGTAGAAAAA GGTCATTCCG AAGCGCTGTT GACTAACCAA GTATGTGCCC GTCGACTTGA AGCAGCCCGT CAGGATACGT TGGAATCCCA GCAAGCCTTT CTCGAACGAA CAAGGAACTT TCGTCATTCT TGTCGACGCC TGCAGCTACG CGCACAGCAC ATGGGAATCC AACACGCCTC TCTCCGGGCA TGGATTGCTG CAAAAGGGGA AACGATATCG GGAACTGATC TTGTGGGCGA CCAACGACAT CAAACGTACG GGTCCCGGTA TCGGAACTCC AAATTTGATA TTCGGGATCC GGATTCATGG GGTCTCGAAG TCGTCCAGGG CGACGAAGAG CTTCACGAAC TGTTCTTAAA GTATGAGTCA AAGAAGAGTG ATTTGGACTT GGCACAAAAG AACCGTGAAA TAGCTCGAAT CGCCTGGCAA GAGCAGCTTA CCAATGCCGA CACTCGCCGT GATCGTCGCA CGAGACTTGA AGAGCAACTA CGAAGGATCG AAGGAGACAA TGACGCCTTG GAGTCGCAAA TGCGTGAATT GGAATGGCAG ACCGCTAGAG TTCGAGAGAC CGCGACAGTC CCAGAGCTTC TTAAGAGTGA GTCGACATTC TTAGGATCAT TGGGATCCAC ACCAGGCTGT TCCCTGGGCT TGCAATTCTA ACCGCTAAAT CTCTCTCTAT CTACCAGGTA CCGTTCAGCT GATATCTCGA TCGACGAACT CCACCAAGAC GCAGCAATCA TCCGCCGCGG TGACTCCTAC GACAGGCTGC CGTTCTACAA CCGGCCTCTC GTCAAGGGTT GCCTTTCACC GGACAAACCC GTATGCCAAT ACAGGAAAGA ACCAATACAA GAATCGAATT TCCAGTACAG ATTCCAACGT CATTTCCACG CCCGAAATCC GTCTACCGGC ACTCCATTCA GAAGCTTCCG ATTTCCGCGA CAGCGCGTCG GCTAGTATTG GACGGCGTGG CCGTCGTCTT GGTGGAAGTG AATTTGGATT GAGCATGGAA ATTGTCGGGG AGCCATCCAT CACATCCATG AACTATAAAA AAGCAACAGA TACCTGCTTC AATGATCCTC ACATTTCTTT GCCGGCGGAT GATTCTTTGA AGCGCACCAT TGCATCGCTG CAAGATAGCG ACGGCGAAGA CTTTTCCTAC ATGCCATTTA CTAAGAAGAG CAATCAGCCT TTGTAA
|
Protein sequence | MSSSLQGANA TSNGAMALLQ KFAALNNHIE EIRRQQNSVL REIDSVQQQL VDVGEDREKM CEKTDEAGKS LIQLEQRTKD ALDSQLQVEK GHSEALLTNQ VCARRLEAAR QDTLESQQAF LERTRNFRHS CRRLQLRAQH MGIQHASLRA WIAAKGETIS GTDLVGDQRH QTYGSRYRNS KFDIRDPDSW GLEVVQGDEE LHELFLKYES KKSDLDLAQK NREIARIAWQ EQLTNADTRR DRRTRLEEQL RRIEGDNDAL ESQMRELEWQ TARVRETATV PELLKSTVQL ISRSTNSTKT QQSSAAVTPT TGCRSTTGLS SRVAFHRTNP YANTGKNQYK NRISSTDSNV ISTPEIRLPA LHSEASDFRD SASASIGRRG RRLGGSEFGL SMEIVGEPSI TSMNYKKATD TCFNDPHISL PADDSLKRTI ASLQDSDGED FSYMPFTKKS NQPL
|
| |