Gene Information |
Locus tag | PHATRDRAFT_49547 |
Symbol | |
ID | 7198218 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 6705 |
End bp | 8328 |
Gene Length | 1624 bp |
Protein Length | 498 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184282 |
Protein GI | 219128150 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.358617 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCACCCGG AGTACACCTC CTTCGACAGC ACACTTTCAT GGTGACCATG AGGAGTTTTT TGCTGCTGCA ACTTTGGATC GGGTTCTTAA AAGCGTCGGC TTTTGCTGCC GCTTACATTT TTCCTTCATC CGCCCGCAGC CTGAGACGAT GTTTTCAGGA AAAGGGGACA TGCCCGTTGC CATTGTCTCA CTCATCCATC ACGTCTGGCT GGACAGCCAT TTTTAGTTCC ATGGAAGACA CAGATGCCGC CACGATCAAC GGAGAACAGT ATCCTGCTGA TACTAGTGCA AATGGTGGTG GGCTTGTTAC GACTGCCGAC TCTCGGTCGC AGCTGTTTTC TGCTTTCTCG GCGTTGACTG CGGAAGATCA GTACGATGCT GTCTTGACTG GTCTTTGTGC TAAGATTTTG GACGACACGA CTATTGTGGA AAGAGCTGCA ATTGAGCGAC TCAAGGACCC CACCAATCTA TTGAATGAAA TGAATTCCAG GCGAATACAG GCCAGCCCTC GATCTATGAT GGCGCTGATC GATGTAAGTA TGGATGGTAC CTTGGAATGG TTTGCGGTGC CAAAGTAGGT TCCATTGACA AAACTCACAC GGGAAATTGT TGCACGTACT AGTCAACTGT GAAAGCACAA GATGCGCAGA CCATGGCTCA AATGCTGTCT TTGAGCGTCC GGAACGGAAG TGTCACGCGT TACGGAGTTC GGCAGGCTGA CATTCTACCA CTTCCCCTAG CCGCAACGTC GAGAGTCAAA TGCCCCGATG GATCAATGAA AACTCGTTCG GAACGCTTAT CTACCGTTGC CGATATACCT TTTGATGAAC GAGGAACGGA AGTGAAAAAT GCCTTTGGAG TAACTGCCGT CGCGGGTGGT TGCTTTTTCA CAGATGTTGC TGGCATGGAT GACATTGCGC CCTTTGCCAA TGTATTCCTC AGTACTCTTA TCGTGGTTGG TGCTCTCGAT AACTTCTACG ATTTATTTAA AACAAGCACG TCAATGATAG CCAAACAGGT GGCAAAAAAT GGAGCCGCGG ATTTTGAACT TCCTGAAAAG GATGCCTTGC CGTTTGGGCT AGGTTCAGGC CAGTTGACAG GGAGCGTCGT CCGTGGCTTT GCTCGACTTT TGACGACCGA TGCCGAACGA GAGTCATTGT GCGAGGCATC GGCTTTACTT ACTGCGTATT GCACGGGTCT GCCCTGTTTT GCGTACCGAC CGAATGCTTT GGAAGCATCT GTCCTAGTCG TCGAAAGTAC CAAGAACAGC AACGGGATGG ACTCACTTTT GTCCAGCGCG GGGATTATGC GTGTTCTGGT CTGGCTCCTG GCTCCCGTTG CGGCTGAGTC TGCCAAGTTC CCTGTATGTA TTGTGAGCAA CCCGAGGGAA GCGGAATCTT TTTTGGACCG ACTGGAAGAA TTCGCTGCGA AGGATCCTTC TCTGGCCGAC GAGATATTTT GGACGGGAAA CAAGCAAGAA CGTAAGGATT TGCTCAAGTG GGCGTATACG GAAGCCGATT TGTTGCTACG TGAGAATCGC AAGATCTTGC AGGAAGTTAC GGAGCGCTTG ACCGGGGGAG CAGCAACAGT AGGAGATTGT ATCGCGGTTC TAGAAGAATG GTAG
|
Protein sequence | MVTMRSFLLL QLWIGFLKAS AFAAAYIFPS SARSLRRCFQ EKGTCPLPLS HSSITSGWTA IFSSMEDTDA ATINGEQYPA DTSANGGGLV TTADSRSQLF SAFSALTAED QYDAVLTGLC AKILDDTTIV ERAAIERLKD PTNLLNEMNS RRIQASPRSM MALIDSTVKA QDAQTMAQML SLSVRNGSVT RYGVRQADIL PLPLAATSRV KCPDGSMKTR SERLSTVADI PFDERGTEVK NAFGVTAVAG GCFFTDVAGM DDIAPFANVF LSTLIVVGAL DNFYDLFKTS TSMIAKQVAK NGAADFELPE KDALPFGLGS GQLTGSVVRG FARLLTTDAE RESLCEASAL LTAYCTGLPC FAYRPNALEA SVLVVESTKN SNGMDSLLSS AGIMRVLVWL LAPVAAESAK FPVCIVSNPR EAESFLDRLE EFAAKDPSLA DEIFWTGNKQ ERKDLLKWAY TEADLLLREN RKILQEVTER LTGGAATVGD CIAVLEEW
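The derived fields in this record can be cross-checked from the raw values. The sketch below is illustrative only: the function names `gene_length` and `gc_percent` are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption consistent with the listed Gene Length of 1624 bp, but not stated by the record). The sequence helper strips the spaces that separate the 10-base blocks before counting.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Start bp and End bp as listed in the record.
print(gene_length(6705, 8328))

# First two 10-base blocks of the gene sequence, used only as a short sample.
print(gc_percent("TTTCACCCGG AGTACACCTC"))
```

Because the gene is marked as spliced, the genomic span is not expected to equal three times the protein length; the difference is taken up by intron sequence.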