Gene Information |
Locus tag | PHATRDRAFT_50216 |
Symbol | |
ID | 7199002 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011696 |
Strand | - |
Start bp | 14041 |
End bp | 15348 |
Gene Length | 1308 bp |
Protein Length | 411 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185188 |
Protein GI | 219130052 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.00633421 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCTCG TTTTCCAGTA CGGAATCGCC CCCGCTATAG TTGACGCCAA CGATTTGACT AGCTTTGTTA GGGACAAGTG GACGGATGGA TGTTCCCAGT ATGTCGACAT CGACGTGGTC AAGTCTTGTG CTGGCAACAA TGGTGTTTAT CGCGTATCGG CTGCTTCGAC GCTATTCTTT GCATTCGCGG CGCTGGGAGC TCTACTTAAA CCCACGGCCA ATCGAGAAGC ATGGCCGGCC AAATACACTC TCTACTTCTT TCTCTGTATC GTCACTATTT TCATTCCCAA TGATCCGCTT TTTTCTGACG CCTACTTGAA CATTGCACGT ATCGGCGCAG TCCTTTTCAT TGTTGTTCAG CAGCTTGTCA TTGTTGACAT GGCTCACGAA TGGAATGACA GCTGGGTCGC TAAGGCGGAT GCCGCAGAAG CACAGGAGGC TGGGTCCGGG AAAAGGTGGC TCGGTGCTAT TGTGACTGCT TGCATAATGC TCTTTGGAAT ATCCATCATT GCAATAGGCG TCATTTTTTC TCGCTTCACG GGATGTGGCA CAAACAATGG ATTTATTACT GTCACGCTTG TGCTCGGCGT CTCAATTGTC GGTGCGCAGA TGTCTGGCGA AGAAGGTTCG TTGCTAGCCA GTGCCTGCGT CTTTGCGTGG TCTGTGTTTT TGTGCTACAC AGCCGTTTCC AAGAACCCTG ATGCATCCTG TAATCCTATG CTGGGCGAAA TGGATACTGT GAGTATTGTG CTGGGCTTGA CCGTGACGGC AATTAGCCTT GGATGGACGG GATGGTCGTA CACGGCCGAA GACAAGCTGC GGTCGTCTTC TGAAGAGGAA AGCGCTGCTG CAGCCACGGC CAGGGCCAGT GACGACTCCG AGAAAGATGT CAGGCGGGAC GTCACAGGTG TGGTCACAGG CAACGACTAT GGAACGCAAG ACGACGAAGA GCAAGCTAAC AGTGCGGGTC ATGCCGAAGT GGATGAATCA GTCTTGAACA ATCCTAGCCG TCTATCCAAT TCATGGAAGC TGAACGCCAT TCTAATGAGT GTATCATGTT GGAAGGCTAT GGCTCTAACC AATTGGGGCG CGATTGTGGC CAATGGCAAT GCTGCTAATC CTCAAGTCGG CCGTGTTGGG ATGTGGATGG TTATTGCCTC GCAATGGCTT GTTCTGACGC TGTACTTGTG GACATTGCTG GCACCGAGAC TCTTTCCCAA TCGCGAATTT GGCTGACTTT GTCTGATTTG CAGGATTGTT GGAGATAACA GCAATGTTTC ACAATATTGT ATAAGTTGAC GGTCTAAT
Protein sequence | MALVFQYGIA PAIVDANDLT SFVRDKWTDG CSQYVDIDVV KSCAGNNGVY RVSAASTLFF AFAALGALLK PTANREAWPA KYTLYFFLCI VTIFIPNDPL FSDAYLNIAR IGAVLFIVVQ QLVIVDMAHE WNDSWVAKAD AAEAQEAGSG KRWLGAIVTA CIMLFGISII AIGVIFSRFT GCGTNNGFIT VTLVLGVSIV GAQMSGEEGS LLASACVFAW SVFLCYTAVS KNPDASCNPM LGEMDTVSIV LGLTVTAISL GWTGWSYTAE DKLRSSSEEE SAAAATARAS DDSEKDVRRD VTGVVTGNDY GTQDDEEQAN SAGHAEVDES VLNNPSRLSN SWKLNAILMS VSCWKAMALT NWGAIVANGN AANPQVGRVG MWMVIASQWL VLTLYLWTLL APRLFPNREF G
| |
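The coordinate, length, and GC content fields in this record are related by simple arithmetic. As a rough sketch, the Python snippet below shows how the 1308 bp gene length follows from the start and end positions, assuming they are 1-based and inclusive, and how the space-grouped sequence blocks could be joined before computing a GC percentage. The helper names `span_length`, `clean_sequence`, and `gc_percent` are hypothetical and are not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Join the whitespace-separated 10-character blocks into one string."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Start bp and End bp copied from the record.
print(span_length(14041, 15348))  # 1308

# First three blocks of the gene sequence, used only as a short demo input.
prefix = clean_sequence("ATGGCGCTCG TTTTCCAGTA CGGAATCGCC")
print(len(prefix))  # 30
print(round(gc_percent(prefix), 1))
```

Passing the full "Gene sequence" field through `clean_sequence` and then `gc_percent` should give a value close to the reported 51%, although the database's exact rounding rule is not stated in the record.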