Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44868 |
Symbol | |
ID | 7199576 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 482590 |
End bp | 484311 |
Gene Length | 1722 bp |
Protein Length | 545 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179007 |
Protein GI | 219116424 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGCGAATGA AGGGAAATGT GAACCCAACC AGCAACTGTC GAACGCGGGA CTTTCCCAGA CTAAAAAGCT AAACATTCGG GATCATGTTG TACCCCGCTT TGACGGTGTT TGGTGTCAGC CTGTTCTTGC TCGCTAATGC GGTTGCGGAC GCAAGTTTTG GTTCATGGAC GAATTCAGCC CCTAATCTAC GAATGGAATG GACCCCAATT CCTGTAAACG TCCACGTCGA CGCGATATCA GATTCCAACA CAACACTGGA TACCTACGAC CTTTTATTTT TGGGAATGAA TCAAGTAGGC AAAGAAAACA GCACCATCGC AGGGAGTTTG CAGCTGGCTA GGATTAACGC CAAGTACGAT GGCGCCATCG ACCGTCTCCT GGAAGAAGCG ACAAACTTCA CAATCGGGGA CAAGACCGCC ATCATGACCT ACAATACCGC CTCGGGCCAG AAGCCACAAA CGCTAATAAT TATTCATGTA GGAGCTACGG CAGCCGACGC TCGTAAAGTC GGTACGGATC TGGCACGAAC GATACAGGGC GGAAAAAAGG CAAAATCCTG TGCTATCCTG CTACCTAACC AAAAGTCGAC ACGGGAAGCC TCGGCTTATG TCACAGAAAT GACGACTGCA TTGTGGGTGG GGCTGTACAA GGACAAGCGT TTCAAGTCTA TCAAGGACGA TACTGACGAC GCCGGAAACG TGAACACGAT TGACCTGATT GTCCCGGATG GCAACTACGA TAGGGAGACG ATCCGAAACG CCATTGCGAA TGGCCGCACA CACAGTGCGG CAGTGTACCT CAGCAAGGAT ATTGTCAACG CACCGCACAA TGTGCTAAAT TCCGTATCAT TGGTCGACAC CGCCCAACGA TTGAACAAAA TGTACAAACG AACGTTGCGC TGTCAAATTC TGGACCAAGA AGAATGTCAA CGAAGGGGAA TGGGCGCCTA TTTGGGCGTA GCTCGCGGGT CCGAAACACC CGCCCAATTC ATTCACATAA CGTACCGTCC CCCAAAAAGC ACCTACTGGA GCAAATCGGC GCCTAAGCTT CGCCGTCTGG GTATCATCGG AAAAGGTCTG CTGTTCGATA CGGGTGGGTA CAACATCAAG ACCTCCATGA TGGAACTGAT GAAATTTGAT TGTGGTGGTG CCGCAGCGGT GTTGGGTGCC GTCCGGGCGA TTGCCGAACT CGAGCCACGA GGCGTCGAAG TCCACTTTGT GATCGCCGCG TGTGAGAATA TGATCAACGA ACGAGCGATG GTGCCGGGAG ACATTTTGAC CGCTTCCAAC GGCAAGACGA TCGAAGTCGT CAACACGGAT GCGGAAGGTC GTTTGACCAT GGCGGACGCC CTGGTTTACG TAGACCAGGA ACTGGACTGT GACGAAATTC TCGAGCTGTC CACACTCACC GGTGCCTGCA TGATTTCGCT GGGCACGTCG ATTGCCGGTT TGTGGACCAA CAACGACCAG CTTGCCGTCC GCTTATTGGA GTCGTCCGCT CGGACCGGGG AAAAGATTTG GCGCATGCCG ATGGAAGCTG ACTACCGTGA CGCGCTCAAA TCCAAAGTAG CCGACTTGAA GAATTTGGGC GCTCGCTACG GTGGGGCCAT TCACGCGGCG CTCTTTCTGC AGGAGTTTGT CGAGGGCGAC AAACCCTTTG CACACGTGGA TATGGGTGCG TTGCGCGGTG CACCGATGGG TTTGAGTCTG TACTTGGAGT AA
|
Protein sequence | MLYPALTVFG VSLFLLANAV ADASFGSWTN SAPNLRMEWT PIPVNVHVDA ISDSNTTLDT YDLLFLGMNQ VGKENSTIAG SLQLARINAK YDGAIDRLLE EATNFTIGDK TAIMTYNTAS GQKPQTLIII HVGATAADAR KVGTDLARTI QGGKKAKSCA ILLPNQKSTR EASAYVTEMT TALWVGLYKD KRFKSIKDDT DDAGNVNTID LIVPDGNYDR ETIRNAIANG RTHSAAVYLS KDIVNAPHNV LNSVSLVDTA QRLNKMYKRT LRCQILDQEE CQRRGMGAYL GVARGSETPA QFIHITYRPP KSTYWSKSAP KLRRLGIIGK GLLFDTGGYN IKTSMMELMK FDCGGAAAVL GAVRAIAELE PRGVEVHFVI AACENMINER AMVPGDILTA SNGKTIEVVN TDAEGRLTMA DALVYVDQEL DCDEILELST LTGACMISLG TSIAGLWTNN DQLAVRLLES SARTGEKIWR MPMEADYRDA LKSKVADLKN LGARYGGAIH AALFLQEFVE GDKPFAHVDM GALRGAPMGL SLYLE
|
| |