Gene Information |
Locus tag | PHATRDRAFT_50485 |
Symbol | |
ID | 7199324 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 191030 |
End bp | 193289 |
Gene Length | 2260 bp |
Protein Length | 530 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185395 |
Protein GI | 219130486 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.517757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCTGGCAGT GAGTGTTCAC TGCATGCACC TCCTGACTTC TCCGGGGGAA GCAGGAAACA TATTTCGTCA GCAGCTTATA TTTACAAAAC TGCTTCGCTC TACCTCTTTG TGGTTCATTG AGTAGTGAGA GTTGCTACGG CAGAAAACTG TTCCGCACAG CACCGTTCCT TGTGCTAGTC CGTTCCCTAG CGCGGTTCAA CTTTCCTGAC AACCGTATAT TTCTTGGAAA GAACGGTCAC ATTTCTGAAC CCTTCGACTG TTGGAAGGCC GGAGTTTCCG GCTGCAAATG CATTGAGCAT CTCGTCAATG GATTCTGACA ACGTATTAAC AGCTGATGCT ATTGGGGACG GAGAGGAAGA TTTCCTGTTG CGGAACCGCC TGCTTTCGAG TAGCAACACC AGTATGGCAA GCCCACCATT GCACTCGGGA GGTGGAACTG ACCATGAGAG TTGCGGGGCT ACTTGCAGCA CTCCAATCAT GCCAACACCG ACTAGTGTAT CACCCTCAAC GCCGTCAATA GCAGCGACAC GCCGGAGCCA AGGATTCGGT AGAATCCTTT CCTTTCCAGG TCCAATCCTT TACAATATTC AAAGCAGATG TACTCGACTT ACTGCCGGAA TCCTTCAGAA CCCAATATCT GGAAAGGGTG GACCTAGCGG GTGGTTTCTC CTGCTGACGA TAAGTGCCTG GTTCGGCTTG GGCGTTGTGG CGATCGTAAC CACAAAACTC CTTTTAACGA GTTGGAAGGT TCCTCCTCTG TTGTTGACAT TCCAACAACT GACTGCAGCA TCAACATTGC TACGAGTCGT GTTGGGTTTG CAACAGAACC TTCAGCCGCT GCCATGGGAA AACTATTGTC GCGCCACCAT AGCACCTGAT GCTCCATCTG GTACGGGAGC ATCGACTCTG GATGCGACGG GTACCGAGGA ACACTCGATT GTTGAGCTGG GAGCCGATCA AAATCACTCT GCCGTGGAGG ACCGAATCCA GACGCATATT TCTAAATTTC ACAGCCCCAA TAACTCACCG TGGAATGTGG AAAATACGGA ATTTTTTCTG ATTGGACTGT TCAACGCATT GGACTTCTTG GCTTCGAACA CCGCATTCTC TTCATCAGCT GCTTCATTCG TGGAGACAAT CAAAGCGAGC GACCCCATTA CTACTACGGC TGTTGCCTTG ATTTGGAAAA TTGATCAGGT CAAACGGCCG GAAGCGATTT CCTTAATGGT ACTCATAATC GGTGTCCTGC TTTCTACAAT AGGAAATGCC ACAAGCTCCA ACACTACTGG CGAGGATCCA CTTTCTAGTT CGGAGCTGTC CGTAGACGAG ACTGACGATG ACAGTGCTGC CGAAGCGCAA GAAGCTTTGT ACCTGTCCAT CCGCACCGCC ATTACTGTGG TCACAGCCAA TTTGTGTTTC GCCTTTCGGG CCATGAATCA GAAATTGTAC CGCAGGCATA CAAGCACGGG AGACCAGCTG GACGATGCGA ATTTGTTGTG TCGGTTACAA CAGACTGGAG CCTTGAGTTT GCTGTTCCCC ACAATGCTCC TGTACGCAGG ATTTGTGTTT GACGCTCTTT GGCAAACGCC GAGAGAAATT GTGTTGCAGT ACGTTGGGTT GGCGGCAGTC AATGCGGGTG CATTTGTTGC CTACAAGTAA GTTTGCATCT GTAATGGAGG GGATTATGTA CGTCGTGACG TAAGCTCATG CGCGCTTTCT GGTTTTCATT TTCATTTTAG CCTTGCAGCA TGTTATGTTT TGAGCAACCT TACGGTTTTA CACTATTCAG GACTTGGTTG 
CATGCGCCGC ATGTTTGCTA TTTTGTCCAC TAGCATTTTT TTCGGCGTCC CCATTTCAAT TTTGGGAGCT GCGGGCATCG TGTTGTGCCT CGCTGGTTTC CTATCCTTTA CGTACACACG TTCCCAACGT ACAGCAAACA AAGCTATCTT GAAAAGTTTC GATCACAAGG ATTCCAATGT ATAAAGCAAT GCGACTTGAG TTTTCTATAC CGGACAGTCC ATTCCAAGTT CCTGCCTTGA ACCCCGCGTC ACAGTTTTGG CTTGCAGTGC CGAACCGAAA GCAAGCGTTA AACCAAAATT GGAGTGCTCT GTTGGAAAAC AAAGTTGCAC GGGTCAAGTT ATGTCCGTTC TTGGCCAAAA CGTAGGTATT CCGCTATGTG ATTTACTGTA AGGCAGTTGA GTACTGCAGC GGTCTGTATA GACATGCAGG GTTCAACATA ACTTTTAAAC TTATGCTAGG
|
Protein sequence | MDSDNVLTAD AIGDGEEDFL LRNRLLSSSN TSMASPPLHS GGGTDHESCG ATCSTPIMPT PTSVSPSTPS IAATRRSQGF GRILSFPGPI LYNIQSRCTR LTAGILQNPI SGKGGPSGWF LLLTISAWFG LGVVAIVTTK LLLTSWKVPP LLLTFQQLTA ASTLLRVVLG LQQNLQPLPW ENYCRATIAP DAPSGTGAST LDATGTEEHS IVELGADQNH SAVEDRIQTH ISKFHSPNNS PWNVENTEFF LIGLFNALDF LASNTAFSSS AASFVETIKA SDPITTTAVA LIWKIDQVKR PEAISLMVLI IGVLLSTIGN ATSSNTTGED PLSSSELSVD ETDDDSAAEA QEALYLSIRT AITVVTANLC FAFRAMNQKL YRRHTSTGDQ LDDANLLCRL QQTGALSLLF PTMLLYAGFV FDALWQTPRE IVLQYVGLAA VNAGAFVAYN LAACYVLSNL TVLHYSGLGC MRRMFAILST SIFFGVPISI LGAAGIVLCL AGFLSFTYTR SQRTANKAIL KSFDHKDSNV
|
| |
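The length fields in this record can be cross-checked against the coordinates and the sequence text. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive (which matches the reported 2260 bp), and the helper names `gene_length` and `residue_count` are hypothetical, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def residue_count(grouped_sequence: str) -> int:
    """Count residues in a sequence printed in space-separated blocks."""
    # str.split() with no arguments drops spaces, tabs and newlines alike.
    return len("".join(grouped_sequence.split()))


# Values taken from the record above.
start_bp, end_bp = 191030, 193289
protein_length_aa = 530

length_bp = gene_length(start_bp, end_bp)        # 2260, as reported
coding_bp = 3 * (protein_length_aa + 1)          # 1593, codons plus one stop codon
non_coding_bp = length_bp - coding_bp            # 667

print(length_bp, coding_bp, non_coding_bp)
```

The gene span is longer than the 1593 bp needed to encode 530 residues plus a stop codon, which is consistent with the record marking the gene as spliced; the sketch does not determine how the remaining bases are divided between introns and other non-coding sequence. `residue_count` can be applied to the Protein sequence field to confirm the stated protein length.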