Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47462 |
Symbol | |
ID | 7202579 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 683345 |
End bp | 685594 |
Gene Length | 2250 bp |
Protein Length | 491 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181611 |
Protein GI | 219122561 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.28983 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCTTCCTTG TGTATCGCAC AAACGCTTCA AATTGAAATA TACCACCACC GGAGCATCAA TCGTTACACT CACATTCACA CACACCCAGA GGGAACAACA ATATTAGGGA CAAGTAGAAA CATTCTGTAG TGGCATCATG AGACACAGCA ATGGTCGGGG GGTTCCCGCT GAGGAAAGCG AATTGCGGAC ATCGCCGGTG ATCAACCGGA TCCGACCCCG CCGTGAAGCA CAAGCCTACC GCATCGCCTA TTCGTGTGAG CACGCCGCGC GCATTTTCCG GAGTACTAAA CGGCGCATTC GATGGTAAGT TTAAGTCAGC ATTGTTCTTG TCGTTGGGCG TGACCTCGAG TTTGGCCTAG TCGTTCCTGC ACCATCGCAC CATGGTGGAC CCCGTCCGAC ATCGATTACC GATGCTTGTT CGGAGTGGGC TTGCTAATTT TCATCGACGG ACACCCGTTC AATATTTTTC TGTCTTGGTT CTGTAATCGA AAACAGCTTT GTTGTTTATG TATTTTTGAT AAGGTATACA GTCTTGTTGA TGAATATAAC AGTTGGGACA TGAACTTTCC GTTTTGACCT GGGGTATCTA GACACTTCAC GTGTCCTCGT GTTCTTGTGT GGGAGGGGCA CAGCTTAGCC CTTCCAATCG CGCTCGGGAC CCCGTCTTAT ACTTATAGCA GCCCTCTTCC GATGGGCCGA CCATGATGGA ATCACACATG GTAGTCACCG TTCGCCAAGC TTTCCACTCA CCGAGTCCCG TCCTCTCCTC GCGTCACAGG TCATTCGGGC TTACCAACGT GAATGCTTTG CGAGACGGCG GTAAAGGCTT GGACTGTCGG GGGGAAGAGC ACGCCGTCGA GCTGGTTTGG AGCGTTCGCA GCGGAAAGAC ACGGGTTTTC TGGAACGGAA GGGATATTTC GAATCTGTTT CGGGGCGGGA ATCGATCCGG AATGGTCGAG TTTGCTTGGA ATACTCGAAC GGAGGAATCT CTGAGGATCG TCGCCCACGC GGAACCCCGT CGAGGAGTCC GGCAATACGA TCTGTTGGTG GACGGAATCA GTGTTTTCAA CCTGCCCAAG TTGGCCGAAA TTGGACAACC TTTGTCGGCC ACTTCAACCC CGTGGGAGCT TCCCTTGGAC ACAAGCCACG AAACCGAGTC CCGATCGCCA CGGTCGTCTC CGATACCGGT GCACACGATC GACTGCATTG AATCGATCGA TAGCATGGAG CACCAGACTT GGACGGATGC CGATCAGGCC AAAGCTCGGC TCGCGAGTGT TGGTCTAGCG TTACATCCAG ACACTGAAGA GAGCGACGAT TTGCACTCGG ATTTGTACTC GCCCATCGTT AATTCGATGC GCAACCTGAT CACGGCGCAC CTACCACAAA CGGAGGAAAC CGTCTCGCGG GCGTTTACGA ATGCACTGAT CAAGGATAGT GATTCGTATA CGAGCGAATC GTCCCTTTCG GATTCGAGTT GTCTTCACGA CGCGATGCAA ATTGAAGTCA ACGCGCTATG GGAAGCCTTC CGGTGGGTCA GGTCCAATGC CGATCAAATC ATGTTTTCGG ACGGCGAGGA GTTACAGCTC GAGTACATGC GTGAGCAAAT CGAAGCGGTG TTTGCCAAAG TATGTCACGA ATGTTTGACA CCGGGAGAAG CTTCGCGTAT TCTTTTGCAC GTCGGTGCCA TTCTTGGTCT CAAGTTTCAT CGCAATATTC TCATGGATAC CATTCTCGTG GACGGCTTGT CCAATTACTG CACCGTAAAC GACCTGGAAG CTGCGTTACG GCCGTTTGGC CGGATCGTTT CGATTGCCAT GGTCCAGGGA CATGGTTTCG GATGTTGCCG ATTCGTAGAC GACGGTCCGC TGCTGCGGTT ACAACGGAGA GGGCTTACGT TTACGATTGC CGGAACGAAA GCGCAAGGCA TGGTCATTTC TAGTCTTCAC GAATTCAATA CCAGCACACG TTATTCTCAA GGAGAGAACG GAGAGTTTTC CGAAGAAGAG CATTCCGACC AGCCTACAAT GCAGCGGACT TCCGCTTGTT CTATCCATGG CGAAAGGAGG AATTCTTGTG GAGTACCAGT AACACCGATG TCCGAGCAAA GTGTTCGAGA AATGATTGAC TTTGGGGACC AACATCAGAT CGACACCTTA GACTTTCTTC GGTCGCCCGA CTCGGTTACA CGAATGACAT TTGGTCCAGG CATCATGCCA TCGAGTCTCA CCGCTGCTTG TCTGGACTAG
|
Protein sequence | MRHSNGRGVP AEESELRTSP VINRIRPRRE AQAYRIAYSC EHAARIFRST KRRIRWSFGL TNVNALRDGG KGLDCRGEEH AVELVWSVRS GKTRVFWNGR DISNLFRGGN RSGMVEFAWN TRTEESLRIV AHAEPRRGVR QYDLLVDGIS VFNLPKLAEI GQPLSATSTP WELPLDTSHE TESRSPRSSP IPVHTIDCIE SIDSMEHQTW TDADQAKARL ASVGLALHPD TEESDDLHSD LYSPIVNSMR NLITAHLPQT EETVSRAFTN ALIKDSDSYT SESSLSDSSC LHDAMQIEVN ALWEAFRWVR SNADQIMFSD GEELQLEYMR EQIEAVFAKV CHECLTPGEA SRILLHVGAI LGLKFHRNIL MDTILVDGLS NYCTVNDLEA ALRPFGRIVS IAMVQGHGFG CCRFVDDGPL LRLQRRGLTF TIAGTKAQGM VISSLHEFNT STRYSQGENG EFSEEEHSDQ PTMQRTSACI MPSSLTAACL D
|
| |