Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50525 |
Symbol | |
ID | 7199411 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | - |
Start bp | 1369 |
End bp | 3736 |
Gene Length | 2368 bp |
Protein Length | 688 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185505 |
Protein GI | 219130717 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.196683 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTTTGTCCA TGATAAAAAG GTCGGCTCAT ACTACTGGTC TGAGTTCTCA TTAAACGCAC GGAACATATC ATCATTACAG CAAGCTCCAT ACATCTCAAG TAAAATAAGC ATGACGGATA TGATTTCCAA AGGCCCGTCT CTCGTGTTTG GTGCTTCGGG CGAACAAGGC CGAGTTGTTG TTGAAGGTTT AGTCGACACA GGCTACTCTC CAGTATATGC GTTCACACGT TTAAAGCACG ATACCTATTT GACAGACGCA ATCGGGGCCA AGCTGATTAC AGGTGATTTG GAAAATCCTG ATGACGTCCG CATCGCCCTG AGGCAAACAC GAGCGCAGAC TGTCTACCTC GTGACGACTA CCGCCTTACC CACGGAAATT GGCCAAACAA CAGGATTTTC CGCCGCTGCC ACTGCAGAAT TTCAAGCCAT TGTCAACTTT TTTCATCTCT TAAAAGCAGT GTACGATGAA GACAAATTAC CTCGTCATGT GGTGCTTTCC ACTCGGGACA ACGTTCAAGA AGTCACGCGC AAGAATTTCG AGGAAACAGG AGATATCTGG ATTGATCCAT TGGACGACGG TAGTATTGTT CCCCATTATA GTGCCAAGGG AAAGGGTGCT CAGTACGCGC TTAAGTACTT AGATGACATT GGGGATCTTA AACTTACTTG CCTCGCTTTA CCGTTCATGT ATTCCAATTT CTTGGGATTC TTTTGTCCCC TTCCAAACGA AGGGCGCACT CAATGGGTTC TCTCGGCCTC ACTTGGCGAA GGCAAAGTAG ACATGATGGG AAGTGCCGAT CTCGCTGCGA TTGTTCGTAA GTACCTGTGG TCGCATTGAA TCTGGAATTG GGCACTCTTC GCTAACAACC AGACATTCAC TTATATTGTC GTTGACAGCC AACATCTTTG CGAATTCCGA CAAGTACGAC GGCGAGATCA TTCGTCTTGT TGGCGAACGT CTCACAATCG ATGAAGTGGC AGGGGCTTTT GCAGATTTGT TTGGCAAAGA TGTCATTTAT AACCCGCTTA CGCCTGCCGA GGTAGCGGCA CTTCCTTTTG CCGCGGCTCC GGCAATGGCA CAAATGTGTC AATTTTTGGG CGATCCCCGA TCCTTGCAGC ACGATTTGGA AGTGAGCAAA GAAGTCGCTT TTCCGAAGCA ACTCCAGCGA TTTGAAGACT GGTTACTTAC ACATTCAGAC TCGACGGCGT TTACGCAGGT TGGATTGGAC GTAGATGCGC CTGACATAGA ATCTGTCACA GTCTTTGGCG CGACCAGTTC AGAAGGAGTG TCGGTTGTCA AGGGGCTTCT GGCTGATACA CGCAAGTCGT ACCGCATTCG CGCCACGACA CGTCATTTGG ACTCAGAAAA GGCCAAAGCT CTGCAGAAGC TGGACCCGTC TCGTATCGAT TTAGTGTATG CAGACTTCGA CGATATTGAC TCCTGTCGTG CCGCTCTGGC TGGTGAATTC TCACAAGGTG TCTTTCTAGC CACAGATTTT TACGAAGACG CGGCGCAGGA CATGGAAGCC GAAGAACAGC ACGCCAAAAA TGTTATAGAT GCCTGCGAAG CTGCCAAGAA CGTGAAGCAC GTGGTCTTCT CGACAATGGA GTCGGTAGAA GAAATGAATC AAAAACTAAA TCTCGGATTA CCAAAAGTTA TCGACAACAA AGGGAAAGAA GGAACAATTG TGCAGTTTGA TGCAAAGGCT CGAGCGGCGG CCTACGCGCG CACTAAGAAG ATTTCGGTAA CTTATGTTTT GATGCCGTGC TATTCTGAGG TCTTTTTCGA TATGATCGAG AAACGCATCG AACAAGGGAA GGAGAAGCTG GTTTTGACTA TTCCTCTGAA AAACGATGCG AAGGTCATGT GTATGAGTGT TGACGAGCTT GGCCCTGCGG TAGCCAACAT TTTCGATAGT TACCAAGTTT ATGCAGGCCA TGAAATCGGC CTTGTTACCG ACTTCGTTTC TGTTGCAGAA GTCAAGGATT TGATCGCGGA AATTTTCCTC GCCAATGAGA AAGATGCAAT GACACTAGAA ACTGAGGAAG TATCATCCGA CGATTGGATA GAGGCCAAAG ATACTTATAT GAAGGATTTG GGACAAATGT TTGCTTACAT GTCTCATTCG GATGCTGTCA AAATGCGCCG ATCAATTGCA AAGACAATGA AGCTTGTCCC AGAAGCTCGT GCGCTCCGAC AATGGGTCGA ACAGAACCGA GAGAATGTAG CTTTCCGCGA AAAGCTTGGT CTCCGCTGAT ACAGCCGTGT CAAGCTTCAT GAAAGACCCT CTAGGATGGA GAAGTTCAAA CTGAAAGGAA TGGATGCATA TTCTTCACAT AAATGTTAAT GAAGCAAGTT ACAGTTTC
|
Protein sequence | MTDMISKGPS LVFGASGEQG RVVVEGLVDT GYSPVYAFTR LKHDTYLTDA IGAKLITGDL ENPDDVRIAL RQTRAQTVYL VTTTALPTEI GQTTGFSAAA TAEFQAIVNF FHLLKAVYDE DKLPRHVVLS TRDNVQEVTR KNFEETGDIW IDPLDDGSIV PHYSAKGKGA QYALKYLDDI GDLKLTCLAL PFMYSNFLGF FCPLPNEGRT QWVLSASLGE GKVDMMGSAD LAAIVPNIFA NSDKYDGEII RLVGERLTID EVAGAFADLF GKDVIYNPLT PAEVAALPFA AAPAMAQMCQ FLGDPRSLQH DLEVSKEVAF PKQLQRFEDW LLTHSDSTAF TQVGLDVDAP DIESVTVFGA TSSEGVSVVK GLLADTRKSY RIRATTRHLD SEKAKALQKL DPSRIDLVYA DFDDIDSCRA ALAGEFSQGV FLATDFYEDA AQDMEAEEQH AKNVIDACEA AKNVKHVVFS TMESVEEMNQ KLNLGLPKVI DNKGKEGTIV QFDAKARAAA YARTKKISVT YVLMPCYSEV FFDMIEKRIE QGKEKLVLTI PLKNDAKVMC MSVDELGPAV ANIFDSYQVY AGHEIGLVTD FVSVAEVKDL IAEIFLANEK DAMTLETEEV SSDDWIEAKD TYMKDLGQMF AYMSHSDAVK MRRSIAKTMK LVPEARALRQ WVEQNRENVA FREKLGLR
|
| |