Gene Information |
Locus tag | PHATRDRAFT_49572 |
Symbol | |
ID | 7198237 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | + |
Start bp | 80036 |
End bp | 83199 |
Gene Length | 3164 bp |
Protein Length | 975 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184301 |
Protein GI | 219128189 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.280952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAAC CCAAATCTTG CGTATCGTAC AGTGTCCTCT ATTACAAGCG GGCGGCCAAT ACAAAAAAGG TCTACCAACG CAAAGGCGTG ACAACGTTGG ACGGAGTCTT GACGATTCAT TCCGATTCCT CCAGGATTGT GTTGCGATCG GCGGATGGGG CAGATGTGCC GACACAGAAC GACGCGTTGA ATGCATGGAG TAAAGCGGCC AAGGCACCGG GAATGCTGGT ACTGTCCTCG GTCCAACGCG ACGTTGCACA GCAAACGTGG TCGGAAGACG ACGAATTTAC CGTGGGGCCG TATCGAATTG AAATCTTGAA GCGCCTGGAT CCGAACGAAA ACTGCAGCCC CGTCATCAAC AACATTGACA TTGTCCGAGG CCCGTCAGTC GTACACGGCC AGACAGTACG AACGGGAGTC CCAATGCAGG CTTTACCGTC CAGAACTCGA CTGCCGCTCA AACGCAAGGC TCTACCTTTG CAATCCACAT CCACTTTGTC TTTGGGGAAA CGAGTAATCG GCGCACTAGC TCGCAAGACA ATCCCTGCCA CAACTCCAGC AATAGGATCC ATAGCACAAT CTTGTGAACT AGTAAAGAGC GGAATAGCAA GCAAATGTAC GGTGGACAAG GGGACGTCTC GCACACTTCT GCCCACTGCG CGCCCATCCC CACCGACGAT GGTCCGCCGG ACTCCGCTGG TGTCCCGATC CAACACTACA ACCACAACCT CCCAAAGCAA TCTGAAGCGA CGCACGACGC CATTAATGGG GTTGAAATAC ACGTTGGCGA AAAATCGGGC CAAGCTGGCG TCGTGCCCGG CAACTCGTCG ATACCTACCG GGCGCAACCA GCACCTCAAC CATTCCCGCG ACAAACACAT CTCCTCCGCA ATCGTCCACC AATTTACTGG CGAACGTACC TCTACCTGCC TCGGTACGAT CCGTGCTTCG CCCTCATCAA GAAGAAGGAG TGGAATTTTT GTGGCAAGCA CTGGCACCAA TGGCCGTCTC CGGCAACGAG CAAAATGACA CCCAAAGTCC CGCAAGGGGC GCTATTTTGG CCGACGAAAT GGGTTTGGGG AAGACGCTCA TGACTATTGC CATCATTGCC GGTCTGCATC GTCGACAAAG AGACAAGGTG AGTCCAAACA GCAAAGCGTG GTGTCGTTTG TTCGTACTCG CATAGATGAC TCGCCATGCA GACAACGCCT TTGTCAGTCA AACCCTCACA GCTCTTGTGT TTTGGTTCGT TTCAGCAATT CATCGTGGTC TGCCCTTCTT CACTCGTTAC CAATTGGGCC AGGGAGTTTG ACAAGTGGAT CGGCCGGGCC AGTCAACCAA AACGTGTAGT CATTCAGAAA GGCGGCGAAG AGGGGGTCGC GGCTATGCGG GCGTACTGCG CTGGCATGTT GAAAAAGAAA AAGCAATTGC AGAAGATTGG TCAAGTATTG ATTGTCTCGT ATGATTTACT TCGGCGGCAG GTCGAGCATC TCCAAGATGC CTGTGCATTC GGTTTACTTG TCGTGGATGA AGGCCATCGT CTCAAAAATA CTTCCGGATC ATTGACCTTG ACGGCCTTGG AATCGCTGAC AGCTGATGCT CGCTTATGCA TTACAGCTAC GCCGATGCAA AACAATCTTT CCGAATTTTA CAATCTCGTC AACTTTGTTC GTCCTGACGT GCTCGGATCT CTCAACGAAT TCCGTGACAG TTTTGATCGG CCAATCTCTG CTGCCAATCA CAAGCACGCC ACACCGTCCC AAATTGCGAC GAGCCGGGAG CGATCAAGTG CCTTGGAGAC 
CCTAACCAAA CCCTTTATTT TGAGAAGACT ACAGGCGGAT GTATTGAAGA GCATGTTGCC ACCTCGAGTG GAAACTCTTT TATTTTGCCG GCCTTCGGAA ACTCAACGCG CTCTTTACCA CCAATTGACG GCTCGCATTT CGGGTGGCAG TTGTACCGAT GGCGGCACTG ACGCTCTCAA AACTCTGACA ACGCTGCGTA AAATCTGCAC ACATCCATCC ATCTGCAATG ATGACAATGT CAAACCATGG AATCGGCCAG AGAAAGGACC TTGCCTCAAG TATGACATTG CTCTGTCTGG AAAGATGACT GTGTTAGATA AGTTGCTGCA GTCGATTCGT GAGAACGCTC CGAATGACAA GATTGTGGTA GTTTCAAACT ACACTTCCGC CTTGACAATC GTGGAGTCCC TCATTCTCGG CCCACGTAAG CTTGGCTTTC TTCGTTTGGA CGGTGGTACC GAGTCATCAC AGCGACAGCC ACTCGTAGAG TCTTTCAATC GCTCTCATCC AGAGAAGGTT TTCTGTTTGC TCCTATCGTC CAAGGCCGGT GGCTGCGGTT TGAACTTGGT AGGCGCGAAT CGTCTCTTGT TGCTCGATCC AGATTGGAAC CCGGCCTCGG ACGTGCAGGC CATGGGACGC GTCTATCGAC AAGGCCAGAC GAAGCCGTGT TGGATCTATA GACTTTTTAC CACTGGCACC GTAGAAGAAG TCATTCTACA ACGCCAATTG CAGAAAGGAA ATTTAACAGC GTGGACAGTT GATGGTGGAA AGAGCTCACG ACAAAACAGC TCGGATTCTC GAGCTAAATT TTCAAAGGAA GAGTTGACTG CTGCATTCAC GCTCAAGGAC GAATATTGTA TCTGTGATAC AAAGCAAAAA ATGGGCCTCG CCTGGCCTGC GTACAATCGA GGGACCCTTT CGCAGTATGA CGATGCTCCT ATGAAGGAAA CGGCCATGTC CTTGTCCGAG ACTTTGAGTT TCGTACATGT AGTCAACGAC GATGCATGTG CAGAAGAAGA ACAGGACGCT GCTTTATCCA CTGCAAGCAG CACCCATTCT GCTTTGGTCA ATTTTTCATC TAACGAAGTG GTTTCTGATA CCAAACAAAG TAACTGCAAA AAACATTTGG AGGTCTTCCA TTGCGATGAT TCGTTGGCAG TGGGCGATAG TGATTCCGAA GAAGAATTCT GAATACTAGC TGCAAAATCA TCCTTGATAC TTGCTTGGCA AGTCTATCTG TACTTGGCCT CTTACATCGT CAACTCATTT TCTTGCGACA GCACTGTTTT CGAAAGATAA TGGGAGTTAA GATGGGTACC GTCACCTTTC ACAACTAATA TAGCTTGCAC TGTTATCTTT GTTG
|
Protein sequence | MTKPKSCVSY SVLYYKRAAN TKKVYQRKGV TTLDGVLTIH SDSSRIVLRS ADGADVPTQN DALNAWSKAA KAPGMLVLSS VQRDVAQQTW SEDDEFTVGP YRIEILKRLD PNENCSPVIN NIDIVRGPSV VHGQTVRTGV PMQALPSRTR LPLKRKALPL QSTSTLSLGK RVIGALARKT IPATTPAIGS IAQSCELVKS GIASKCTVDK GTSRTLLPTA RPSPPTMVRR TPLVSRSNTT TTTSQSNLKR RTTPLMGLKY TLAKNRAKLA SCPATRRYLP GATSTSTIPA TNTSPPQSST NLLANVPLPA SVRSVLRPHQ EEGVEFLWQA LAPMAVSGNE QNDTQSPARG AILADEMGLG KTLMTIAIIA GLHRRQRDKT TPLSVKPSQL LCFGSFQQFI VVCPSSLVTN WAREFDKWIG RASQPKRVVI QKGGEEGVAA MRAYCAGMLK KKKQLQKIGQ VLIVSYDLLR RQVEHLQDAC AFGLLVVDEG HRLKNTSGSL TLTALESLTA DARLCITATP MQNNLSEFYN LVNFVRPDVL GSLNEFRDSF DRPISAANHK HATPSQIATS RERSSALETL TKPFILRRLQ ADVLKSMLPP RVETLLFCRP SETQRALYHQ LTARISGGSC TDGGTDALKT LTTLRKICTH PSICNDDNVK PWNRPEKGPC LKYDIALSGK MTVLDKLLQS IRENAPNDKI VVVSNYTSAL TIVESLILGP RKLGFLRLDG GTESSQRQPL VESFNRSHPE KVFCLLLSSK AGGCGLNLVG ANRLLLLDPD WNPASDVQAM GRVYRQGQTK PCWIYRLFTT GTVEEVILQR QLQKGNLTAW TVDGGKSSRQ NSSDSRAKFS KEELTAAFTL KDEYCICDTK QKMGLAWPAY NRGTLSQYDD APMKETAMSL SETLSFVHVV NDDACAEEEQ DAALSTASST HSALVNFSSN EVVSDTKQSN CKKHLEVFHC DDSLAVGDSD SEEEF
|
| |
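The derived fields in this record (Gene Length, GC content) can be cross-checked against the coordinates and the sequence. The following is a minimal sketch, not the database's own method: it assumes the Start bp and End bp values are 1-based and inclusive (which agrees with 83199 - 80036 + 1 = 3164), and the helper names `clean_sequence`, `span_length`, and `gc_fraction` are hypothetical, introduced only for illustration.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character groups."""
    return "".join(grouped.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty one)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(span_length(80036, 83199))  # 3164, matching the Gene Length field

    # First two 10-base groups of the Gene sequence field, as a short demo.
    demo = clean_sequence("ATGACGAAAC CCAAATCTTG")
    print(len(demo), gc_fraction(demo))
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` gives a value that can be compared with the reported GC content, which the record rounds to a whole percentage.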