Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49068 |
Symbol | |
ID | 7195311 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 486592 |
End bp | 491227 |
Gene Length | 4636 bp |
Protein Length | 1472 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183758 |
Protein GI | 219127052 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTCTG AATCAAGGGT GTGTGGGGCT GCGTCTGACG ACGACTCTTT CAGTTTCACG CAGGTGAAAG AGGCGTTACA ATCAATGGCA GCTGATGAAA ACAGAGAATT TCTCGAAGCG CAGCGCCTCG CACCTCATCT GGTCGATACA GAGTCTCAGA TTTTACACTA CCTGAAATGC TATGACTATG ATGTCCAGGT CGTCGCCAGT TGCATAGCGA AGTATTGGAC GCGACGTAAG CACATCTTTG GCGAGAGGGC TTTTTTATCA CTACTCGACC TGAGTGGCGC TGGTGCGATG TCGACTGATG ATTTAGAACA TTTGGCAATT GGGTCGGGTG TACCGTTGCC GAACGATACA GAAGGTTGTT CCGTTTTCTG TTTCGATCCT TCCCGGAATG AAGCCGAAGA TGCAGAAATG TCGTGGACTC GTGGCTTACG CTGTTTCTTT TTCTGGATGC AAGTAGCTTC CTCAAACGAA AAGTCTCAGA CTGAGGGTGT TGTATTTGTT CGAGTTTTAG ACACCATACG ATGGTCAAAA AAGCTTGGAC AGCTTCTTGA TTTAATGGAT AGAACTTTCG CGCTCAAGTT GAAAGCTATC CACTACTGTC TACCGCAATC TAATGATGGA GGCATAAGCT CTCTTATCAA GAGAACCTTT CCCTACTGGA CAACTGGACT AGTTGGGTTC CTAACGCCCA CCACAGAATT TCACGGCGGC CCGGAACTAC TCTCCGAGCT CACTAGGCAC GGGCTTTCGA AGGAATCGAT TCCGGAAAAA CTTGGAGGTT CCTGGAGCTA CGACAATTTC GATAAATGGA TGGAGATCAA TACGCCAAGA ACTGGACAAG CTTTGAAGAA GTCGAGTGTA TATACTGGAC CCAAGCCTAA GTTGTCAGCA ATCTCAGAAG AAGCGTATTT TTCTTTAGCC CAGGAAATGC ATCAATTTGA GAAGTGGAGT GAATCTAGAT GCCCATCCCG CAATAAAGGA CCTGCTGAGG GAGTTGGAGC GCCGGTACCA AGGCATGATA TGAACGCACC GGAGAATTAT TCTGGTAAGT GCATAAATCC ATTATATTGA AGAGATATAC CAATGGTCTA AAATTGCTAA CAGTAAAATT TTATCTCTGT TGTGCCACAA CCTAAGCTGT AAGGCATCTG CAATGGGACA GCTTGACTGA TGAAGCTATC ACCCTTCTAC CCAAGCAGGA ATCAGCGGCT TATTTCGAGG CGAAAGAGGA AATGCCTGAT TTGGTCAAAG TCGAGTCCAA TCCGCACAAT TTTTTGGAGT TTGAACAATT CGACGCTTTG GCTGCTTCCA AAAGGCTTGC GGCGTACTGG AAGCATCGAA AGAAACTCTT CGGGCCGAAG ACGTTGCTAC CAATGGTCTC CGTTGGGACT GGTGCCCTAT CGTCGGAGGA GGTTCTCGCG CTTCAAAGTG AAAAGGTCAC TCTGCTTCCT GCCTGTAGAA ACGGGTCGTG CGTGCTCTAT TGCGATGGCA CACGAGTTGC TGATTTGACG TCCTCGGAGG CTCTGTATCG GACAATGTTT TATCTCGCTT GTATCGGGAC TGATTTTTTT AGGTCAAAGA ACAAATTGGT CATTTTGATC AGTTTCAATG CTCCACAAAG TCTTCCCGGT TCCCTTGAGT CTGTTCAGGA ATTTTTGGGA ATGCTTGACG ACGGTTTCCC AATCGAAAGC ATCCATTTCC TTACGATCTT GAAAGACACG TCACAAGCCG CTCGCCAGGA CTCTATTGCT TTCAATGCCT CTAGTCATGT CCAATACAAG CTGAGAGGCA AAACTTTCGT TCATGTTGCA CATTCACCGG CAGAAGCGCT TTTGAAGCTC GAACCACATG GCTTTACAGT TTCGGGGCTT CCGAGCAATA TTGGTGGAAC TTGGATAAAC GGCACATTCA AAAAAATGCT CGTACACAAG AACCTCGAGG AGATGTCGCG TTTAGCCAAA CCATCGATGT CGAAAATAAC CGGTTCCGGA AATGAAACTG TACAGGAAAG TCCTTTTGTT GTTCGTCATG CTATTCAGCA ACTAGAAGGT GCCTTCGATT TCATCGCTGA AGACGAAAAA AAGGACTACC TCGAGGCTCT GCGACGTGTT CCAGGCCTGG TGTTCGCGGA GTCTCCTCCC CTTCGCTTTC TTCGCTTCGA AAAGTTTAAT GCATGGGCGG CGGCGCAGCG GCTGGCCTGT TATTGGAAGT GGAGAAAGAA GATTTTCAAG GGAAAGGCCT TTTTGCCCAT GACACAGTTT GGTGCTATAG GCGACGACGT CGTGCGAACC CTGAGCTCAG GTTATTTCAT TATTTTACCG AGCGATAAAA GTGGACGTCC GGTGGCATGC TTAGACCCCT CGAAGAGAAT CAACTATTCG CTTGACACAA GACTCCGGCT CAGTTTCTAC TTGTGGACAA TACTTTGCGA ATATCCTTTA GCATGCACAG AAGGCTATAT TGGGATTGTC CTTATGGGAC AAGCAGGAGC TCCGAAATCT AGCTTCGATT CGGCAATCAC TGTGTGCTCT GATATGGTCA GGGAGGCGTT TCCAACAAGG CACCAACATC TCCACTTGGT CCACTGTCGC GCTAAATCTG AGGGAAGATC TTTTTTGCGC TTAATCCTCC CAACAGTGCT CAAAGTCTTA GGAGCAATGG TGAGTCACAG AACAAGCTTT CACAGCAAGG AAGGGCATGA CGAAATGGTT AAAAAGTTGG AGACTTTTGG TCTCGACGGA GCCTCATTGC CGAAATGCTG CGGAGGTTCG TTGAGCGACG ACTACAAAGT GCTATGGAGA GCCGACCGGC TGCAAGCGGA ACGCGAAAGT CACAATACTG CAGGGCCTTC TCTTCGATTG CTAATGTCGG CAGCTTTGGA CAACGAAGGT TCGCCCACAG TCCATGAGCC AAATGCAAGC GGAGTCACTG CTAGTACGGT CGGAACAGGC GGGGAACAAG CTCAGTTGGC AATAGAAAGA GCCCTGGGTG CGCTTCCAGT CGCGAAGAAG AATGCTATGG AGGAGGCAGC TCGTCGAAAT ACGGAATTGC ACTCGTTCAA TTCAAGCTAT GCTCGTTACT TAAGATACGC AAACAACGAT CCAGAAGTGG CAGCGCAGAA GATGGCGTCA TACTGGCAAA CGAGGAAATC GATATTTGGT AACAAGTTCG TTCTGCCTCT TACTCAAACT GGTGAAGGTG CTCTGGGTCG TAGGGAACTG AACCTTCTCG GTACTGGCTT CTTTTCTCTT CTCCCCAAGG ATCAAGAAGG CAATTCAGTC GTTTGTTTCG ATCCGTCTAA ACTCGTAAAA GCTTCGTTGG ATTGCAGACT TCGTGTCATG TTCTATGTTT TGGATCTTGC TGCTTCCAAC GCCCAGTCAC GTCAAGGTGG CGTTGTTATG GTTTGCCTTA TCAACAAAGT CAAGGCTGAA AGGGGAAAGA AATTAAACGC CGAGCTGGTG TTGAACTCAC TACCGATCCA CATCAGAGCG GTGCACTTAA TACAAAGCTG TGCGGAAGGC CAGCCTTTGA TATCTCGCGA AAAGGGCCTA GCATCAATGC GGCAGGTTTT TGGACACGTC GTGGATGAAA GGGTGATCTT TCACGACAGG TCTACCAAGG GTTCTATTGT GGAAGCAATG AGAAGGGAGG GCCTGAGCAA GGAAGGTCTC CCCAAGACAA TCGGTGGGGA ATGGGGTTTT GAACACTTCA TTCAGTGGAT GGAACTTCAG ACACGGTTGG AATGGAACTT ACCTGCCGGA TCTGGAGGCA AAAACACCGC TGAAAGGATG TTTGACTTCA CTGGTGTCAA ACCTCTGCAA TCACTGTCGG AAGAAGAAAG GAAGGAGAGA AAACGACGAA TGAATGTCGT ACACTCTCGT CGGAAGCGTG AAAGGGAAAG GATCGAAATT GAGGTCTTGC AGGAGCAGTG CACTGATCTC AGCGATCGCA ACCTTGAACT TTCTCGGAAA AACGCTTCAT TTGAAGAGTT GCTGAGCGAG GCGCAAATTG TAATCAGTAG GGTCGGGCAA TTCTCTCGGG AACAAGAACT TCGTTCGATC GCAGCTGATC AGGTTCACTC AGAACGGCTT GCTCCGGGAA TTACCGAGCC TGTAACGGGA GGAGGAATCC AGTTCGCTAG CCGTGCAGCC CCGTTCAACT TTTCACTAGG AAGCTTTTAC TCGAGAGGGT CTGATTCCCA AGACCAGCAT TGGCCGGCTC CGAGACAAGG CAGTCTGAAT CCTTCGCTTC AGCATCAGAT ACAATCCCCG TTTGATCAAG GGGTACGTAC GGCCCTACTC GAAGAAAACG AAACGCTTCG GTGGACTATT CGAGAATTGC AACGTCGCCA GCTTGAAGAA GAATTGGGAC GACAAGAACT AGAACATCAG TTACGGTCAC AAAATGCTCG ATCGCCTGAC GGACCCGGCG GTAGACTAGG CTCCCGTATT CAGCAAGGCG ATCAAAACAA TCGCAATCTC TGGAACTTTT TCTTTGGTTG AAAGACTTTC ATTGATAAAT TTATTGTCAA TACGCGTTGA AAAAAATATC TAAAAACAGT AATTGTCAGT GAACAATGAG CGCAGGTAGT CTTTGACGTA AAAGTAAAAT ATCAAATGGC ATTATT
|
Protein sequence | MKSESRVCGA ASDDDSFSFT QVKEALQSMA ADENREFLEA QRLAPHLVDT ESQILHYLKC YDYDVQVVAS CIAKYWTRRK HIFGERAFLS LLDLSGAGAM STDDLEHLAI GSGVPLPNDT EGCSVFCFDP SRNEAEDAEM SWTRGLRCFF FWMQVASSNE KSQTEGVVFV RVLDTIRWSK KLGQLLDLMD RTFALKLKAI HYCLPQSNDG GISSLIKRTF PYWTTGLVGF LTPTTEFHGG PELLSELTRH GLSKESIPEK LGGSWSYDNF DKWMEINTPR TGQALKKSSV YTGPKPKLSA ISEEAYFSLA QEMHQFEKWS ESRCPSRNKG PAEGVGAPVP RHDMNAPENY SAVRHLQWDS LTDEAITLLP KQESAAYFEA KEEMPDLVKV ESNPHNFLEF EQFDALAASK RLAAYWKHRK KLFGPKTLLP MVSVGTGALS SEEVLALQSE KVTLLPACRN GSCVLYCDGT RVADLTSSEA LYRTMFYLAC IGTDFFRSKN KLVILISFNA PQSLPGSLES VQEFLGMLDD GFPIESIHFL TILKDTSQAA RQDSIAFNAS SHVQYKLRGK TFVHVAHSPA EALLKLEPHG FTVSGLPSNI GGTWINGTFK KMLVHKNLEE MSRLAKPSMS KITGSGNETV QESPFVVRHA IQQLEGAFDF IAEDEKKDYL EALRRVPGLV FAESPPLRFL RFEKFNAWAA AQRLACYWKW RKKIFKGKAF LPMTQFGAIG DDVVRTLSSG YFIILPSDKS GRPVACLDPS KRINYSLDTR LRLSFYLWTI LCEYPLACTE GYIGIVLMGQ AGAPKSSFDS AITVCSDMVR EAFPTRHQHL HLVHCRAKSE GRSFLRLILP TVLKVLGAMV SHRTSFHSKE GHDEMVKKLE TFGLDGASLP KCCGGSLSDD YKVLWRADRL QAERESHNTA GPSLRLLMSA ALDNEGSPTV HEPNASGVTA STVGTGGEQA QLAIERALGA LPVAKKNAME EAARRNTELH SFNSSYARYL RYANNDPEVA AQKMASYWQT RKSIFGNKFV LPLTQTGEGA LGRRELNLLG TGFFSLLPKD QEGNSVVCFD PSKLVKASLD CRLRVMFYVL DLAASNAQSR QGGVVMVCLI NKVKAERGKK LNAELVLNSL PIHIRAVHLI QSCAEGQPLI SREKGLASMR QVFGHVVDER VIFHDRSTKG SIVEAMRREG LSKEGLPKTI GGEWGFEHFI QWMELQTRLE WNLPAGSGGK NTAERMFDFT GVKPLQSLSE EERKERKRRM NVVHSRRKRE RERIEIEVLQ EQCTDLSDRN LELSRKNASF EELLSEAQIV ISRVGQFSRE QELRSIAADQ VHSERLAPGI TEPVTGGGIQ FASRAAPFNF SLGSFYSRGS DSQDQHWPAP RQGSLNPSLQ HQIQSPFDQG VRTALLEENE TLRWTIRELQ RRQLEEELGR QELEHQLRSQ NARSPDGPGG RLGSRIQQGD QNNRNLWNFF FG
|
| |