Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45032 |
Symbol | |
ID | 7199537 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 1059020 |
End bp | 1063447 |
Gene Length | 4428 bp |
Protein Length | 1312 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179114 |
Protein GI | 219116638 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGCGCT CCCGAGGGCG ATCCGTTTCG AGTGGGTCGT ATAAGGAAAT GAACACCAGT TCGATCTCGA CCAAAATGAT GTCGCCTGAA GGGAGAAACG GTAGTACCGC CGAAGTAACA CGAGAGCGCG TGTCGGTGGT GCGGGAGGTA TTGCAAGCTA TCCTGATGCA CTCAAATCCT ACCTCGGGTC GTTCGTTTCT GCATGGTGAA GCGCTCTCAA GTGGTGATTC GCACATAGTT GCTGCTGAAG TCGATACTGT TATCCAGACT TCACCAGATT GAGCAGCTTC ACAAGATCCA CTCCAGCGAT CACAAGGCGG GGGAATTTGC TTGCAATCTG GAGGAACAGA TATTGACGCT CCCAGACAAA GAGTCCGGGT ACACGCCACT TCACTGGGCA ATTCTGCGAG GCGATCTCGC TAGTATTCTG TTGCTCGTGC GCCATTGCCT GACGGCCACG CACGAACACG ATTCCTTGTC TCGTCGTCTA TCGAAACGAC CTATGGACGT ACTGCAAGGT TACGGCACGA ATGTTAGTGT CATGTTAGCA AAGCTTATTG CGGCACGGGA CCAGGAGGGG CTCACTCCGT CCGATTTACT GGCGATACAG CAACGCTCCG AGCTTGCAGC TTGTCGACAA GCGCTCCTAT CAATATCCAG TCAAATAAGT TCGGTCTCGA CAGGCAGGCA AGATAACACT TGGCTGAACG CTTCTGGATT GCATTTGACT GATGCGGATG AAACAGACGA AGCGAGCGAC ACTCTCCGAG ATACAATTCA TCTTGAAACA GAAGCCACAG AAAGTTCGTT GTTGGATTGC AAAAGGAATC TGTCTGAATA CGGTTGCGAA GTTTTAACGT TTGGAAGTGC ACACATCTGC GCGTCAGGAG TGTCAACTAG TGCATCATGT TCCAGTAGCG GTACTACGTT GAACGCAAGG CCTATTGTTG GAGCGGCCAC TTCTCGTCCT CAGCGCGTCC AGGCCTTTGG GCAGGAGCGA GTTGGACGCA TGGGCGGAGC AGTTTCTGTC TCAGCTGCAT CGCACCATAC GCTGGTTCTG ACTCGAAATG GACACGTGTA TGCCTTTGGG CTTGGCAAAG GCGGCCGTTT AGGAACGGGA GACGAGTCGC ATTGCGCTTT ACCAACACGA GTTGTAGGCT TATCTCATCA CAAAGTTGTG GGGATTTCGG CAGCTGAGTC CCATTCCCTC TGTGTTACAA AGACTGGGAT CGTCTTTGCT TGGGGATCGA ATCGCTTCGG ACAGCTGGGT TTGACTTTGG ATGACTGTAG CACACGATCC GTTCCCCGGA GGATCGATAA TCTCAAGAAT ACGCAATGTG TCGCAGTTGC TGCCGGTGCC AAGCATTCCG TTGCGCTTTC CAGAATAGGG GAAGTTTTCG TATGGGGTGA CAATACGGCG GGGCAATTGG GTGTGAGTCG ACGTAACGGA ACGCACCGTG TACATCGAGT AGAAGCTTTG TGGGGTACAT CGCCTCCCAA AGTTGCAATG TCCATTTGTG CTTCAGAAAA AACGACTCTC GTGCTGACGC TACCATGGGG CCGCACCGGC GTACCAGTGA ACAGCATCTA CACTTGGGGA CACGGCAACC ATGTGCCAAG CAAGATTCAA CTGAATCCGT CGGTCGAAAC TCGAAACCGC TTAGTGAATC CTGTTAGCAT TGCCTGCGCT CGTTTTCACA ACGTAGTAGT TTCTTCGGAC GGGTTAGTAT ACACATGGGG CTTACACGCG GAGTCCTTGG GAACCCCAGT CTCAGCAAAA AAAGGCACTG CAATGGCTGT GCCACAATTG GTACAAGGTA TGCTCCCGCA CAATGGGGGA GGTGTTGCTG TTGGTGTGTC AGCGTCGGAG AATCACACTG CGGTTATCAC GGATACGGGT GCCTTGTATA CCTGGGGGGC TGCCTATGGT AAAGATGTGC TGGGCCACGA AGGAATCCGG TGGCAACCGG ACCCGAAACG TGTTCCAGGA ATTCATCGGG CCGTTAGCGT GTCGGCTGCA AAGGAGCATA CGGTTCTTCT TGTCGGTGCT ACCTTTTCAC GGAGTCCCCA TCGACGGTCG GACTTCGTGC CCTCCCTAGA GAGCCTTGCC GCCGAGATAG TTGCGCATCA TGTGGACCTA TTTAATGTTA TCCCTGTTTT GATAACCGCT GAACGGATAC ATTCCCCGTT TTTGGTTGAA TACTGCGACA ATTTTATACG CCGCAATCTG GATGCTGTTT TGGACTTTGG GCAGAGAAGT GTTATGAACG TCTACTTGGA AGAACAAATT GCAAAGATGT CTCCTCGGGG CGAAATGAGC CAGGATGGAC ATATTCATCC ATTGTTTTTT GACATTGCAC TGGCAGGCGC AGAGAACAAG AAGATGCAAT CTTTGGAACG TAGCTCGTCT TGCGGTGTAC ATCAGTGGCT GGAAGCCTGT ACAGGGCTTC GCGACAAAGC CTCTACGAAA TATTTTGAGA GGCATCACAA ACTGTCGCGC GCTTTTACAT TGGCGGATAC GAGTTGTTCC ATGAGGCCCC TTTGTCAGCC GCCTCAAAAC AGTACAAGAG GGATCAGCGG GAATGGAAAA AATGATCGTT CTTGCTCGGG CCAGTGCAAA CAGGTAACTT CCGAGATAAA CCTTTCCACC AAGTCTTCAA CGCTAGCGAA ATTTGACGCG TTGAATAAGG AGACGCGAGC ACTTCGAAAG CGCTTGGGTC AAATTTCCAT AATCGAAAAG GCTGCAGCGG GCGTGCAGTC TTTGACTCCA GAGCAGCAAG AGAAGGTTGC TCGTCGATCT GAGCTGGAGG CAGATCTGAT TGCTCTCCAA CCAGCGCTGA AGCAAATTGG TAGTCACTTG ACCGAAACGA GAATGGCAGA AAAGAAAGTG CAATCAGAGA GCCATATAGA GCAAGCGGAA GAAGCGAAGA TTCAAAATGA CAACGCAATT TCCCTATTTC GATGTGAAGT CTGTTCCATC ACATGCCCAG ACCTGCAAAA TTTAGAATTT CACTTGAACG GTCGCAAGCA TCGGAATAGA GTCGCCCATT TAGCATTCAA AGATGAAGAA CAAGCTAGAA AAGCTGTAGT TAATGAATGC AGGCGGAGGC AAATATTGGG TACGGAAGGA GGTGGCGGAG ACAGGGAACA TTTCTCGTCT CGACCATGGC AGTCACAAAA TGGCAAAGCA ATGACCTCAC TACCAAAGTA CCAGCTCCCC CCTCCACCTC ACCCTTTTTC GGGATCTGTG CCACCAGTTG GCCGTGGAGA AAAAAAAAGT TTGCAAAACA TAATGGAGGA GGAATCTCGC TTGCGGGCGG CAAGCAGCAA GGAGCCCACG AAGATACCAA AAGTAGGCAT CGCGAGACCA TCCGAAACTT TCAAGACTCT ACCACGATCT GTACCTCCAA TCATACAGCC GCTTGGATCA CCGTGTTCCA GAAAGTCAGC CCCTCGGGCT TTGCCAACTG CCACAGTCAG AACCGCAATG CCAGTTTCTA TGCTGCCATG TTCAGCAACA TCCTATTCCC CGGCTAATTC TTCACCCAAA GCATTGTATT CACTGGGAGA CTTTTGGTCA CCAAAAAAAG CAGCACCGCG GCCTGTAATT CCCGCGGTGG CTTCTTGGGC GTCCCCCAGA AGCAGCCCTT CAAATTCCGC AGTCACTACC CCGCCGAGCC AAAGTCTTAA AGATATACAG CAACAGGAAG AAGACTTGAA AACCAAGCAG GTTCCAAGCT GTGGGGAAAA TGGAAGGTGG TACATTGAGC GGCGGGAACG TGCGGACTCG TTGACCGAGA TTCAAGCAAA GGAGGCCAGG GAAAAGGAGT TCCGTCGCTT CGTGGAAGAG CAACGCGAGA TAGAAAAATC AATGATGCAC GAGCTGGTGA CCAGTAAAGA AGCGGACAAC AAAAACCCGG CACGTAAACG AATCCACTCA AAAGGCAAAA GGAGGTAACT TGAGTCTGCA CCGAAACATG ATTTGTGAAT ACGAAGACCC GAACAGGGGA TAAGGTAGAC AGGATTAAAA AGGTAAAAGG CAGCAAAGTT TTGCAGCAAG AAATGAGCTA TACCAGACGG ACGAGTTACC ATGAAAACCC TCTGAACCTT TGCACACATG CTTGTTGGCA GTATCAATCA GGAGAATCCA AGGTGTGGGC ATTCTAGCCT ATACTGAAAA CAAAAGAATA TGGATCCACT TTGTGGGAAC CATAAATTAG GCCAAGAAGG CCAGAATTCT CTGTTTTGTG TTCCAGAATT AGAGGTCCGG ACCGTCTATC TACTGTACTG GTAATCAAAA TGTCTGTCTT AGAGTCTCCG GGTGCTCTCT GGATACCGCG CAATGGTATC AACAGTCACT CAAACTCCAG AGCTAGTACC ATAGCGCTAT TGGTAGGGAA AGTAGGTAGT TCTGTACACT TGTTCTCC
|
Protein sequence | MGRSRGRSVS SGSYKEMNTS SISTKMMSPE GRNGSTAEVT RERVSVVREV LQAILMHSNP TSGRSFLHGE ALSSGDSHIV AAEVDTIEQL HKIHSSDHKA GEFACNLEEQ ILTLPDKESG YTPLHWAILR GDLASILLLV RHCLTATHEH DSLSRRLSKR PMDVLQGYGT NVSVMLAKLI AARDQEGLTP SDLLAIQQRS ELAACRQALL SISSQISSVS TGRQDNTWLN ASGLHLTDAD ETDEASDTLR DTIHLETEAT ESSLLDCKRN LSEYGCEVLT FGSAHICASG VSTSASCSSS GTTLNARPIV GAATSRPQRV QAFGQERVGR MGGAVSVSAA SHHTLVLTRN GHVYAFGLGK GGRLGTGDES HCALPTRVVG LSHHKVVGIS AAESHSLCVT KTGIVFAWGS NRFGQLGLTL DDCSTRSVPR RIDNLKNTQC VAVAAGAKHS VALSRIGEVF VWGDNTAGQL GVSRRNGTHR VHRVEALWGT SPPKVAMSIC ASEKTTLVLT LPWGRTGVPV NSIYTWGHGN HVPSKIQLNP SVETRNRLVN PVSIACARFH NVVVSSDGLV YTWGLHAESL GTPVSAKKGT AMAVPQLVQG MLPHNGGGVA VGVSASENHT AVITDTGALY TWGAAYGKDV LGHEGIRWQP DPKRVPGIHR AVSVSAAKEH TVLLVGATFS RSPHRRSDFV PSLESLAAEI VAHHVDLFNV IPVLITAERI HSPFLVEYCD NFIRRNLDAV LDFGQRSVMN VYLEEQIAKM SPRGEMSQDG HIHPLFFDIA LAGAENKKMQ SLERSSSCGV HQWLEACTGL RDKASTKYFE RHHKLSRAFT LADTSCSMRP LCQPPQNSTR GISGNGKNDR SCSGQCKQVT SEINLSTKSS TLAKFDALNK ETRALRKRLG QISIIEKAAA GVQSLTPEQQ EKVARRSELE ADLIALQPAL KQIGSHLTET RMAEKKVQSE SHIEQAEEAK IQNDNAISLF RCEVCSITCP DLQNLEFHLN GRKHRNRVAH LAFKDEEQAR KAVVNECRRR QILGTEGGGG DREHFSSRPW QSQNGKAMTS LPKYQLPPPP HPFSGSVPPV GRGEKKSLQN IMEEESRLRA ASSKEPTKIP KVGIARPSET FKTLPRSVPP IIQPLGSPCS RKSAPRALPT ATVRTAMPVS MLPCSATSYS PANSSPKALY SLGDFWSPKK AAPRPVIPAV ASWASPRSSP SNSAVTTPPS QSLKDIQQQE EDLKTKQVPS CGENGRWYIE RRERADSLTE IQAKEAREKE FRRFVEEQRE IEKSMMHELV TSKEADNKNP ARKRIHSKGK RR
|
| |