Gene Information |
Locus tag | PHATRDRAFT_44344 |
Symbol | |
ID | 7198033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 283724 |
End bp | 286832 |
Gene Length | 3109 bp |
Protein Length | 980 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178200 |
Protein GI | 219114809 |
COG category | |
COG ID | |
TIGRFAM ID | |
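
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption, though it matches the listed Gene Length) and that the coding region consists of one codon per amino acid plus a single stop codon. The function names `span_length` and `cds_length` are hypothetical helpers introduced only for this illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue, plus an optional stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the Gene Information fields of this record.
gene_len = span_length(283724, 286832)   # Start bp, End bp
coding_len = cds_length(980)             # Protein Length in aa
non_coding = gene_len - coding_len       # bases in the gene span outside the coding region

print(gene_len, coding_len, non_coding)  # 3109 2943 166
```

Under these assumptions the 3109 bp gene span is longer than the 2943 bp needed to encode a 980 aa protein with its stop codon, which suggests that the listed gene sequence includes about 166 bp of flanking, non-translated sequence.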
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.195415 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TCTGAGGCCG ACCATCCGGA GAAAACGCTG ACCGACACAA GCCACAATAA TTGTAAAGGG CATCAATAAC ATTTATCGAT GATCACGGTT CAGAAGGCAT CGCTGGTATT TTGGTGCGCC ATCGCTTCTA CTTTTTCCCG GGTAGTTCAT GGTCGAACTA ACTGTGAGCA GCTGTTCGGT TCAGTAATTT CGGCAATTCC TCGTGGAGGT GAGATCGCGG TGCCGTCAGT CGCGCCGTCC TTCTTACCGA CAGGTGGTCG CTCAGACGAC ACGCACAGGC GAAAGAGAGT CGTAAAAAAG AAGAAAGTGC ACAAAGGGGA ATCGAAATTG CCGATAACTC AAGACGATCG TTATTCCAAC GGCGAAAGAA AAGCAAAGCA AAAGAAGATA AAGAAAGTGA AAAAGAGAAA GGTTTCGCAG CCGGAATCTT CTGAAAGTTC TCCGAGTAAC GTTGCGAAGC AGCTGGAAGC GCCGCAATCA AGAAAGCAAA AAAGGAAAGT AAAAAAGAAA CACATTGATA CGTCCAATGA GCCCGCTGTC GCTTGGGCGA AGAAGGAATT TCGATCCCGG TCACCATCGG TAAAGGGGGG ACCTGTGAAC TCAGTGGTAA GCGAAAAGCA GAATGGTTTG CAGAAGTTAC CGCAGAAGAA AAGTAAAAAG GTCAAAAAGC GCAAAAAGGA GGCCGGCGAA GGGAAGGAGC AACCGTTATC TCACCCGGAC CAAAGAATGC CCCCCGGGAC TCCGGCTCAA GACTCTTCGC AAAGTAAGGG CTCAGAGAGC AGTGATGACT TGCCGATTAA TTACTCCGAG AGAGCAGAGA GAGAAGAGAC TTCTCTGGTT GAGAAGTCTG AAGCACAAAT CGCTCCAATC GGACAGCAAG TGGATGAAGA CTCAGAGCGA CCTGTCGAGC ACACAGTCGA AAAACGAGAC GGAGATTCGA CTTCAGGATC GCCAACACAG ATTGACGCGA ATCTCAATTC AGCAAGCAGA TTGGTCGACG TCGAAGTCGC CCAATCTCTT CAGAATGACA CGAATCTCGA CGATAGGAAA TTACACCAAA CAGAACTGAA GTCTTTTGGG ACTGAAGATT CATTCCAGGA TACTTTTCCA GAAGAGAAGA GCACCGATCC GGTAGCCGAA GACCACAATA CACCAGTAGT GGCTCCTAGA TTGAGCGGTG AGCCCTCGAT AGACGCTTCT ATTGAAGAAG TCTCTGACTC AAGTGATACC ACCGATACAG AAGACAGTTC TAGTGACAGC GACGAGAGTG AAACCACAGA GACAGAGGAT AGTTCTAGCG ACACCGACGA GAGTGAAACC ACAGAGACAG AGGATAGTTC CAGCGACGCC GATGAGAGTG AAATTTCCAT TGATATGACG GCATCAAGCG CCTCTAGTGA TAGAACCGAA AGCTACGAGG AATCGTTAAC GGCTGGCAAT GAGACGGTCG AAGCTGATCT AGAACAAGGC TATTCATTGA ATGAGTCGAA AGACGCCGAA ACCGAATCAA ATTCAGAGGT GAAAAACTTC AAAGACAACC CAACTTTATC GGGGTTAGCA GGCGAAAGCG AGAACTCGAG TCCAAATTTT GACAAACGCA TCAAAAGTTG CAATACTGAA TACACCTCAA ATCGAAACAG CTCGTTGATT ACGGATAAAC AAAGCTTGGC AAGCACAGCA GGCGGATCCG AAGGAATTGG GGTGGATGAC GTGCAAACCA ACGCATCTAT TGCAGATCGA AACTTTTCCT CTTTGGAAGA GGGAGAGGGA CACTATGACG CAACTAGTGA CGACGAAGAC 
AAGAAATTTA AGCAACTCGA TGCTCCATCT TTGGAAGATT CGGACTATGC GGCTGGGGAA GGCAGCATGA ACAGAAACTG GAAAGTCGAC CTCTCTGAGC TCCGATCGCT GCAGGACCAT GAAGATGATA TAAATGTGTC GATTGTAACT TGGAATTTAG CTGAAGAGTC ACCTTCAGAG GAAGACGCCT CATTTATTCG ACGTTTTCGA CGCCGAAATG ATGTACAGAA GTCCAGCGAT TTCGTACTGA TATCAGGACA GGAGTGCGAA AACATTAAGC CGAGAAGGAC AGAAGGACAT CGATCTCGAG AGTTCCGGCG GTTGATGATC AAGATGTTGG GGAAACAATA TGTGCCCATA GCGCTGCATT CTCTAGGTGG AATTCAATTC GGATTGTTTT GCAAGCGATC GATTCTAAGT GAGGTTGAAA CTATCTCTGT CGCGGACGTT ACCTGCGGAA TTGGCAACGT ATTCCACAAC AAAGGCGCTA TCGCAGCATT CGTCCAGATC AAGGCGAAAC AATGTAGCGA GGGGGAAGCC ATCGGACCAA ATCGTGACAA GTCCGTACGG ATGATGTTTG CGACCGCCCA CATGGCGGCT CACGTGAAGA ACACTGAGGC TCGAGACTCT GATTTCTGGA GAATTGTGTC TGAGCTGGAA GCGCAAGCGC CGCCGAGATT TCTCTCATCA AATATTGTCG AGTCTAGCAA GGAAAGGGAA TGCTCAGGAT CAAAGCTTCT AGAATCAATG GATCGCATTT TCTTTTGTGG GGATCTTAAC TACCGAGTTG ACCTTCCTCG CGAAATTTCT GAGCACACTC TGCTTCAGAT GAAGCGCCTC CAGGAGATCG GAGACGAAAA GTCTTTACAA AAGGCCGAAC TCTTGCGATT AGAGCTCTTG AGACACGATC AACTCATCTG TAGCATGTCT GAGAAACGAG CCTTCCCAGG CTTTGCGGAA GGAAAAATAT CCTTTGCGCC GACTTTTAAA TTTGACAAAG GCACACCAGA GTACGATAGC TCGTATAAAC AACGCATACC TGCATGGACA GATCGCGTTC TATTCAAACC CATCGGGACG CGGGTACTGG AGTATGATAG CATCTCGGAT GCTCAGCATT CCGATCATCG TCCAGTCTAC GCCACGTTTC GCGTCAGTCG TCAAGGGCGG CAAGTTCCCA AATCGAAGCC GAGAACAAAG AAGCGAAGCC GTCGGAAGTG AACGCACATA TACCTACAAG TTAGCTCCAA GTTACTGGAT TTTAAGATAG AAGCCATTTA GTGTACGATA ATCGCCGGAT ACACCTGAG
|
Protein sequence | MITVQKASLV FWCAIASTFS RVVHGRTNCE QLFGSVISAI PRGGEIAVPS VAPSFLPTGG RSDDTHRRKR VVKKKKVHKG ESKLPITQDD RYSNGERKAK QKKIKKVKKR KVSQPESSES SPSNVAKQLE APQSRKQKRK VKKKHIDTSN EPAVAWAKKE FRSRSPSVKG GPVNSVVSEK QNGLQKLPQK KSKKVKKRKK EAGEGKEQPL SHPDQRMPPG TPAQDSSQSK GSESSDDLPI NYSERAEREE TSLVEKSEAQ IAPIGQQVDE DSERPVEHTV EKRDGDSTSG SPTQIDANLN SASRLVDVEV AQSLQNDTNL DDRKLHQTEL KSFGTEDSFQ DTFPEEKSTD PVAEDHNTPV VAPRLSGEPS IDASIEEVSD SSDTTDTEDS SSDSDESETT ETEDSSSDTD ESETTETEDS SSDADESEIS IDMTASSASS DRTESYEESL TAGNETVEAD LEQGYSLNES KDAETESNSE VKNFKDNPTL SGLAGESENS SPNFDKRIKS CNTEYTSNRN SSLITDKQSL ASTAGGSEGI GVDDVQTNAS IADRNFSSLE EGEGHYDATS DDEDKKFKQL DAPSLEDSDY AAGEGSMNRN WKVDLSELRS LQDHEDDINV SIVTWNLAEE SPSEEDASFI RRFRRRNDVQ KSSDFVLISG QECENIKPRR TEGHRSREFR RLMIKMLGKQ YVPIALHSLG GIQFGLFCKR SILSEVETIS VADVTCGIGN VFHNKGAIAA FVQIKAKQCS EGEAIGPNRD KSVRMMFATA HMAAHVKNTE ARDSDFWRIV SELEAQAPPR FLSSNIVESS KERECSGSKL LESMDRIFFC GDLNYRVDLP REISEHTLLQ MKRLQEIGDE KSLQKAELLR LELLRHDQLI CSMSEKRAFP GFAEGKISFA PTFKFDKGTP EYDSSYKQRI PAWTDRVLFK PIGTRVLEYD SISDAQHSDH RPVYATFRVS RQGRQVPKSK PRTKKRSRRK