Gene Information |
Locus tag | PHATRDRAFT_46493 |
Symbol | |
ID | 7201826 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 503811 |
End bp | 506227 |
Gene Length | 2417 bp |
Protein Length | 787 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180844 |
Protein GI | 219120200 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0749134 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGTTTCTGTT CTTGAGGCCA CATCCACTTG AAACCTTGCT TGGTGTTCAT AGCATGGGAC CCAACGCGGC CGAAGACGCC GAGGCAACGG CTTTATTGCC GAGGCGAAAA GGGAACGAGA TCGGTTCCTA TCAATACCCG ACAGCCCCTT CGTCAACGCT GGAAGAGGGC CATTACTCGT CGCCTGAGAG TAATGGCAGC AGAAGTGATG GTAGGGATCA TGCCGTTCAT TTTTCTTTTA CGAGGCGCGA CATTTCGAAC AAATTTGTTG AAGAATCCTG GTGTAATCGT TTGGGAAGGT CATGCCTTTC GTGGACTTTG CTCCTGATCT TTTTCATTCT TGTTTTTGAA GGATGTCTTA TATACCTTTC TTATCGAACG ATGTCTCCAG CGGCCGTGCC AATGTTAAAT CTGTATGATT ATATTGTTGT GGGTGGTGGT CCCTCTGGTA TAATTGCAGC GACAAAGCTT GCACAATCCT TTCCGACACT TCAGATATTA CTGCTCGAGT CCGGGACCGA CAGTCAAAGT TCAGTTCTCA AACAACAATC TATACTAAAA GAAGGTGCAA CAGTCTCCGC GGCCGAATCT GGTAGCACAC TTTGGCAAGA GGACGCTTAC CAACTCAACA AATTTGACGT GCCACTACTG TGGAGCGGCG TCGCAAGCAG TCGAGGTAGA CGGGACGTCC TACATCTGCA AGCCCCGTCT TGGTCCTCCT CGCATCACTG GCCCATAGAT AAAACACTTA TGGGGCGTGG TCTAGGCGGA TCCGGGCTGC ACAACGCAAT GATTTACGTC CGTTCGCTGC CGACCGATTT GGAAGCGTGG AATGTTACGG GGTGGACCTA CGACGATATT CTACCCCACT ACGTGGCATT GGAGCAATAC GTAGAGGATC ACATACCGTC ACAGCCTTTT TGGACAAACG ATCAAGGATC CACAATTTCG AAGGCAAACT GGCGCGGTAC TACTGGTCCG ATACGAACTA TACCTGCCGG TAGTGCGGTG GACGCCTTAG CACCGCTCTT CGTACAATCG GCAGTTATAA GTGGCGAACG GTTAGCCAAG CGTGGCTTCA ATCACCCTAG CCCGGCGGCT CGTCTTGGTG CAGGGTACTA CGAATTCAAT ATTCGGCATG GTGTTCGCGA CTCGGTTGCG CACGCCTTGC TCGGAGGACA TAAGGCAGTA CCAAGGAATC TGATAGTTCG TACAGGTCTG ACTGTTACGA GAGTGACAAC CAAGCCGAGG CGGAACGAAG TACCGCGAGT GACAGGCATA GAGTATTTTC ATAGTGCAAC TGGACGGATG GGGAAATTTT TGCTGCGTTC GGATGATGTT TCCGAAGTTA TCCTGGCGAC GGGTGCCATT ATGACTCCTC AATTGTTGGC CAACACTGGT ATAAGACCTG GAGGATCTGT TGTACACCTT CCAGGCGTGG GTCGGAATTT GCAAGATCAC CCCGTAGTGG CACTCAAATT TAAACTGGTC GCAGAAATGG AGCAGGACGC CTCTTCAATT TATACTCTTG GAGATGAAAT GGAAGACTAC GTCTTATCTG TGGCTGGTTT GGAAGATGGC CAAGCTAAGC ATAAGAAGCT CTCCAACTCA AGTCTCTCTT TGCAGCAAGC TTTGTACAGT CGTCTTGGCA CACTGGGAAC GGCCGGATTT TCGGCGGGCG CGTTTTTGCA GTCACCATTC GCCAAGCACG ATGTTCCCGA TCTTCAAGTG ACCGTATTTC CTCGCGAAAT AGAGCCGCAT GTGACTCGAA AACAAAACGC CAACGAACGA GCCCAAATGC 
GGTGCCGGTC TATGCTGATC ACGGTCGCGC TACTACAACC GGACGCTCGG TACCAGGTTG AACCACTGTT GTCGGATTTG ACTTCAGCCA ACGAAATCTT TGAGCAGACT GCTGAGACAG AGCGATCAAT GAACGCATCT GAATCGTCCG TTCCGTTAAC ACATTATCTT GGATACAATC TGCCATCCAT TGAGCTGCCG GCAGGCCGAT CAGAATATTT GTCTAAACGA GATGTGCGAG TATTGGCGTG GGGAATAGAG CGTGTTCGTG CAATCCAAAA GATGCCACCA TTATCGCAGG CGACCGGCGA TGAGCTGGTT CCTGGTGCCG AGCTAGTCGG TGAGTATTTG GAAAATCATA TTCGGGTTGA AAGTATGCCC AACAGTCACT GGGTCGGGTC GACTAAAATG GGCCCAGACA GTGATACTTT GGCTGTTGTC AACGAGCGAC TAGCCGTACG CGGAGTACAA GGACTGCGGA TTGTGGATGC CGGGGTTATT CCTCAGGTTC CGAATGGCAA TACGCACAGT ACGGTATGTG TTGTAGCTAG TCGTGGCGCC GAACTCATTG AGCAAGATCG ACGGAAGGCA AGCCAGCAAT CCAATAATCC AAACTGA
|
Protein sequence | MGPNAAEDAE ATALLPRRKG NEIGSYQYPT APSSTLEEGH YSSPESNGSR SDGRDHAVHF SFTRRDISNK FVEESWCNRL GRSCLSWTLL LIFFILVFEG CLIYLSYRTM SPAAVPMLNL YDYIVVGGGP SGIIAATKLA QSFPTLQILL LESGTDSQSS VLKQQSILKE GATVSAAESG STLWQEDAYQ LNKFDVPLLW SGVASSRGRR DVLHLQAPSW SSSHHWPIDK TLMGRGLGGS GLHNAMIYVR SLPTDLEAWN VTGWTYDDIL PHYVALEQYV EDHIPSQPFW TNDQGSTISK ANWRGTTGPI RTIPAGSAVD ALAPLFVQSA VISGERLAKR GFNHPSPAAR LGAGYYEFNI RHGVRDSVAH ALLGGHKAVP RNLIVRTGLT VTRVTTKPRR NEVPRVTGIE YFHSATGRMG KFLLRSDDVS EVILATGAIM TPQLLANTGI RPGGSVVHLP GVGRNLQDHP VVALKFKLVA EMEQDASSIY TLGDEMEDYV LSVAGLEDGQ AKHKKLSNSS LSLQQALYSR LGTLGTAGFS AGAFLQSPFA KHDVPDLQVT VFPREIEPHV TRKQNANERA QMRCRSMLIT VALLQPDARY QVEPLLSDLT SANEIFEQTA ETERSMNASE SSVPLTHYLG YNLPSIELPA GRSEYLSKRD VRVLAWGIER VRAIQKMPPL SQATGDELVP GAELVGEYLE NHIRVESMPN SHWVGSTKMG PDSDTLAVVN ERLAVRGVQG LRIVDAGVIP QVPNGNTHST VCVVASRGAE LIEQDRRKAS QQSNNPN
|
| |
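The coordinates, gene length, and protein length listed in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is not part of the source record. It assumes the start and end positions are 1-based and inclusive, which the listed gene length supports, and that the coding region ends with a single stop codon. The variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 503811
end_bp = 506227
protein_length_aa = 787

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Coding region: three bases per amino acid plus one stop codon (assumed).
cds_length_bp = protein_length_aa * 3 + 3

# Bases within the gene span that lie outside the coding region.
noncoding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, noncoding_bp)
```

Under these assumptions the computed gene length agrees with the listed 2417 bp, and the remaining non-coding bases correspond to the stretch of the gene sequence that precedes the first ATG.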