Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47465 |
Symbol | |
ID | 7202479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 697340 |
End bp | 699559 |
Gene Length | 2220 bp |
Protein Length | 731 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181785 |
Protein GI | 219122922 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCGATGTA CGCAAGTTTA GAGTATGTTC TGGGATAGAG AAGCTACCAT CCAGGTCCGA GGTGAACCGA CCGACCGGAA GGTCGATGAC CTACATGGTT CCACTGTTAT ACGAGTAGAT AAAGTTATTG AAAGGCGTCA GCACCTTCCT GTTGAGAGTG TTGAATTGGA GGAGCGTGAG CAACCTTGTG TTGCAACACC ACTGACCGAA GGCGTTGGTC AGACCTGTCA TTTTAGCTCC AGTATAAAAA ATCAAAAGAA GGTTCTATCT GATAACGGCG AGACTGGTAT GCCAGTTGAA GAAGAGCCAA ACTGGTTAGA TATTTCGAAT TGGTTCACTG CCTTAGGGAA ACAAGGGCTA GAAGACCACC ATCAACCGAA AGCTCGAATT GATTTCATTG AGATTGATAC GAAGGTTGAA GACCACGAAC TGGTTCAATT ACAAGAAATG TATCTTCCAA CTTGTTTGAG ATGGCTTGAC CCAACCTTCC AGCCTATCGA CGACCTCGAA GGTGCGACAA CAACAAACGA GCAATTGTCG GCGGGCGTGT CAACCAGAAT AGGCTTTTCT GACGTACAAT CCGTTACTTT GGCCAATGGC GAAGGGTCCA TTGCAGATTT GGTAGCCAGC AATTCTTTCG ATCTTGGAAA AAATGAAGAT TTTCCCGAAA GAGTAAAACC TACCTTCGAC GTCGCAAACA AACTGGCCGA TACCAGCAAG TCAGAGGCGA CTGCTACTAT CAGAACGAGC GACAATCGTC TTTGTTTGGC TAAAGGAAAA AAAGCGATGT CGGAGGTGCA CAATGAGGAG CGCCGACAAA TTCTTGTCAA GGAATTGTTA TCATCGATAT CCACTTACGG TCGCTACGAT CCTCGTGTCG CCGACGCCAG CGCTTCACTA GGTGATCATC TCGATGAGTC AGGTGAGCAC AAGCAATCAC TCAAGCTTTA CCGAGACGCT GTATCGATAT ATAGTTCAAA GCTCGGTGAT GACCACAAGA AGACAATGGA TGCCCGTGTC AAGCTCGGTA GGATCCTTGA GCACGCTGGG GAATACAACG AAGCCATCAA CACGTACTAC CTCGCCACAG TCATGCGCAA AGCGGTGCGC GGAGAAAAAG ACCCTGCAGC AGCGGATTCT ATCGTCTGCA TTGCGCATAC TTTACGAAAA AAGGGCGACT ACCACCAAGC CATCAAGGAA TTAAAGCGCT CCCTCAAAAT ATACCGTGAA TCCCTGGGAG ATGCCCATCC GAAAGTCTCT AGTGCTGTGG ATGAGATTGC CTCATTGTAT GTTACACTAG GAGACTTCGA CAAATCTGCT GCGATTCTCG AGGAGGTTGT CAAACTCAAG GCGGCGACGC TGGGTATGAA TACCAAGGAA GTAGCTTCTA CGTTGATCAG CCTAGCGACG ACTTACGAAT GCTCCGAGCA AGTTGAAAAA TCCTTGAAGA CATTGAAAAA GGCGTACAAG ATAGAATCTG AGATTGGCGG GTTTTCCTCC GAGGGAGCCA TCAGTATTCT GAACCGTATC GCCATGCTAT ATGAAGGAAC GGGTGACTAC AATCGTGCCT CAATAGCTTT CCTCGGTGTG CTTCGTGGAC AGAAGAGTAT CTATGGGGAG GAACATCTCG TCGTCGGCGA AGCATACTAC AAGCTCGGAT ACTCCCTCCA TCAAATGGGT CACATCCATA AAGCTCTTAA GTGCATGAAA GAAGCCCTCC CTATTTTTGT GCGTGAAGGC ACCGAAACCA GTGACGTCGA GCGCATTGCC GAGATTTTGC ACGAGATGGG TCTCATGAAC AAGGAAATGA AGAACTTTCA CGAATCTACC TGTATGTTCA AACAGGAGCT AGGAATTCGT CGGAAGATTG GTCAGAGCGA GTTCCCTCTG ATAACCCGTG CATTGAACCA GTTGGGCGTG GTTGAGTTTG AGATGAAAAA TAGCTCCCGT GCTCTCAAGT ACCTAGTAGA AGCTCTGAGC ATAATGCAAA AGCACGGTGA TCCAGGTTTG GACTGTGCTG AAGTATTGTA TAACTCTGGG TTGGTTTTTG AAGTGTGCAA CAACAAGGAC AGAGCTTTGG AGGCTTTTGA AGAATCTGTT CGCATCTTGA TGAAACTTGG ATTCGAGGGC GTGCACCCAC AGGTGGTGAA GGCTCAAAAC AAGATTGAGA TGCTTCAAGA TAAGAGAAAG CAGCGAGGAT ATTGGACGCC TGGACAGTGA
|
Protein sequence | MFWDREATIQ VRGEPTDRKV DDLHGSTVIR VDKVIERRQH LPVESVELEE REQPCVATPL TEGVGQTCHF SSSIKNQKKV LSDNGETGMP VEEEPNWLDI SNWFTALGKQ GLEDHHQPKA RIDFIEIDTK VEDHELVQLQ EMYLPTCLRW LDPTFQPIDD LEGATTTNEQ LSAGVSTRIG FSDVQSVTLA NGEGSIADLV ASNSFDLGKN EDFPERVKPT FDVANKLADT SKSEATATIR TSDNRLCLAK GKKAMSEVHN EERRQILVKE LLSSISTYGR YDPRVADASA SLGDHLDESG EHKQSLKLYR DAVSIYSSKL GDDHKKTMDA RVKLGRILEH AGEYNEAINT YYLATVMRKA VRGEKDPAAA DSIVCIAHTL RKKGDYHQAI KELKRSLKIY RESLGDAHPK VSSAVDEIAS LYVTLGDFDK SAAILEEVVK LKAATLGMNT KEVASTLISL ATTYECSEQV EKSLKTLKKA YKIESEIGGF SSEGAISILN RIAMLYEGTG DYNRASIAFL GVLRGQKSIY GEEHLVVGEA YYKLGYSLHQ MGHIHKALKC MKEALPIFVR EGTETSDVER IAEILHEMGL MNKEMKNFHE STCMFKQELG IRRKIGQSEF PLITRALNQL GVVEFEMKNS SRALKYLVEA LSIMQKHGDP GLDCAEVLYN SGLVFEVCNN KDRALEAFEE SVRILMKLGF EGVHPQVVKA QNKIEMLQDK RKQRGYWTPG Q
|
| |