Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22803 |
Symbol | |
ID | 7195145 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 484210 |
End bp | 486114 |
Gene Length | 1905 bp |
Protein Length | 608 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183362 |
Protein GI | 219126224 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTTTC CCTACTACAA GGAAAGCAAA ACGGGACGTA TCCTGTTGAC CGGTCTCCTC GGACTCACAC TCGCCAACTC GGGCGTTTCC GTGCTCTTCA GTTACCTCGG TAAGGACTTT TGGAACGCTC TCAGTGCCAA GGATACCGCA GACTTTTACA ACGTCCTGCA AAAATACCTC GGCGCTTTGC TGCTCGGGGC CCCCGTCGCT ACCTTTTACA AGTACCAGCG CGAACAACTC GCCGTACACT GGCGCGAGTG GATGACGGCT CGTACCTTTT CTCTATACGC ATCCAATCGC GTCTACTACA ACATTGAACG ATCCAACGCG ATCGACAATC CCGATCAGCG CATCGCCGAA GACGTCAACA CCTTTACGGC CTATTCCTTG CAACTCGTGA TCACCCTATT GACCTCCTTG ATTGATCTAC TCTCCTTTGC CACCATTCTC TGGAGTATCT ATCCACAATT GTTCGGCGCC ATTATACTCT ACGCCTTTGG TGGGACCTTC ATCACTACCT TATTGGGACG CTCCTTGGTT TCGCTCAACT TTTCGCAGTT GCAGAAGGAA GCCAATTTCC GTTACAGCTT GGTACGCTTA CGGGACAATT CCGAATCCAT CGCCTTTTAC GGCGGCGAAG ATTTGGAAGG ACAGGCCATC GAACGCCGTC TCGAAAACGT CATGGGCAAT CAACGTAAAA TCAACGCCGC CCAACGGAAT CTCGAACTCT TTACCAACAG TTACCGCTAC CTCGTCCAGA TTCTGCCCGT CGCTGTCGTC GCCCCCAAGT ACTTTGCCGG AGAAATACCC CTCGGTGTCA TTTCCCAATC GGTCGGCGCC TTTAACCACA TTCTCTCCGA TCTCAGTATC ATCGTGAATC AGTTTGAACG GCTCAGTTCC TTTTCCGCCG GTATTGAACG CTTGAGTGGA TTCTACCAAG CCATGCGGGA AGCCGATTTG GAACGCGCCG ATGATGGACC CTTGTTGTCC CTGACGAACG CCACGGATGC TGCGGAACAT TCTCCGGCAG TGTGCGATCC TTTGAACGCC TACGGTCGCA TCAGCTCCCG GACCTTTGAC CCTCACAACG GCGCCTACCG TGACCGCACC GTCTTGTCCA TTGAACACCT GGACTTGTGC ACACCCGATC AGAAACGTTT GTTAATCAAG GATCTCAATT TACAGCTCCG GGAAGGCGAA AATCTATTGA TTGTTGGAAA CAGCGGCGCC GGAAAATCGA GTTTGTTGCG TGGGATTGCC GGATTGTGGA CGGTCGGGAA CGGTGTCGTC AGCCGACCCG TCGATGAGGA AGTATACTTC TTACCCCAAC GACCCTACTG CACCATTGGG AGTCTCCGCG ATCAGTTACT CTATCCCGCA ATCAACGCGC AGGAATACGA CGGGGCCGAA GCCAACGGTC AAAAGATTGT ACCGCGCTCC CACATTCTAA AGGACCAATG GACGGACGAG GAGCTGTTGC TGGTACTGGA AAAGGTCGAT CTCGTGGAAG TGGCCGAACG CGCCGGAGAC GGCGACGCAA CCAAGGGTTT GGAAGCTGTC TTGGACTGGA GCAATATGTT GAGTTTGGGC GAACAACAGC GATTGGCCTT TGGACGATTG CTGGTCAACC GACCCCGTCT CGTAATATTG GATGAAGCCA CATCGGCCCT GGACATGGTC TCCGAAGCGC GCATGTACAA CGTACTGAAG AATATGGCCC GCAAGGAATT GACCGGTAGC GCCAAACTAT CCGCTCCCGG CTTGACGTAT GTAAGTGTAG GACATCGACC AAGCTTGTTG GCCTATCACG ACAAACGACT ACGGCTCATG GGTGAAGAAG ACCACGAAGT GACAACTGTT GAGAAGGAAC AAGTCCAACT TCAAAACCAA ATACAAAATC TGTAA
|
Protein sequence | MAFPYYKESK TGRILLTGLL GLTLANSGVS VLFSYLGKDF WNALSAKDTA DFYNVLQKYL GALLLGAPVA TFYKYQREQL AVHWREWMTA RTFSLYASNR VYYNIERSNA IDNPDQRIAE DVNTFTAYSL QLVITLLTSL IDLLSFATIL WSIYPQLFGA IILYAFGGTF ITTLLGRSLV SLNFSQLQKE ANFRYSLVRL RDNSESIAFY GGEDLEGQAI ERRLENVMGN QRKINAAQRN LELFTNSYRY LVQILPVAVV APKYFAGEIP LGVISQSVGA FNHILSDLSI IVNQFERLSS FSAGIERLSG FYQAMREADL ERADDGPLFS RTFDPHNGAY RDRTVLSIEH LDLCTPDQKR LLIKDLNLQL REGENLLIVG NSGAGKSSLL RGIAGLWTVG NGVVSRPVDE EVYFLPQRPY CTIGSLRDQL LYPAINAQEY DGAEANGQKI VPRSHILKDQ WTDEELLLVL EKVDLVEVAE RAGDGDATKG LEAVLDWSNM LSLGEQQRLA FGRLLVNRPR LVILDEATSA LDMVSEARMY NVLKNMARKE LTGSAKLSAP GLTYVSVGHR PSLLAYHDKR LRLMGEEDHE VTTVEKEQVQ LQNQIQNL
|
| |