Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45153 |
Symbol | |
ID | 7200334 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 334174 |
End bp | 336073 |
Gene Length | 1900 bp |
Protein Length | 523 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179188 |
Protein GI | 219116787 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.978768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACTTAGCTT GTGAGGCAAC CGCCATTCTA TCGCCGCGTT CACTATTTAC TGTTAATTTC ATTGTTTGGA ATTCTTACGG GTCGCGCCAA TTTCTTTTGA TTCCGCAGGA GACTCGGATT GACACTTCAC CGTAAACACG CAATATGGCG GCCTCGTTTT ACAGTGGCGA TCAGGTTGGA GCAGTGGTAG CCGACGTTGG ACATTATTCC ACCAAGATTG GTTGGGCCGG TAGTGATACG CCGAAAAGCT ACTTTCGATC GGTACGTAGG GAGCGATCAG AGAATATATA AACGTATGCA AGCCGTCACA AGGAATTGCT TCTCGCTCAC GAATGTCAAA TTTGTCCTTA CTTTCTTGCG TTATCACTTT CTCAGAATGT TGCAGTGTCC CGAGAAGAAA GCGATGGGAA CACTAGCGGC AACAGCAACA GTCAACAGGA TGATCCTTCT CTAGCACATC ACCACAGTAT ACAGAAAGCG AATTACGACT ACTTCCACGG TCCTATTGAT CCCAAAAGCA AGTCGGACGG GACCTATAAA GTAGTCAATC CAGTGCATAC CACGACGGGT CTCTGGTACG ATACCGAGCC CAAAGTGGAC GGCTCGGACT GGAACGACTT GTTACCTACG TTCCTCACTC ACGGCTACAA AGCCTCTTTA AAGGCGGATC CTTCGGAGCA CCCCTTGCTG TTGGTGGAAC GCGCATACAA CACTCCCGCC ATTCGACAGC AAACGCTCGA AGTCTTGTTC GAAACCGTCG ACGTCCCAGC CACGTTTCTG GGTAAGGATG CCGTCTTGAG TTGTTACGGC TGCGGACGGA CGACGGCTAC CGTGGTTGAC GTGGGGTCGA GTCATACCAC GGTTACCCCC GTCTTTGAAG GTTTTGCAGA AAAGGCCAAC ATCAAGGTCA GTCCGATCGG GGTACACGCC ATGGACGAAC TGATTCTGCA ACAGTTGGAC CAATTGCACA AGTCCCCCAT CCTGCCTCTG TATCGATTTC GGCAACCCGA ATTGCAGCGT GGTAAGGATA TTTACCACGC AGCTCGACTA GCCTTGGCAC AGGAATGCCG GGAACTTGGG GCGGGCGCCG CCATCAACAT GGCTCCGGCA GCCGCCACGG CAACCTTCCA CGCCCCCCAC AAAACTTTTT ACCTTCCGGA CGGGACCGGA GTGGACGTTC CCTCGAAGGT TCGCTTTGCC GTTGCGGATC TGTTGCTCGG CGATGATCCC ATATCCGTCA CGCGACGACA AGAAGCACTA CAAGCTCAAC AAGCCGAGCT AGACGAATTG CTCCAAAACG CGGAGGCTCA ACCCAGCGAC GGCGATGGAG CCGACGACGA GTTGTATTCC GAAGCGGCCG CAGTTGGCTT GTCCAAACGA CGAACAAAGC GGCGAAGCAA GCCCGCCGCG AAACGTCGAT TTTCCAACCG TCCGCTCCAA AAAGCGTGCG CTCCGCACTG GCATACCCTC CGCACGCAAC TCACGGCGGC ACCGCTCGCA CAAATGGTCT GCGACGCCGC CTACCGATGC GACCGTGATC AGCAAGGCGC CCTGCTGGGT AACGTGGTAT TGGGTGGGGG CGGATCGTGT CTGGGACCGA CCGAACAAGC CTTTCCGGAA ACGCTACGGG AAAATATAGA GAGTTTGATT CATGTGCACA CGCCGGGGTG GAAGGTCAAG CTGTTGGCAC CCGCGGTGGC GGAAAGAGCT ATCTTGAGCT GGCTGGGTGG AAGCATTTTG GGTAGCCTCG GAACATTTCA TGATATGTGG ATTACCAAAG CCGAATACGA AGAATGGGGA TCGGCTATTG TAAATAGGAA GTGTCCATAA ATGTACTAGA GCGCTGGAAG ACGGCAGGGA ATAAGTTATG CAGCTAATAA ATTGTCTTTT TGCTTGTGCC
|
Protein sequence | MAASFYSGDQ VGAVVADVGH YSTKIGWAGS DTPKSYFRSN VAVSREESDG NTSGNSNSQQ DDPSLAHHHS IQKANYDYFH GPIDPKSKSD GTYKVVNPVH TTTGLWYDTE PKVDGSDWND LLPTFLTHGY KASLKADPSE HPLLLVERAY NTPAIRQQTL EVLFETVDVP ATFLGKDAVL SCYGCGRTTA TVVDVGSSHT TVTPVFEGFA EKANIKVSPI GVHAMDELIL QQLDQLHKSP ILPLYRFRQP ELQRGKDIYH AARLALAQEC RELGAGAAIN MAPAAATATF HAPHKTFYLP DGTGVDVPSK VRFAVADLLL GDDPISVTRR QEALQAQQAE LDELLQNAEA QPSDGDGADD ELYSEAAAVG LSKRRTKRRS KPAAKRRFSN RPLQKACAPH WHTLRTQLTA APLAQMVCDA AYRCDRDQQG ALLGNVVLGG GGSCLGPTEQ AFPETLRENI ESLIHVHTPG WKVKLLAPAV AERAILSWLG GSILGSLGTF HDMWITKAEY EEWGSAIVNR KCP
|
| |