Gene Information |
Locus tag | PHATRDRAFT_47445 |
Symbol | |
ID | 7202567 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 640416 |
End bp | 642119 |
Gene Length | 1704 bp |
Protein Length | 538 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181772 |
Protein GI | 219122895 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAAG CATTTCCCAA TTTGCGAATA CGCATCAAAG TGATTCCCAG ATTTTTTCGC TGGTATGCGT GCCATGGCCT CGTGTTCGTA GGCCTCCTCG GGGGCTCAAC CGTGGCTACC GCGAAAAGAC GAGCCGATAC CCGCCTGAGT ATTCCTAGAC AGCTGAAGCG TTCACTGAAA GACCTAAGAG AGGCTGGTTC CAAACGGAGC AGGGAAATTT TTCAAAACGT AAGTTCCGTA CGTGCAGGTG GCTCTGCGTT TGATATGGAC CAGTTTGGCC GGTCGGTATC GACTGTACTG GGATTCATAG CGGGAACTCA GACACTGGGT ACGATACTGA CTGCGAATAA GAATGCCATC TTTGATCAGG TGTGTGATCC GAAGTCGTGA TTATGACGGC CGCAAAAATC GGTTTGTTGA CAGTGTCTCA ATTTTGTCTG TTACTTTCGT CTCCAGTTGG CTGTCCGTCT ATTGGAACCA GACAGCGTCG CTGCGATGCA TACGGTGAGA AAAATTCTGA GCTCTGTTGA CGATCACGGG ATTGTTGACG TCTTTGCAAA GTACCGTAGT AGAGACGTGC TACTGTCGAT CTACGCATTG AGTCGCCTCC AAGACGCCGT GGCAGAAAAC GATCGTCGAC GAGAAACCTA TTCAGCGTTT CTTGACAACG AGCTGATCAC TGACTTAGCC CATTACTCTG TTTACGCCAG TGTGGCTTAT GGTTGGAAGA TGCATTTTGC CTTCGGAGGA GGCTTGCACC TAGGGGATTT ACAGGTGTTG CTGAAGCGGA CAGGAATTTT TCTTGCAGAC TTGCTCGAAC ACAAAAAGGA ATCCAAGGCG CATCGTCCCG CTTATTTCAT CGTGAGGGAT CGATCCAGAC GCAAGTTAGT GTTGTGTATA CGAGGGACTC TGTCAGCACA CGACCTTTTA ACTGACCTTT GCTGTTCGCC AGATGAGTAT GAATTACCAA GGTCGACGTC TCGATCGCGC ATCAAAACAT TATCGGATTA TTGGTGGAAC GGCGGAAGCG CACATATAAA GATGCGTGCT CATCAAGGAA TGCTGCAAGC TTCTCGTTTG CTCAAGAAAG ATGCAGAGGA TCTCATTCGC AGCCACCTTA AGGAAAATCC CGGTTTCTCT CTGGTTCTCG TAGGGCATTC CATGGGTGGT GGTGTCGCAG CTTTGCTGGG CACATTGTGG GAAGACACAT TCGAGAATCT CCAGGTTTAC GTTTTCGGCC CCCCCTGTGT TTCGTGCTTT GGCGTGGCAC CTACCGGTAC CCGGAACATT GTATCCGTGA TTTCGGATGG TGATCCCTTC CGAAGCTTTA GTCTCGGACA CGTCGCCGAC TTGTCCATTG GCGTGGCTCT GCTGTGTGAT GATCCTCATC TTCGAAGGAT GATCCTCATG AAGACGAATG GTCGAACAAA AGAGATTGGA GCCCTTGACT TGCAATGGTG CGTACAAACC ATGAAGAGAA TGCGTGGAGA CATGAAGTCA GAAAAGCTTT TTCCGCCAGG TCGACTACTG TTGCTGTCAA GCAAAGGTGG CATGTGCAAA GTTCGAGAGG TACCTACCGA GTTTTTCGGA GAGCTTGCCA TCAATCACAA GATGTTTGAC GTCTCAAAAC ACATACCTGC GAGGTACGAG TCCATTCTTC GATCTATTCT AGAGCATCGA GGAGCTAGTC CATCAGTAGA CTAA
|
Protein sequence | MKQAFPNLRI RIKVIPRFFR WYACHGLVFV GLLGGSTVAT AKRRADTRLS IPRQLKRSLK DLREAGSKRS REIFQNVSSV RAGGSAFDMD QFGRSVSTVL GFIAGTQTLG TILTANKNAI FDQLAVRLLE PDSVAAMHTV RKILSSVDDH GIVDVFAKYR SRDVLLSIYA LSRLQDAVAE NDRRRETYSA FLDNELITDL AHYSVYASVA YGWKMHFAFG GGLHLGDLQV LLKRTGIFLA DLLEHKKESK AHRPAYFIVR DRSRRKLVLC IRGTLSAHDL LTDLCCSPDE YELPRSTSRS RIKTLSDYWW NGGSAHIKMR AHQGMLQASR LLKKDAEDLI RSHLKENPGF SLVLVGHSMG GGVAALLGTL WEDTFENLQV YVFGPPCVSC FGVAPTGTRN IVSVISDGDP FRSFSLGHVA DLSIGVALLC DDPHLRRMIL MKTNGRTKEI GALDLQWCVQ TMKRMRGDMK SEKLFPPGRL LLLSSKGGMC KVREVPTEFF GELAINHKMF DVSKHIPARY ESILRSILEH RGASPSVD
|
| |
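The coordinate and sequence fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: the function names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the sequence fields use spaces only to group bases into blocks of ten.

```python
def clean_sequence(raw: str) -> str:
    """Drop the whitespace used for 10-character grouping and uppercase the result."""
    return "".join(raw.split()).upper()


def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence field, used here only as a short demo input.
    prefix = clean_sequence("ATGAAGCAAG CATTTCCCAA TTTGCGAATA")
    print(len(prefix), round(gc_fraction(prefix), 3))  # 30 0.367

    # Start bp and End bp from the record; matches the listed Gene Length of 1704 bp.
    print(inclusive_length(640416, 642119))  # 1704
```

Passing the full gene sequence field to `clean_sequence` and then `gc_fraction` gives a value that can be compared with the listed GC content, assuming that figure was computed over the whole gene sequence.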