Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_34060 |
Symbol | |
ID | 7197613 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 997564 |
End bp | 999365 |
Gene Length | 1802 bp |
Protein Length | 576 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178339 |
Protein GI | 219115087 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGATG ACGAGAGTAA GGTTGGCAAT TCCAACAACG AGGCTCATGT CGAGATTACA TCCTTCAATG AAGCCCTGGT CCGCCCTATT GCGCCGAAAA CGAAAAAGAA GCGCGCGTTT AAAAAGAAGA ACCCGGCAAT GGGTGACACT GCATTTCTAC GCAAACGAAC GGCTAATCTG TTGAGTATAA CGAAGCAAAA TTCCGACAAC AAGGAAGGTG CCCTTGGGGG CGGTATGAAA GTTGATCGGA AGACCTTCCA CTTCCTCATG GATGCTTGGG CATTTTCCGG AGAAATTGAT GCTGTTGAAC AAGCGAGTGG ACTGCTGTCC CGTATGGAGG AATTGGCATC TTGGGGAAAT TTCAATATCG AACCTGACGT TCGTTCGTAT ACCAAAATGA TCAACGCCAT CAGCCGTTCG ACAGCCCCTT CTGCAGGGGA CGAAGCCGAC GCAATCTTTG CAAAAATGGA AGAGCTCTAT CAAACAGGGA GCAACCGCGC GGCTCGTCCG AATACATATA CCTATACTGC CGTAATAGAA GCTCATGCCC ATTCCGGCGC ACGGGGCAGT GCTAAACGAG CCGCAGAATG GTGCGAACGA ATGATTGATG CGTACGAACC ATCACCGCAT AACGAGAACA AAGAATCAAG TGTCCGCCCA ACTGCTCGTG CTTTTAACGC TGCAATTTCG GCATACGCCA AGTCAGGAGA AGAAGGAGCA GCGGCTCGAG CAGAACGCTT GTTTGATCGA ATGGAAGAAT TGTACGAGAC GGGAGTTGAA GAAGCAAAAC CCAACGCTTT CAATTTTAAC TCGCTCATTA CAGCATGGGC TAACTGCTGC GAGGAAGGTT CAGCACAGCG CGCGGAGGAA ATTCTAGAGC GCATGGAGTA TTTGTACAAG CAAGGAGATG AGAAATGTAA GCCGACAACA ATATCATTCA ACGCTGTTAT TGACGCGTAC GCGAAATCGG GTGATGAATA TGCAGCGCAA AAAGCTGAAG AAGTTTTACG GCATATGGAA GATCTCTATG GATCGGGACA AAATCTTGAC GCTCGTCCGA ATGTAAGATC GTTCAATTCA GTCATCAATG CTTGGGCAAA AAGTCGAAAC GAAGAGGCGG CTTGGAAAGC GCAGGATATG CTGGATTTGA TGGAGAAGCT CTACGCTAAA GGTAACAAAG AAGTGCGACC AGATGTACAC AGCTTCTGCA CGGTAATTAA TGGTAAGGGA CACACAGATT GAAAAATACA CGTCTGGCAC CGGCTCCACT CACACTAAGT TTCTGTATAA CAGCTTGGGC CCGGAGTCAA CAGCACGGTA AAGCTGAGCG AGCCCTGAAT TTGTTTCGTG AAATGAAGCA GCTTCATGAG GCTGGTAACA AGCACTTGAG ACCGAACACG GTAGCAGCGA ATGCGGTAAT GAACGCGTGC GCGTATACGT CCGGAGATGT TCATGAACAG AACCGAGCGG TAGAAATCGC TCACACTATT TTAAAGGAAC TGGAACAATC TCCTTATGGA AAACCGGACC AGGTGACCTA CGGGACTTTT CTGAAAGTAT GTGCAAATCA AATGCCGGAC TGCAGCACAC GCAATCAAGT TATTTCCGTC GTTTTCAAAA AGTGTCAGAA GACGGGGCAG GTGGGGAATT TTATTCTACA GCAGCTCAAA GCCATGGCGT CAGAAGAAAC ATATATGATG TTGTTGGGTC GAGGGATCCA CGAAGACATC CAGATAGCGG ACCTCCCCTC CGAGTGGTGG TGTAATGTTG TCGAGAACCG GTGGAGGCGT CGTGGAAACT AA
|
Protein sequence | MDDDESKVGN SNNEAHVEIT SFNEALVRPI APKTKKKRAF KKKNPAMGDT AFLRKRTANL LSITKQNSDN KEGALGGGMK VDRKTFHFLM DAWAFSGEID AVEQASGLLS RMEELASWGN FNIEPDVRSY TKMINAISRS TAPSAGDEAD AIFAKMEELY QTGSNRAARP NTYTYTAVIE AHAHSGARGS AKRAAEWCER MIDAYEPSPH NENKESSVRP TARAFNAAIS AYAKSGEEGA AARAERLFDR MEELYETGVE EAKPNAFNFN SLITAWANCC EEGSAQRAEE ILERMEYLYK QGDEKCKPTT ISFNAVIDAY AKSGDEYAAQ KAEEVLRHME DLYGSGQNLD ARPNVRSFNS VINAWAKSRN EEAAWKAQDM LDLMEKLYAK GNKEVRPDVH SFCTVINAWA RSQQHGKAER ALNLFREMKQ LHEAGNKHLR PNTVAANAVM NACAYTSGDV HEQNRAVEIA HTILKELEQS PYGKPDQVTY GTFLKVCANQ MPDCSTRNQV ISVVFKKCQK TGQVGNFILQ QLKAMASEET YMMLLGRGIH EDIQIADLPS EWWCNVVENR WRRRGN
|
| |