Gene Information |
Locus tag | PHATRDRAFT_32830 |
Symbol | |
ID | 7197466 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 897041 |
End bp | 898313 |
Gene Length | 1273 bp |
Protein Length | 393 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177712 |
Protein GI | 219111921 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAA GCAAGAAAAC TATCACAATC AACTTGACCA GAAATGACAG TAAAATTGCT TCGGCGAGAG GAGCAGCATC GAAGCCTTAT TCATTCCATG GCGGTATGAC GACGACTAAT CTTTTGTTCG CAAAGGCGGG AAAACCTTTA CAAGATCGGG CCGAAACTGT AGCCTTTATG AAAGTGCTAA TGGGTCAAGA GCAAGCAGTC GATGGCTGCA ACGACGGTAC ATCTTCGGCA CCACGTCCCC AGCGTCCTCA ACTGTCAAAC GTACATGGCG CCGGGCTTCT GCTGATCTCT GATATACCAG ATTCATGTAG AAATAAACAG GGCGTCGTTG TAGGAATAGA CGAAGCAGGA CGAGGCAGCA TCCTCGGACC CATGGTTTAC GGTGCCGCCT TCTGGAACCC GTGCGATGAG GACCGCATTC CGAACGATTT CAATGATTCC AAACAGCTTT CGGAAGACAA GCGTGCTATG TTGCTGCACA AAATTATGTA CGACACTCCC GAGATGGGAT TTGCCGTGCG CGTCTTACAC GCCAGTGAAA TTTCCCGAAA CATGCTTCGC ACCGAATCAT ACAACCTGAA CCAAATGTCA CACGACGCGG CGGCTGGTAT CATTGAACAT CTTCTGGAAG CGGGAGTTCA AATCGGGGCT TGTTTCATCG ATACGGTCGG AAATGCGGAT CACTACAAAC GAAGGCTGCA GCAAGAATTT CCAGGCATTG ACTTTACTGT AGAATCCAAG GCTGACGCCA AATACCCACC TTGTTCCGCT GGATCTGTCG GTGCGTGTGT TTGTGCGATT AGACCGCGTG TTTTGAAAGG ATTTGCTTAT TTGCATTCAC ATAATGCTAA CCATTGCATC TCGTGTCGCA GTGGCTAAAA ATGTTCGTGA TCGCATGATG GAGAGTTTTC AGTATTCGGA GATGAGTCTG AAACGCGACC CGAAATTTGG CTCTGGCTAT CCCTCAGATC CTGTGTGCAA AGACTGGATG GAAAACAACC AAAATTGTAA GGTATTTGGA TATCCAGACG TTGTGCGATT CAGTTGGAAC CCGGCGAAAA AAGCGTTAGA AAGGAACGCG GCTTCGGTCC TATTTCAGGC CGACATAATC GACGAAGACG AGGAAGGAGA ATACTGTATT GGAAAGAAGC AACAGCAAAC GCAAATGAGT GCATTTTTGG GGAAGCAAGA CGCGACACGT AAACGAAAGC GATATCCGTT TTTTGAACGT AATCGACTCC AAGTCGTAAA CAAGCTATTC TAA
|
Protein sequence | MKTSKKTITI NLTRNDSKIA SARGAASKPY SFHGGMTTTN LLFAKAGKPL QDRAETVAFM KVLMGQEQAV DGCNDGTSSA PRPQRPQLSN VHGAGLLLIS DIPDSCRNKQ GVVVGIDEAG RGSILGPMVY GAAFWNPCDE DRIPNDFNDS KQLSEDKRAM LLHKIMYDTP EMGFAVRVLH ASEISRNMLR TESYNLNQMS HDAAAGIIEH LLEAGVQIGA CFIDTVGNAD HYKRRLQQEF PGIDFTVESK ADAKYPPCSA GSVVAKNVRD RMMESFQYSE MSLKRDPKFG SGYPSDPVCK DWMENNQNCK VFGYPDVVRF SWNPAKKALE RNAASVLFQA DIIDEDEEGE YCIGKKQQQT QMSAFLGKQD ATRKRKRYPF FERNRLQVVN KLF