Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45800 |
Symbol | |
ID | 7200930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 315394 |
End bp | 317796 |
Gene Length | 2403 bp |
Protein Length | 800 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180215 |
Protein GI | 219118897 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATGG TCTCTCAACC GCTTTCGCCA ATAACGCCGT CTGCAGCCTC GTGGTGGTCG CTCATGGACG ACGAACGCTT TGACGACGTA AAGTCAATCA ACCTTGATCG ACTGCCAGGG TCACCGTCAA CCCAGCCAGC CACAAAGAGC AGCTTGTGTG CAATGGCCAT CGCTGAAAGC TCTTCTGACG AAACGACACT TGCTAGCTTT GATGATGACG ATGACAAACG AGCTTGGAAG TCTCCAAAAG ATGCAGACAA ACGGGTTGGT CAGCAGTCTC CAAAAAAGGT TCAGTGGCAA CATCCAATTC AAAAAACACT TCAGTTCCAT AATAACCATG ACTGCACTCC TTCAATTCGA GATGCAACTG ACAAGCTGCT TGATTTGGTC GAAGGTAGCG CCTGCAAGTT CAATACCAAA AAATCTGAAA GCTCAAAACG GCTAGAAATC CTCGAAAGTA ACGTATTGGA AAACAAAAAA TCAGCTGTCT GGGATGCTGT CGACGATAAC TCTAGAAGCG AGAGTCGCAA TCACCCAAAG AGAATTTCCT TTGTCAATGG ATCCTTTGAT TTCAATCCAG ACCATGGGAA GGGCGTAGCA TTTAGCTCAG ACGATCGTTT CTTTGACCTG GAAGATGAGC TGGGCAACAT TCGGCAGCAA AGAAGTACCG GTTCTGAATT TTTCTCCAAA TACAAAGACT GGCTGAGGGG ATCAGGATCT GTGTTTGATC GAGAAACCCC AAATTTGGAA AGGGAATACA GGCATTCGAA AAATTGCAAA AGTCTAGCTG CAGAAATTGT AATTCGAAGA TATAAGAAGA ACGAATCCCA CCCGGCTCTT CCGCCAGACA TAAAACGCAA ACCCCACAAG ACAGGGTCGA AGGCGAACAA AGCTGTCCCA GAGACCGTCA ACATCTCTCC GAAAACAGAA GTAAACCTCC GTTCACTTCA TCAAGTGTTC AGTGGTCTTA CTCCAAGTCC TGGAGGGAGA AAGTGTTTTC ACGACGAATC ATTCGCAGAT GAAAAGATCC CGAAAACCTG CATGTTGTCA ATTTCACCTG TCCTCGCTTC CAAGACGCGC AAGATTTTTA AAAGACTTGC ATTTGCCCAC TCTGCATCAA CATTGTCAAC ATCCATGATG ACTGGAGAAA GTCGACAACT TCATACTCCG GAAAACAATC TTGAAAATTC TGCAAAAGCT AGTTTTTACT TTGCAGACAA CAGAAACTCC TACGTGGCTT ACTTCCAAAG AGGCGATGAC GCTCATAAGT GTGTTGACCT CTACGAGCAG CCATCACCTT CAATTTTCCC AACTCTCGAG TCTGAAGTTG TCGTCCGCAT TGAAGCGTCA ACAATTTCAA AAGCAGACTG TTTTGTTCGA AGTGGCTTAT GGTGGGGAGA GGACAGCATG TCGAAACTCA CGCTCCCTAT CGTTCCAGGT GTTGCCTTCT GCGGAATCGT ACACCAAATT GACAAAAGGC GTCATCGAAG CGGTTTGAAG AGAGGTGACC GAGTGATTGC TCTTGTACGG GTTGGGGCTA ACGCTCGGCA TTTGTGCGTC CATACTGATC GAGTAATAAA AGTCGCAACA GATCTGACAG ATGTGCGATC ACTGGCTTGT CTTCCCGAAG TGTATCTCAC GGCATTTCAG TCGCTAAATA TTATTCGCAA GAATTCTTGT CGCTATCGTT CGTCTTCATT GACAGGCAAA TCTATTCTGG TACTCGGAGG GGAAACCTTG CTTGGTCGAG CAGTCCTTGA GCTGGCGTGC GCTTCCAACG CAGCCACTGT GTACGCCACG GCACCAAAGA GCCAGTTCAA TCTTATTGAA CAATGGGGGG CCGTTCCTTT GGAAGAGAAT CCCCATCATT GGTTTTCCCT GCTCAAAGGG CGCCTAGACA TGCTTATCAG TGTCAAGGAT TCCACAAGCG ATGAATCGGA GCTCAAATCT GAGCACGCAC AAGCACTTAA CCGAAAGGGT ATCATTGTAG AAGTCGGAAA GCCTGAGCGA AAGGAACGAT TGATGGTTTC CCTAGACAAT CTCGAGACTT GTGGTACTGA AATGGATCGG AAGCTTTATC ACTACAATGT GTTTGATGCC TGGGAGAAAG ACCCGAAACA AGCTAAGCGC GACCTGTCCC ACCTCTTGAA CATGTTGCAA AAGGGGTTCA TCCGGCCCAA GATCCTCAAA ACTGTTCCTT TAAGTAAAGT TGCCATGGTC CACAACTTTC TTGAAAGTAA GTGCCGAGAC GGATTCATGC TTTGTGAGCC TTGGGCGAAT TCAGCGAGGC GTGAAATTAG CACCTCGGGA ATTGGTTTTC ACGGAGAGTT GGCTAGCTTT CCTGTTGGTG ACGGCCGAAA AAGAAAAAAA AATATAACAG AGTCACAGCG AGTCGCCATC TGA
|
Protein sequence | MKMVSQPLSP ITPSAASWWS LMDDERFDDV KSINLDRLPG SPSTQPATKS SLCAMAIAES SSDETTLASF DDDDDKRAWK SPKDADKRVG QQSPKKVQWQ HPIQKTLQFH NNHDCTPSIR DATDKLLDLV EGSACKFNTK KSESSKRLEI LESNVLENKK SAVWDAVDDN SRSESRNHPK RISFVNGSFD FNPDHGKGVA FSSDDRFFDL EDELGNIRQQ RSTGSEFFSK YKDWLRGSGS VFDRETPNLE REYRHSKNCK SLAAEIVIRR YKKNESHPAL PPDIKRKPHK TGSKANKAVP ETVNISPKTE VNLRSLHQVF SGLTPSPGGR KCFHDESFAD EKIPKTCMLS ISPVLASKTR KIFKRLAFAH SASTLSTSMM TGESRQLHTP ENNLENSAKA SFYFADNRNS YVAYFQRGDD AHKCVDLYEQ PSPSIFPTLE SEVVVRIEAS TISKADCFVR SGLWWGEDSM SKLTLPIVPG VAFCGIVHQI DKRRHRSGLK RGDRVIALVR VGANARHLCV HTDRVIKVAT DLTDVRSLAC LPEVYLTAFQ SLNIIRKNSC RYRSSSLTGK SILVLGGETL LGRAVLELAC ASNAATVYAT APKSQFNLIE QWGAVPLEEN PHHWFSLLKG RLDMLISVKD STSDESELKS EHAQALNRKG IIVEVGKPER KERLMVSLDN LETCGTEMDR KLYHYNVFDA WEKDPKQAKR DLSHLLNMLQ KGFIRPKILK TVPLSKVAMV HNFLESKCRD GFMLCEPWAN SARREISTSG IGFHGELASF PVGDGRKRKK NITESQRVAI
|
| |