Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44994 |
Symbol | |
ID | 7199512 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 923123 |
End bp | 925846 |
Gene Length | 2724 bp |
Protein Length | 840 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179091 |
Protein GI | 219116592 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0685164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAAGC TTGTCTTTCT GGCGACCCTT GCAGCGTTTC TTACGTTGGC GGATGCCAGT GATGGTTCCA TTGTCGGTAC GTGCAGAAAC CCCCTCTTGA GTTTGTAGTT GTATATTCAC TGTCACTCCA GATTAGAAAA CTGACGCATG AGGTCCTCAA TGTCCCTTCC ATTGGTCACC ACAGACTCCA TCGAAGAACG AAAGTTAGCT GGGACAACGT GCACTACTAA GAATTTGGAC TTTAGCGAAT TTGCTGCCGG AACCTACTTG AGTAACTTGG AGGCTGCCTA TGGGGTGACT ATCACTGCCG TTTCCCGTAC AAGCAAGGGC TACACACCCA ACGGAGCCGC CCGTGTTTTC GACACATCCA AGCCCACCGG TGCAACGGGA CAATCAATGT GCTCCTCTGG TGACGGTGAC TCTGATCTCG GATCACCCAA CTCCGCTTGT CCCGGAGGTG GACCAGGTCA CGGACCTGGA GGTGCACCAA AACTCGCCAA CGGACAGAAC AATCCTTATA AGAACTGCTC GCCCCAAGGC AAAGTACTCA TCATTCAAGA AGGCAACAAA AATTGTCCCG ACGACAGTGC GGACGGTGGT ACTATCCGCT TCGACTTCTC CAAAACAGTG GACCTCGAGT CGGTGACGTC CTTGGATATT GACGAGGGCA GCACTCCCGA AATCACCGTC TCGTACGGCA ACGGCCAGGA GGCTTTTTAT AAGCTTCAGG CTACGGGCGA TAACGGTGTT TTTACGCAAA CGATCAACAA GAGTGACGTC AAGTGGTTCC AGATCAAGTT CTACGGCTCG GGATCCGTAT CAGGCTTCAA GTGGGATGAG TGTGTCACAG CCCCAACGAA AGCCCCCACG AAAAGCCCCA CAAAGGCTCC GATACCGGCC CAAACCAGAG ATGATACTTG TCCAACCAAG AACTTGGACT TCAGTGAATT TGCCACCGGA ACCTACTTGA GCAACTTGGA GGCTGACTAT GGGGTGACTA TCACTGCCGT TTCCCGTACA AACAAAGGGT ACACACCCAA TGGAGCTGCC CGTGTTTTCG ACACATCCAA GCCCACCGGT GCAACGGGAC AGTCAATGTG CTCCTCCAGC GATGGGGACC CAGATCTCGG ATCACCCAAC TCCGCTTGTC CCGGAGGTGG ACCAGGTCAC GGACCTGGAG GTGCGCCAAA ACTCTCAAAC GGTCAAAACA ATCCTTACAA GAACTGCTCG CCCCAAGGCA AAGTACTCAT CATTCAAGAA GGTAACAAAA ATTGTCCCGA CGACAGTGCG GACGGTGGTA CTATCCGCTT CGACTTCTCC AAAACAGTGG ACCTCGAATC GGTGACGTCC TTGGATATTG ACGAGGGCAG CACTCCCGAA ATCACCGTCT CGTACGGCAA CGGCCAGGAG GCTTTTTATA AGCTACCGGC TACGGGCGAC AACGGCGTTT TCACGCAAAT GATCAACAAA GGTGACGTCA GGTGGTTCCA GATTAAGTTC TACGGCTCAG GATCCGTATC AGGCTTCAAA TGGGCCGAGT GCGTCCCAGC CCCAACGAAA GCCCCTGCCA AAGCTCCGAC AAAAGCTCCT GTCAAAACTC CGACGAAAGC CCCAGTAAAA GCTCCAACGA AAGCCCCAGT AAAAGCTCCA ACGAAAGCCC CGACTAAAGC TCCAACGAAA GCCCCAGTAA AAGCTCCAAC GAAAGCCCCA ACCAAAGCTC CCACGAAAGC CCCAACCAAA GCTCCAGCGA ATGCTCCAAC GAAAGCTCCA ACAAAAGCCC CTGTAAAGGC TCCGACGAAA GCTCCAGTGA CAGCTCCAAC AAAGGCTCCT GTCAAAGCGC CGACCAAGGC GCCAACCGGT ACCCGCGATG AAATATGTGT CGACGAAGTC CTCGACTTTA CTGACTTTTC TACAGGCGAG TACGTCCATG ACCTGGTACG ATCTCGCGGC GTTACAGTGA CAGCAATTGC ATCCGGAAGC GACGGATACA CCCCCGGCGG TGCGGCTCGC ATTTTCGACA CTCGCTACCC TTCCGGCAGC ACTGGACAAG CGCTCTGCGC CCAGAACGAA GGTGAAACAA CTCTCGGGTC ACCCAACCTT TCGTGCCCCG GCGGTGGATC CGGATCGGGT AACGGAGGCA AAGTCAACAC GCCCTTCGCC AACTGCGACG CTCGTGGTAA GGGTCTCATC ATTCAAGAAG GAAACGTGGC CTGTCCTGAA CACGCTGGAC AAGGCGGAAA AATTGTGTTT GAGTTTGCGG TACCGGTTGA GCTCAACTAC ATCGATTTGC TGGTTAGCAC CGACTCCAGT CCGGTAATTA CGGTGTACTA CGGCGTAGAC CAATCCATTT CGTTTGATAT GCCGGTGATC GGCGCCAATG GCTACCGTCG ACAAGTGATC GATCGATCGC AGGTTTACAA GGTCGAGGTG GGCTTCTGTA GTGGAGGTAC CGTCACTGCC ATTGACTACA TTCGTTGCGA GCCTGAAGAG GAATGTCCAC CGAGCACTGG TTCAGTCAAA CCCCTCCCTC CGATCGAAGT GCATCTTCCC CCGCCGAACA GCAAGCACAT GGTTTTTGAC TTTGTTGTTA TGAAGAATCA AGAATCGTGT CCTCCGGAAT GGCTTCATTA ATTTGGAGCA TGCGTTGTGA TCGACACGGG GTCGCGCATA GTGCAGACTG TCAGAAGGCA TTTCATAATT TATGTAAACA CAGGTACTCA GCCT
|
Protein sequence | MMKLVFLATL AAFLTLADAS DGSIVDSIEE RKLAGTTCTT KNLDFSEFAA GTYLSNLEAA YGVTITAVSR TSKGYTPNGA ARVFDTSKPT GATGQSMCSS GDGDSDLGSP NSACPGGGPG HGPGGAPKLA NGQNNPYKNC SPQGKVLIIQ EGNKNCPDDS ADGGTIRFDF SKTVDLESVT SLDIDEGSTP EITVSYGNGQ EAFYKLQATG DNGVFTQTIN KSDVKWFQIK FYGSGSVSGF KWDECVTAPT KAPTKSPTKA PIPAQTRDDT CPTKNLDFSE FATGTYLSNL EADYGVTITA VSRTNKGYTP NGAARVFDTS KPTGATGQSM CSSSDGDPDL GSPNSACPGG GPGHGPGGAP KLSNGQNNPY KNCSPQGKVL IIQEGNKNCP DDSADGGTIR FDFSKTVDLE SVTSLDIDEG STPEITVSYG NGQEAFYKLP ATGDNGVFTQ MINKGDVRWF QIKFYGSGSV SGFKWAECVP APTKAPAKAP TKAPVKTPTK APVKAPTKAP VKAPTKAPTK APTKAPVKAP TKAPTKAPTK APTKAPANAP TKAPTKAPVK APTKAPVTAP TKAPVKAPTK APTGTRDEIC VDEVLDFTDF STGEYVHDLV RSRGVTVTAI ASGSDGYTPG GAARIFDTRY PSGSTGQALC AQNEGETTLG SPNLSCPGGG SGSGNGGKVN TPFANCDARG KGLIIQEGNV ACPEHAGQGG KIVFEFAVPV ELNYIDLLVS TDSSPVITVY YGVDQSISFD MPVIGANGYR RQVIDRSQVY KVEVGFCSGG TVTAIDYIRC EPEEECPPST GSVKPLPPIE VHLPPPNSKH MVFDFVVMKN QESCPPEWLH
|
| |