Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49678 |
Symbol | |
ID | 7198161 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 386790 |
End bp | 389231 |
Gene Length | 2442 bp |
Protein Length | 729 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184454 |
Protein GI | 219128509 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.452781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTCT CTACTGCCGT TGTATCACTC ATAACCGTCG CACCACTGGT CGTGGGCGCC GCTGAGGAAT CTCGATACCT TAAGACCGGA ATGGTGCGTC TCTCAATCGG CTCGCTGCTT GTTGGATCTC CCGGGTGCTT GTTACCTCAA CGATTATCTC TTTTTACTGT AGAAATCCAA GTCTGGCAAA GGCAAAGGCG TCCGCCAGCT ATCTGGACAT CGAAAGAGTG AACATCCTGC GTCCGATAAT GGCAAGGGAA AGGAAAAAGG AAGTAATCGC ATGAATAGCT CTCCTAAGAA GAATAGAGCG AAGAGCAGTG GAGGTGAGGC GGACAAGTGT GCGGGAAAGA ATGGGTTTTC TCGGTTCCCC TTTAAAGGGC TGGACAACTC GCAGCGTCTT CTGCTTCGCG AAGGTGTCGT TGACTCCATC AAGCCTTCTT TGTGTGGAAA CGAAGGCAAC AAGAATGTCA TCTTGGTGGT TGGAGATGGT ATGGGATGGG AGATGGTCAG ATCCGGTGCT ATTGCTAAGC AGGTGGTGGA TGAGCTGGAA GGTCTCGGTT GCGATACCAC CACAGGTTGC CCAGACAACA GTGCTGCGAT GAATGCTTTC CGTGGACGAA CACTAGACGA CTATTATACT GAGGGTAAGC TTTCTTGCTG CAAGTTATTG CTGTGTAAGA GTAAACAACC CAAAGCCTCA CGACCAACTC CTGCTTTCTC TTAAGGTAAG GGAAGTGGTA TGTCTTTCCA AGAGTTGGAT GGCTATGCTT TGATGACAAC CACCACCACT GTCACTCAAG AACCCAACCC TGGAAACCAC TATGCTCCTT CTCGAAGTCT TCTTGAAGGT GATGTCTCTG AGCACGAGAG TGGTCAGGCT GCCCTTGCTC TTGACGAGTG TGGTTTCCCG ATCGATTTCT CTCCGCTTGA CTTTGAAGCC GACGGCGGCA ACATGGTCCT TTGGGACAAT AAAATGGGAG GAGAATTTCC TTGGGACAAA CGCTACTATC AGGAGCGTCC CGATACTTCA ACCGGATTCG ACCCGGAGTA CATTATGCGT CATGCGACGG ATTCGGCTTC TACAGCTGGA ACAATGGCCA CTGGTCACAA GGCTGCCGTG AACATGATGT CACAAACACT TTACGAAGAA GACGTTAGCA CGCTTGTGGA AGACGCCATG TATTGCGGTA TGGCTGGAGG TGTCGTTACT TCCGTTCCCA TGCTCCATGC TACTCCTGGA GCCTTTGTGA CGCATACGAA CTCCCGCTCC GATCGCGACT CCTTACGTCG TAGCTTCATG CAGGTTCGTC CCACAATGGC CAGTGGTGTT TGTGGAGGCC GCTACTATCC CTTCGAAGAA GACCTGGAGA GCATGATGAA CGGCGCCCTT TCCAGTGAGT GGACCTTTTT GTACCAGAAT AACATGACAA CGGCCGACGC TTTCTACGAC CCGATTGCGG ACCTTGATCC TGACAATGGC GATCATCTCC TTGTTTGCTT GGGTGGCGAC TACACCACTA GCGGCCAACA AAATCTTCCG TACCGTGGTG TTGACGGTAC ATACTCGAAT CGTTGGTGTA GTTCTGGTGA AGGACAAACA GACCCCGATA CTGGTGCTGT GATCGGAATC ACTGCCACAA CTCCAGATGA ACTCTGCAAC CATTACGAAC AGGAAGAAAT TGAACAGATT CCCCATATCT CCGAAAATGT CAAAGCTGCT TTGGACTTTC TCGGAAAGGA CGATGATGGT TTCTTTCTAA TGTATGAGCA GGGAGATGTA CGTTGTGTTC CTGTCTACCC ACCCCTCGCT TTCTCCCGGA ATCTCACGTG TTTCCACTTT GTATCTCTTA GATTGATTGG TCCGCTCACG CCAACCACAT GGACGACATG ATTGGAACCA TGTTTGACGT TTCGGAGTCG GTGCAGGTCA TCATTGACTG GATCATGGAT AACGGTGGCT GGGATAAGAA CGCCCTCTAC GTCACTGCCG ACCACGACCA CTACCTTACT CTGAAGGACA ATTTTCCCGA GGCCTTGGCC CACTTGCTCA TCCGCGGTGA ATCCCACAAC ATTACGCCTC AGAGTAATTC CGGCGTAAAC CCGTGGGATG CCGGTATCGG AGTCGGTCGT CACGAAGATG ACTCCCAGAG TGTCACCGAG CATATTAACG ACTTTTCTAC CTGGTCGGAA GACGACGTTG ACGCGGTGGG CCACTTCTGG GGCGCCAACG GTTCCGGCGG CAACGGCTGG GGTAGCCACT CGACGCGCCC CGTCCCGGTC AGTTACATGG GAGACGATGG CTGCATCGAA GCGTTGACTG GTACCGGCTT TCAGGTTCTT GGCCGCGACG TGAAGGGGCA TCACGGTAAA ATCGACCAGA TGCATTTGCA CGCTTGCATG CTCAAGAACC TGTTCGGTCT CTAATCGGAT TCTTGGTGAT TG
|
Protein sequence | MKFSTAVVSL ITVAPLVVGA AEESRYLKTG MKSKSGKGKG VRQLSGHRKS EHPASDNGKG KEKGSNRMNS SPKKNRAKSS GGEADKCAGK NGFSRFPFKG LDNSQRLLLR EGVVDSIKPS LCGNEGNKNV ILVVGDGMGW EMVRSGAIAK QVVDELEGLG CDTTTGCPDN SAAMNAFRGR TLDDYYTEGK GSGMSFQELD GYALMTTTTT VTQEPNPGNH YAPSRSLLEG DVSEHESGQA ALALDECGFP IDFSPLDFEA DGGNMVLWDN KMGGEFPWDK RYYQERPDTS TGFDPEYIMR HATDSASTAG TMATGHKAAV NMMSQTLYEE DVSTLVEDAM YCGMAGGVVT SVPMLHATPG AFVTHTNSRS DRDSLRRSFM QVRPTMASGV CGGRYYPFEE DLESMMNGAL SSEWTFLYQN NMTTADAFYD PIADLDPDNG DHLLVCLGGD YTTSGQQNLP YRGVDGTYSN RWCSSGEGQT DPDTGAVIGI TATTPDELCN HYEQEEIEQI PHISENVKAA LDFLGKDDDG FFLMYEQGDI DWSAHANHMD DMIGTMFDVS ESVQVIIDWI MDNGGWDKNA LYVTADHDHY LTLKDNFPEA LAHLLIRGES HNITPQSNSG VNPWDAGIGV GRHEDDSQSV TEHINDFSTW SEDDVDAVGH FWGANGSGGN GWGSHSTRPV PVSYMGDDGC IEALTGTGFQ VLGRDVKGHH GKIDQMHLHA CMLKNLFGL
|
| |