Gene Information |
Locus tag | PHATRDRAFT_48833 |
Symbol | |
ID | 7195132 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 396158 |
End bp | 397908 |
Gene Length | 1751 bp |
Protein Length | 562 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183480 |
Protein GI | 219126471 |
COG category | |
COG ID | |
TIGRFAM ID | |
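The length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon. The helper names span_length and coding_length are hypothetical. Under those assumptions, the gap between the gene length and the coding length would be the non-coding (presumably intronic) portion, which fits the "Is gene spliced | Yes" field.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(396158, 397908)   # should match "Gene Length"
cds_len = coding_length(562)             # 562 aa plus one stop codon
noncoding_len = gene_len - cds_len       # bases not accounted for by codons

print(f"gene span: {gene_len} bp")
print(f"coding:    {cds_len} bp")
print(f"remainder: {noncoding_len} bp")
```

With the values in this record, the span comes to 1751 bp, matching the Gene Length field, and the remainder is 62 bp.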
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.973466 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTGCA TTGATAATCC GTATGGGAGC GGAAGGCACA CTTCGAGGCT TGCTATGATC TACACGGTGA TAATTCTCGG TTGTTTCAAC TTGTGGAATG ATTCCGCTCC CATTGATCCT CGTGTACGCG CGTCCGATAT CTCCTGCTCA CATTCCTCGG ATCGCTTCGG AGGATTCATA CACCCTTACC TCAATTTCTC AAACGGAATC AAGAGTGTTG TAAGTAAAGA ATGGTGCATG CTTTATGCAG GGATCGAAAT GCAGCTGGTT CTAAAATTTA GTTCGTAAAC GGGGTACAGG GAAGCTGTAA GGAGCTTGTT CACTACAAAG CAATCGCTCC TAAAAACAAG GCCATTTCGT CGTATGCTTC CTCGGTAATT CGAGCGCATC GATTTCCAAC AGTTTTTCAG CGAGTGCGGT TCTACATGTC ATCCTGGTAC GAGCCTCCCT GTGACGAGAG CGAGTTGCTA GAAATTGTTG AGTACGTTGG AAACGAGAAG CATGAAGGCA AGACGATAGT CGCGGGGTCA CAGTTATTAA CACATGCTGT CCCGGCGGAC GAACAAGATG GAGTCGAAGC GGCAATATTT CTTCCTTCGT ATTCTTTTCA GCGCCAAACA CTGCTTACGG ACACCAGCTT GTCGGTGGCA GTACGGGCAA CTTCTCTGCC TGGGACTGAT CTCGTCTTTG TTCTTACTAA ATCCACTCTC GACCTTTGCG AATTCAATCC CCGTCCCGAA CAGCGAAAAA GCTTGCAAAG AGTTTACTGT AGAGAGCTCC GTGACAATTT ACTTCTACCG TTTCAAAAGC AAACCAATCA AACCGGAGAC CTGGCAGTAC TGCTTGCTCA AGTTGGTGAT TCCTTGGCTT CGAAAATCAT GAATGAATTC GGATCCCCGA CTCGTCAAAA GCAAAGACCT TCTATACCTC ACTTCAAAAA AGTGAGACGA GCATGGGACA ACGTGACCGG ACGAGACTTG ATGCTGAAGA TATCGACCAA GTCTTGCGCC ACTTTAGAGA CACGGCCAAC CAATAATCAA GACCTGGAAC CAATCATCTG GAAAATGGGG GTCAAGCGGC ACTATGGTAA GGTTTCGAGT GTTCCCGAGA AGGATATTCC ATGGGAAGAA AAAGGCAATG CGGCCGTGTT TCGGGGTACG ACTACCGGCC TCGTGCACCC GGAAACATCA TCACGCGAAC GCTGCCTGAA AAATCCGCGT TGTCGACTAG TGTTGATGTA CCACAATTCG TCCTATGTGG ATGCCAAGTT TACCACAATT TTGGACTCGA GCAAGCTGCC ACCAATTATC GACAACATCA CAATCAGTGG AGAAAGTCTT TCCATGGAAG ATCAGTTGAA GTACAAGGCG TTGATATTCA TGGAAGGCAA CGATGTCTCT ACCGGATTGA AATGGGGATT GTACTCCAAC TCGGTTGTAA TGATCACAAA GCCATCAATT TCGTCTTGGG CCATGGAAGA GCTCTTGGAA CCGTACGTAC ACTACGTGCC TTTGCGGGAC GATCTATCGG ACGTGGAAAC GCAGATGAAA TGGATCGTGG AGCACGACAG GGAGGCGAAG GAGATTGCGT TGCGGGGGCA GCTTTGGATG CATGACCTGC TGTACGCCGA GGAGTCCGAG AGGGACAATG CGGCAATCAA TGAAGAGATT TTGCGGCGAT ATCAGACGCA TTTCCGACCG GGCATTGCGG TCAAGGAAGA GCTTCTATTC TATCCCAAGC CGTTGAAGTA G
|
Protein sequence | MPCIDNPYGS GRHTSRLAMI YTVIILGCFN LWNDSAPIDP RVRASDISCS HSSDRFGGFI HPYLNFSNGI KSVFVNGVQG SCKELVHYKA IAPKNKAISS YASSVIRAHR FPTVFQRVRF YMSSWYEPPC DESELLEIVE YVGNEKHEGK TIVAGSQLLT HAVPADEQDG VEAAIFLPSY SFQRQTLLTD TSLSVAVRAT SLPGTDLVFV LTKSTLDLCE FNPRPEQRKS LQRVYCRELR DNLLLPFQKQ TNQTGDLAVL LAQVGDSLAS KIMNEFGSPT RQKQRPSIPH FKKVRRAWDN VTGRDLMLKI STKSCATLET RPTNNQDLEP IIWKMGVKRH YGKVSSVPEK DIPWEEKGNA AVFRGTTTGL VHPETSSRER CLKNPRCRLV LMYHNSSYVD AKFTTILDSS KLPPIIDNIT ISGESLSMED QLKYKALIFM EGNDVSTGLK WGLYSNSVVM ITKPSISSWA MEELLEPYVH YVPLRDDLSD VETQMKWIVE HDREAKEIAL RGQLWMHDLL YAEESERDNA AINEEILRRY QTHFRPGIAV KEELLFYPKP LK
|
| |