Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44888 |
Symbol | |
ID | 7199592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 549477 |
End bp | 551185 |
Gene Length | 1709 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | hypothetical protein |
Protein accession | XP_002179021 |
Protein GI | 219116452 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTGGCATGC TTATGTTGTA TTGGTCACCA GCGTGCCAAC CATGCGCGGC GTACCTTTCT CTGGCGTAGT GGGTCTAGCC ACAGTACTCG TTATCCAACC GCTGCCTTTA CGAGGATTCG TTGGCCAGCG CTCGTTCCGT GTTGCGCCGT TTCCTACGAA ACTCACTGTT CCTCCAAGTC AACAGCAAGG ACCGCGACTT TCCAAGCGAC TTTCTCCGAA CCGTGCCGCA TCGAACGAAT CGATCTCTGC GGATAAAGAT CATGAAGCTG CTCAGTATAC GTCTGATCGG ATACAGAGTT TGAAAAACAA ACTAACTCGT GACTTTGTCC AGATTGGCGG TCCCGCGCTG ATACAGTTGG CGGCTGAACC GTTGGCGGCC TTGGTGGATA CAGCGTATCT AGGAAGACTC GGTCCCGAGG TACTGGGAGG CGCAGGTGTA GCCATTTCGG CCCAATATGC CGTATCGAAG CTCTACAACG ACCCTCTCTT GCGGACATCG ATTAGTCTGG TAGCGTCCCA AGACGGCAAA GCTCGTGGAA AGGAAGCAGC GACTCAAGCA GACACCGACA AAGCTGCTAA GGAACTAAGC GTTGCGGTGT CTTCGGCTCT ACTTTTGGCC GCCTCTGTTG GAATTATTCA ACTTCTCGTG TACTCCATAT TTTGCAAAGC AATCACCGGC GGCATGGGCT TGAATCCGTC CTCTCCCATG TGGCATTCCG CCGTTTCTTA TTTGCAAGTC CGGGCCTTTG GCACACCCGC GGCAACGCTC TGGCTCGTGG CCAACGGAAT TTTTCGAGGT CTCGGCGATA CACGCACACC ACTCTGGTAT TCGCTCTTTT TCACAGCTCT CAATGCCGTT CTCGATCCGC TTTTTATTTT TGTGTTTCAT TGGGGGGCCT CGGGGGCTGC GGCGGGGACA GCGTTGGCGC AGTATACTGC ACTGGTCCCT TTGCTCTTTG CCTTGAATCG TCGGGTACGG GTGGACATAC TCGGCCAGCT TGGTGCACTA GGCGAATCGC TGCAAAAGTA TCTAAAAGCC GGTAGTTTGG TATTGTTCCG CAGTCTCGGG AAAGTATTGG CCTACTCTGT CTGTGCCCGT CAGGCCGCCA TGCTGGGCTC CGTCTCGGCG GCCGCCTACA ATTTGACTTT CCAACTAGGA TTCGCAACGA CACAGATTTG TGAAGCAGTC GCGGTTGCCG TTCAAACAAC ATTGGCCCGG GAACTGGCCG ATACGGATTC ACATCCCCCC AAAGTCCGAG CCCAGCTCAT TCGACATTTG ATCTCCACTT CGATCTGGTT GGGCGGGGGT GTCGCGACAG CCTTATCCCT GTCGACCTTT TGGCGTCGTA ACTGGATTCT GGCTAGTCTT ACCACCAATC CGGCTGTACA GGCAGCAGCG GCAGGTATCT TTCCAGTTGT ACTGCTGACT CAAGTACTAA AAGGTTTGGC CTATCCTGTG AACGGCATTA TTATGGGAGG TTTGGATTGG TTTTACTCTA TGATCGTTAT GTGGATTGCA AACTTCGCGT GTGTCGGGCT GGTTCGCTAT TTTGTCACAA CGTCCGGAGC AGTTAGCTTG GCACAAATTT GGTGGGCACT GGCGGCCTTT ATGGGGACGC AAGTAGTCGC TGGTATTGTG CGATACGAAA GCAAAACAGG AGTATGGCAG GTCCTTCAAG GCGATAGCCT AGCAGCAGCG AGAGCCTAA
|
Protein sequence | MRGVPFSGVV GLATVLVIQP LPLRGFVGQR SFRVAPFPTK LTVPPSQQQG PRLSKRLSPN RAASNESISA DKDHEAAQYT SDRIQSLKNK LTRDFVQIGG PALIQLAAEP LAALVDTAYL GRLGPEVLGG AGVAISAQYA VSKLYNDPLL RTSISLVASQ DGKARGKEAA TQADTDKAAK ELSVAVSSAL LLAASVGIIQ LLVYSIFCKA ITGGMGLNPS SPMWHSAVSY LQVRAFGTPA ATLWLVANGI FRGLGDTRTP LWYSLFFTAL NAVLDPLFIF VFHWGASGAA AGTALAQYTA LVPLLFALNR RVRVDILGQL GALGESLQKY LKAGSLVLFR SLGKVLAYSV CARQAAMLGS VSAAAYNLTF QLGFATTQIC EAVAVAVQTT LARELADTDS HPPKVRAQLI RHLISTSIWL GGGVATALSL STFWRRNWIL ASLTTNPAVQ AAAAGIFPVV LLTQVLKGLA YPVNGIIMGG LDWFYSMIVM WIANFACVGL VRYFVTTSGA VSLAQIWWAL AAFMGTQVVA GIVRYESKTG VWQVLQGDSL AAARA
|
| |