Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48891 |
Symbol | |
ID | 7195168 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 600955 |
End bp | 602675 |
Gene Length | 1721 bp |
Protein Length | 427 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | hypothetical protein |
Protein accession | XP_002183389 |
Protein GI | 219126281 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCGAACTCT CAGTTTCCAG TTTCGCCGTA GAAATGCGTT GAGATGGCTC TTGGGCCGTG TCCAATGAGA AGCATTCGAA TACCGGGGAT ACGGGCCGAG TCAAACCGCG AACTTACGCT GCGGAATTGA TTTTTTTGTC GGTAGGCCGT CATTCGACAC AAGAAATTCT GCTCGGTCGA CACACACAAA ATTACCCAGC ATCTACTATC CCACCATGTT GTTGCGCTTT GTTCCGACAT TTGCTTACTT TTCATTACTC GTGCCCTTAG TAACCTGGGC TGGCCGTACG ACCGTTTTGA CCCACAAAAG TGCTTTGCAC GTCCAAAATG CCAACAGTCA GAATGCCTGC GTCGATGGCA GCAGTGAGGG CTTTTGGGCC GGCGCTAACA ACTCCACACG GACGTCTTGT AGTTGGAATG ACGAAGCCCA GATAATTGCG GAGTCGACAT ACGAGCTGTC GTCACCGGCG GCCCCTGAGG CGGACAATTT GGGATCCAAA TTGGGCGTTC CACAAACCAT CGATGCGGCG CGATCGCAAG AGATTCTGGA ACGCATCGCT AGCGCCCAAA CTTACATGGA CGAGGTAGTC GCCATCGACG AAAAGTACGC TAAAGTACGG GAATTGTGCG ATAACAAGAA CGCCAACTGT GCGTTTTGGG CGGTTATCGG CGAGGTACGT ACCAAGGTGG ACCTAATTGT TGCAGACAGT GATGCTATGA AATGATGATT TCTAATTTTT GGCGTTTTTG GACTTCTTTT GCTCCCTTCT TACATTAGTG CGAGAACAAT CCTGGCTACA TGAAATTGAA TTGTGGTCCA GTTTGTAAAT CGTGCGAGCA GCTTCACGTC GAAACCCGGT GTCCCATGGA CCCTGACGCC GTCGATGCCT TGTACCCGGG AACCCTCACG CATATGTTCG AAGGCATCCT CGCCAACCCC GATTTCCAAA AGTATGAAAT ATCTGTCTTG TCTCGGCCGA CACTAGCTCC TGGGGATACA GAAGAAACTG CAGACTACTT TGTTGGTGGC CCGTGGGTGA TCATGCTCGA CAACGCCTTG TCGTCGGAAG AGGCAGATCG ACTGATTGAG TTGGGCGGTA TTGAGGGCTA CGAACGCAGC GCCGATGTCG GTCACCAAAA GGCCGACGGT ACGTTCACAG CAGAAACTAA CTCGGGCCGC ACGTCGACTA ACGCCTGGTG CCAACACGAC TGTTACAAAG ATCCAACGGC GCGCGCCGTC ATGGACCGAG TAGCCAACAT TACCAGTATT CCCGAAGTGA ATTCCGAGTA TTTGCAAATG TTACAGTACG AGAAATCACA GTTCTACCAA ACTCATTCAG ACTACATTCC TTATCAAGTG AATCGACCAA CAGGTGTCCG CATTTTGACA TTCTACTTTT ATTTAAGCGA TGTGGAAGAG GGTGGTGGTA CAAACTTTCC CAAGTTAGGC CTCACCGTGA CGCCGAAAAA GGGTAGGGCC GTGTTGTGGC CTTCAGTCTT AGATGATGAA CCCAATCAAA AGGACGCCCG CTCCGATCAC CAAGCCTTGC CGGTAATCAA GGGCGTCAAG TATGGTGCCA ATGCATGGAT TCACCAACGA GATTATAAGA CATGGAGCCA GAAAGGATGC TAAGGAATCC TTTGTGGTGC GGAATTACAT TTCAAAGGCC TCCGTCAAAC ATATCCATAA AAATTAACTT TTTCGTCTAT ATTTCACAGC T
|
Protein sequence | MLLRFVPTFA YFSLLVPLVT WAGRTTVLTH KSALHVQNAN SQNACVDGSS EGFWAGANNS TRTSCSWNDE AQIIAESTYE LSSPAAPEAD NLGSKLGVPQ TIDAARSQEI LERIASAQTY MDEVVAIDEK YAKVRELCDN KNANCAFWAV IGECENNPGY MKLNCGPVCK SCEQLHVETR CPMDPDAVDA LYPGTLTHMF EGILANPDFQ KYEISVLSRP TLAPGDTEET ADYFVGGPWV IMLDNALSSE EADRLIELGG IEGYERSADV AETNSGRTST NAWCQHDCYK DPTARAVMDR VANITSIPEV NSEYLQMLQY EKSQFYQTHS DYIPYQVNRP TGVRILTFYF YLSDVEEGGG TNFPKLGLTV TPKKGRAVLW PSVLDDEPNQ KDARSDHQAL PVIKGVKYGA NAWIHQRDYK TWSQKGC
|
| |