Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37072 |
Symbol | |
ID | 7202091 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 106060 |
End bp | 107480 |
Gene Length | 1421 bp |
Protein Length | 381 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181300 |
Protein GI | 219121911 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.730274 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCCCAA CGAAAGGACT TCCGCCGTAC GCAAGTGTGC TGGTGAAGCA TGGATCGGTA GACAGTAGAG ACAACGACGA AGAAAAGGTA AGAAGCGGTT AAATCTGAAT CCAGAATGTA CACATGGTTC AGTTGTCAGT AAGGTGCGAC GAAACTCGAA CTCGCAGTAA AGTATACTGG AGGAGCGAAA CATGGTCTCT TGCCGAATAC TCATTGATGC TATTTTTTGT TAGAGATCAA AGCTCTTGCG TCATCTTGGA AATGGATGTG AGGCTATAAA TAGAGCCCCG GTTCAACCCG GACAGAATAT TTCCTCCTAC TCAAGCTTTC CTGGTGCTCC AATGCCGACT CCTCTTCTGC AAACGAGCAA TTTTCCATCG GGGTTTGGCA GTGAACTTCC TTTGCCCTCC GTTTTGCGAG CAAACGCTTC TCTCCACTCT CTTGGCTTTG GCTCCCCAAC CAACGCATCC ATGACAAGCA CGGGTCATCC TTTGATTGGC CAATCCATGT TCTCGACGCC AATACATGGG TGCTTCTCCA CAAGTGATAG AGGAAACGAC TCCTCGAGCA CATTAGCTAT TCTGTTGCCC GTTTCAGAAT CAATGTCCGC GTCGAGTCTT GGAATCGACA GCACACTTCT CCAGTGCGCT GACTATGCGC GCAGCGCTGA TGACTTCCAC CTAGAACATC AAATACAGTC AATTACAGCT GCTTCTTTGT TAGATTTAAC ACCCTCTCTT TCTCACCAAG AATTTCAAAC CTTCGTCGAG ATCTCCGAGA ATCATAGCGC TAGTCGGCCC ACGTCAAATA GCAGACTCTT GAGCAAACGA GATTCGATCT TGTCTACAGA TATCCAGAAA GAGAAGCGAC AGAGGCCATT TACATGTGCC GACGCGATTG CCCATGCTGC TACCGAGAAC ACGGGAATCG CAATCCGTAG TCGACGATTG TTTGAGCAGG TTCCCAAAAC AGTAAAGCCT TGCAAGTGCA AGAACACGCA CTGTCTGAAG CTGTATTGCA CCTGTTTTCA GAAGGGATCA TTTTGCGATC CAGACATTTG CAAATGCATC GATTGCTACA ATTTGAGGGA ATTCAACGAG ACCGGGGGCA AGCGACAGGA AGCTGTTTCT GAAATCTTGT TACGGCGCAT TGACGCCTTT GAATCCCGTC CAAAGAAAAA GACTGGTGAA GGATGTGCTT GCAAAAAGAA TCGGTAAGTC AGAAGGGATA CACCTAAATG CTGTAGCCTG TACAACTCTC AAATCTTGCC ATATATGAGC AGATGTCTTC AAAAGTACTG CGACTGCTTT GCCACAAAGT CGGACTGCAC TGAACGCTGC AGGTGTAGCG CCGCATGTGG CAATAATCGG TTCCCGGCGA TTGAAGACAA TCTATCAAAC GAAGAACCTC CCAGTCCATA A
|
Protein sequence | MGPTKGLPPY ASVLVKHGSV DSRDNDEEKR SKLLRHLGNG CEAINRAPVQ PGQNISSYSS FPGAPMPTPL LQTSNFPSGF GSELPLPSVL RANASLHSLG FGSPTNASMT STGHPLIGQS MFSTPIHGCF STSDRGNDSS STLAILLPVS ESMSASSLGI DSTLLQCADY ARSADDFHLE HQIQSITAAS LLDLTPSLSH QEFQTFVEIS ENHSASRPTS NSRLLSKRDS ILSTDIQKEK RQRPFTCADA IAHAATENTG IAIRSRRLFE QVPKTVKPCK CKNTHCLKLY CTCFQKGSFC DPDICKCIDC YNLREFNETG GKRQEAVSEI LLRRIDAFES RPKKKTGEGC ACKKNRCSAA CGNNRFPAIE DNLSNEEPPS P
|
| |