Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49480 |
Symbol | |
ID | 7195946 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 377020 |
End bp | 378512 |
Gene Length | 1493 bp |
Protein Length | 486 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184119 |
Protein GI | 219127806 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAGC TCGCGCGGAC GATGGCGTCG TCTTGCGCTC GTCCGCAATG GCAAACACAG GATCGTACTG TGTACATAAC GTATTGTCGT TTCGGAGCAA TCAAACAAGC ACGTCCGCTA CATTTGCGCT GCATCCGACG ACTCGTGAGC GTGTGTACAG AAACGCAGGA TTTTCCTGTA CGGAGCCTAC CCGCACAGTT TTTACGGGCG GCAGATCCAT CCAACTACGA TGCCAACGGC CAACGCGTCG ATGTGGCCTA CGATTCAGAA ACGGGCAAGC CCATGCATGT GAGCTCTTCC AAGACAGATG TCGATATTCG AGACTGTGTT TTGGTCGACA ATGCATACCG AGTCACATGG AAGGATGGCC GTGTATCCGA ATACACGAAA GCCTGGGTGC AAGCTCAGGT GGAAAGTTGG CAAGGCACAC AGTACAAAGT GGAAGAAAGT AGGAAACGAG TGTTGTGGAC GGGCTTTACC GCTGAATCGG TACGGTCGTC TACGGAATTG TATTTGCAGT TTGGTGATGT CTTACAGCCT ATTGGATGGA AGCAAGCCTT GCGGTCGTTG TATCGGTACG GGATCGTCTT GGTCCGAAAC ACACCCACAA CCGATGGAGG CGCCGGAATC GCAGCTCTGG CAGCCGCCAT TGGTGGCGGA TCCGTGAAGA ATCACACCTC TCTTGTACCG GGATATTTGG ATGGTAGTAG CGACGTGATT TGTTCACCGC AAGGTACGGA CGGACCGCTG CGCACCTTGT ACGGGACGGT GTGGTCTACG ACGTCGTCGG GACAACCAGT CGGAACCAGT ACGGCCGATT CCGCCTACGG GCACGCTAGC CTACCTCTGC ATACGGATAT GACTTACATG CGTGATCCCC CTGGTCTGCA GATCTTCACT ATGGCCCGTC CGGCGACCAA GGGAGGGGAA AGTGTGTTTG GTGACGGGTT CGCCGCTGCC GAATTCCTGC GCGGGACCAA CCCGGCTGCT TTTGCAACTC TCTCGAGCAC CACTCGTCGC TATCGTAGCG TGGATGCGTC CACGGGTTGG CATTTGGAAG CTTCGGGACC AATCATTACG GTACTAGACC GCAATGGTCA CCAAGACAGT GTGGTGGGCA TTCGCCACAA CGATTTAGAT CGCTTACCGG ATTTGCCTCC ACCTAGCGCT GATCCGGACG AATTTTACAG CCAATTGATC GAGGCACATC AGGCGTGGGA CGAGATATTA GCACGGGACG ATTTCCGACT CGTCATTGGT TTGGAACCGG GGGAAACCAT GGTGGTGGCC AACCAGGTAC GTCGGAGGCA GGGTACACGG ATGCGTAGGT AAAGATGTCT GGTTTGGACC GGATGCTACC GTCGACCGAC CCTTTTGAAT CGGCACGGCA CGTCACGTAT CTCACCTTTT TCCGACCGTT TGTCAATTTT AAAAGCGCTG TTTCCACGGA CGCTTCAGTT TCGACACTAG CGTCGCATCG CCGCGTTCCG TGA
|
Protein sequence | MSKLARTMAS SCARPQWQTQ DRTVYITYCR FGAIKQARPL HLRCIRRLVS VCTETQDFPV RSLPAQFLRA ADPSNYDANG QRVDVAYDSE TGKPMHVSSS KTDVDIRDCV LVDNAYRVTW KDGRVSEYTK AWVQAQVESW QGTQYKVEES RKRVLWTGFT AESVRSSTEL YLQFGDVLQP IGWKQALRSL YRYGIVLVRN TPTTDGGAGI AALAAAIGGG SVKNHTSLVP GYLDGSSDVI CSPQGTDGPL RTLYGTVWST TSSGQPVGTS TADSAYGHAS LPLHTDMTYM RDPPGLQIFT MARPATKGGE SVFGDGFAAA EFLRGTNPAA FATLSSTTRR YRSVDASTGW HLEASGPIIT VLDRNGHQDS VVGIRHNDLD RLPDLPPPSA DPDEFYSQLI EAHQAWDEIL ARDDFRLVIG LEPGETMVVA NQVKMSGLDR MLPSTDPFES ARHVTYLTFF RPFVNFKSAV STDASVSTLA SHRRVP
|
| |