Gene Information |
Locus tag | PHATR_43972 |
Symbol | |
ID | 7204189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 579116 |
End bp | 580726 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186085 |
Protein GI | 219113003 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.549222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGTCT TCCAGTTACG AAAAACGATG CATGTGCTAG AATGCGTGCC CAACTGGTTC CAGGTGTCGG AGATGCAAAA ACCTAAGTAT GGTGACAGAA ATGATACCGG TAAATGGTTT GTCTTCACCA TGCCTCGAGA ATATTATACC GTTCTGGTTA CCTTTCTCTG TTCAATTGGC ACTACCAATG GTTTTGGTCT CGGTGTTGAC TTTCAAAAAG AACAGGGCCG GGAGCATCGT CTATTTTTGA AAAGGAAATC GCGCCTGGCC TCTGCGCTGT TTATGGAGGT TCCGGTAGGA ATTCCTATCG ATCCCGAAGG TCGGCCATAC AAATTTCCGG CCAAAGAACA TTGTTCCAAA TGTGGCTTAT GTGAAACAAG CTACGTGGCG CGTGTGAAGG AGGCCTGCGC ATTCTTGGAA CCGGGCATGT CTCGGATCGA CACACTCGAG ACGAAAGTTC ACGGCCGGCG ACGAAAGACG ACCGACGACA AAACAATTGT GCAGGCTGAT GAACGACGCT TTGGGGTTCA GTACCAGCCA CTTCGACTCG CTCGAGGAAT CAGCATGCCG GGTGCACAAT GGACAGGCGT GGTGTCTTCT ATTGCTATTT CGATGTTGGA GACCAGACAA GTCGATGCCG TTGCCTGTGT GGCTTCCAAT GAAGAAACCT GGAGTAATCC CAATCCAATA CTAGCCCAAA CTACCGACGA AGTTCTGAAA GGAAGAGGTG TAAAGCCGTC TCTTGCTCCT AGTCTTAACA TTCTGGACGA AGTAAAAAAT GATCCATCGA TAAGGCGACT CCTGTTTTGC GGAGTTGGCT GCTCCGTCCA GGCGCTTCGT TCCATCGAAA ATGAGTTGGG TATAGAAATT TTCATATTGG GCACCAATTG TGTTGATAAC AGCCCTTCCC CAGGAGCTGC AGCTGCATTT ATCGAGAAGG GGGCGAAGGT CTTTTCAGAT TCGGTCCGTG GCTATGAGTT TATGCAGGAT TTTCGTGTTC ATGTCAAAAC CGAGGAGACC TACTTGACAA TACCTTATTT TTGTCTACCT GGCACTATTG CTGAATCGTC TATTGCCAAG TCATGCCGAT CTTGTTTCGA CTATACAAAT GCTTTGGCGG ATGTAGTGGT TGGATACATG GCAGCGCCAC TTGATGGAAA GTCGAGAATG GACGAATCTT GGCAGACTGT CACAGTCAGG AACGAACGAG GCAATCAGAT GGTTGAGACT GCGATTACAC AAGGACGTCT AGAAGTTGGA GACATTGTAC GAGGATCTGG CGATCACCAA CAACTTGCAA TTGCGACTAC GAAATCCGAT GCTCTTGTGC AAGCCATGGT GGGTGGCAAA GTTCAAGAGA ATGGGATGCC GCTATGGCTA GGGAACATAA TGGCAACGGT TCTCCGAAAA GTTAGTGCCA AAGGAATCGC ATTCGCCCGG TACAGCATTG ACTACCACAT AGTGAGGAAC TATTTTCATG TTCTGAACGA GTGGGGCGAG CATCGCGCTC GATCCTCAAC ACCGCAATTC GCTTTGGAAA TTGTCGACGA ATACCTCGAA ATGGATTCTA CGCTAAAGGG ATATGCCGCT AAACTTACTT CGAAACATTG A
|
Protein sequence | MTVFQLRKTM HVLECVPNWF QVSEMQKPKY GDRNDTGKWF VFTMPREYYT VLVTFLCSIG TTNGFGLGVD FQKEQGREHR LFLKRKSRLA SALFMEVPVG IPIDPEGRPY KFPAKEHCSK CGLCETSYVA RVKEACAFLE PGMSRIDTLE TKVHGRRRKT TDDKTIVQAD ERRFGVQYQP LRLARGISMP GAQWTGVVSS IAISMLETRQ VDAVACVASN EETWSNPNPI LAQTTDEVLK GRGVKPSLAP SLNILDEVKN DPSIRRLLFC GVGCSVQALR SIENELGIEI FILGTNCVDN SPSPGAAAAF IEKGAKVFSD SVRGYEFMQD FRVHVKTEET YLTIPYFCLP GTIAESSIAK SCRSCFDYTN ALADVVVGYM AAPLDGKSRM DESWQTVTVR NERGNQMVET AITQGRLEVG DIVRGSGDHQ QLAIATTKSD ALVQAMVGGK VQENGMPLWL GNIMATVLRK VSAKGIAFAR YSIDYHIVRN YFHVLNEWGE HRARSSTPQF ALEIVDEYLE MDSTLKGYAA KLTSKH
|
| |
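The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length (the gene sequence above ends in TGA). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(579116, 580726)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results match the Gene Length (1611 bp) and Protein Length (536 aa) fields listed in the record.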