Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47179 |
Symbol | |
ID | 7202067 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 722835 |
End bp | 725714 |
Gene Length | 2880 bp |
Protein Length | 902 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181429 |
Protein GI | 219122179 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGCC GTACCCATAC AGTACCTTCG TTAGATCGAT GTCGACGTCG TCGGGTAATT GTTGATTCAT CCGGCAGCGA TAGTGGAGAT GGTACGGAAG TCGAAGTCTT TCCGTTGTCG CGTCCGACCC GTTGGCTCGG TGATGACGAT TTTTCCGATA CTTCCTCATC CTTGGAAGGG CACCTTCGTA AATTGGATTT GAAGCACTCA CGTTTGGACG ATGAGATTCA AAGGTTAGCT CGAGCATCGA TAGACGGCGC TGTTCGGGAT ACAGTGGAAG AGTCGTTTGA CCTTTCCATA GGAGACTCGA GTTCCAGTGG AAGCGTGGAT GGCCCAAATC TTTGCGAAAA AGAGGATCCC ACAGAGCTTC CATTGGGTTC CAACTGGATT TGTGATCGGA AAAGCAATGA AATGTTATTG CGTGCCCAGG AAAGCGACAC AACAGAAGTC GACTGGCCAG ATCTGCGCAT ACCTCGTGGG CTCTTTCAAA AGCTTTTTGA CTACCAGAAA AGTGGTGTCC AGTGGATGGG GACACTCCAT CAGGTTGGCA TCGGAGGTGT GCTAGGAGAT GATATGGGAA TGGGTAAAAC ATACATGGCA TTGACCTTCT TAGGAGGATT GATGCGAACT GGCGTAATAC GGAACGCACT CATCGTTTCA CCTGTCTCTG TTTTACGCTC CTGGGAAAAA GAGGCCCAGA ATGTTTTGAC CCAGTGCGTT CACGATGTTC GCATTGCTGT CCTTTCCAGC ACCCAGAGTC AACAGCGCGA TAGAATTTTG CTTAAAGCCT TGGAAGACGA ATCGTCAAAT TATTTGATTA TTACGAGTTA CGGACAAGTC CGGTCGGCCA CTTTGAGTTT CGTTCAAAGT GATTGCTGTT TCGACTACGT GGTGCTGGAT GAGGGTCACC AAATCAAGAA TCCTACCAGC GCAACCAGTC GGGCTTGTCG CCGGATCTGC CGCAGTCGCG AGACGCATCG GCTCTTGCTG ACAGGAACGC CTATACTCAA CAATCTTAAG GTATGTTGGA AGAGTTCGAG ATCTCCAAAT TTTGTGCCCC GTGCGTGGTC TAATGCCCTC TATTTTGCAG GAACTCTGGG CACTTTTTGA TTGGGCAACG AGTGGGCAGA TTCTCAACAA GCTGAAAACT TTCACGAATT ACTTCGCTCG ACCAATCGAA GACGCTCGCA ACAAGAATGC GACAACACAT GCAATCAAAC TGGGACAACG GGTGAACAAG GAACTTCAGG AGAAGCTCAA GCCGTACTTT CTGCAACGCC TCAAAGTTGA CTTTCTCATA GACAAACTTC CGTCGAAAAA CGAACTTGTT GTTTGGACGC ATTTGAGTTC AAAACAGCGT ACAATGTACT CCGACTTCGT GGACTCCAAG GAATCAGTGG TAAGCTCGAT CCTTTCTGGT GAAACCAGAT CGCCGTTGGA AGCCGTTACA TGGCTGAAAA AGCTCTGCGG GCATCCTATT CTAGCAGAAG AACTCGCAAT CAATGTTGGA CGTTTACTTG CTACGGCCAG TCCTGATGAT TTGGTCCAGC AATCGGCCAA GCTCTGTATT CTCTTGTCGT TGATCGAAAA CTTTCGCCAG AACGGCCATC GAACCCTCAT TTTCTCGCAG AGTACGAAAA TGTTGGATAT CATAGAGAAA ACGCTTCTAT CCGAGGGGGT GGAACTGTTG CGTATTGACG GTAGCTCCAA AGAACAAGAC CGACAGCGTT TTGTGGACGA CTTCAACTCA AACACTTCCA CAACGGACGC GATGCTACTG TCAACCAAAG CAGCTGGGGT CGGCCTTACC CTCGTTGGTG CCGACCGAGT GATTATTTAC GATCCAAGCT GGTACGTCTC AGAAGCTCGT TTCGCGTTGC AAGAGCAGCG CCACCTGGAA TGTGCTATTG CTGTGACACT ATTGTTGTTC TCACTCCTTT CCAATAGGAC TCCTGCCGAA GACTCACAGG CTGTGGATCG CTGCTATCGG ATTGGCCAGA CTCGTGACGT TGTGGTGTAT CGCTTGATCG CTGCTGGTAC CGTGGAGGAA AAGATGTATG AGAAGCAAGT GCACAAGGAT GGAATCCGTC GTACTGTGTT CACAGAAGAC ACGTCGGTGG AGCGCTATTT CGACAAACTA GAGTTGCGCA AGCTCTTTGC GCTGGGAGCT CCGGGGTGCG TTTGATTGTG AACGCGCCAG CCCCCTTGTA TGTGACGCAA TTTTTGGTGC TCATCGCTTT GTTTTCTTCT GTGTGTGTGT GTGATCTTAC GTACAGTGTT TGCGAGGTCA TGGAGAAAGT GCAGAAAGCA ACGCAAGGCG TCGAGAGCAA GTGGGATCAG CACGAATTCG TCCTGTCACA AAGTGGGGTT GTTGGTCTAT CCCGTCACGA CGGCTTTTAC TCGCAAGCCG CCGAAGAGAT TTCTGACAAC GAAGAACCCC ACGAGCCGTT GTTCTCGGGC AAAGCTGCGG GTGCGCAAGT ATTTGGTCGC GCACAGCGCA TTCTAGAGAA AGAAAGCTAT TCCCAAGTCC GGGCACGTCG CCAAGCCCGT CAGCACTTGT CGAACCAGGT TGCTGTGGAG GGTGACAAGG AAAACGCTGC CGTACAGGTC TCCGTGACGG ATTCTGGTTG CAACCCCAAT GGTGCTGTGG GAAAAGACAC GCCCACGGAA TTGCCGGTAC GCACTGAAAC CGAAGTGGCG CCTTGTGACG TACTACAGCA CGTGGAGGAG CTGTTGACTA ATGGGCAGCC CAAGCGTGCC ATGGAGATCA TGGTGGAATT GTTGGAAGGT CGGTACGATG AGTTGAGCAA GGATGAACGG ATGCAGCTAC ACCAACAATG TTCAGATACT GCCGTGTTGC TAGGCATTTC CTTTTCGTAA
|
Protein sequence | MASRTHTVPS LDRCRRRRVI VDSSGSDSGD GTEVEVFPLS RPTRWLGDDD FSDTSSSLEG HLRKLDLKHS RLDDEIQRLA RASIDGAVRD TVEESFDLSI GDSSSSGSVD GPNLCEKEDP TELPLGSNWI CDRKSNEMLL RAQESDTTEV DWPDLRIPRG LFQKLFDYQK SGVQWMGTLH QVGIGGVLGD DMGMGKTYMA LTFLGGLMRT GVIRNALIVS PVSVLRSWEK EAQNVLTQCV HDVRIAVLSS TQSQQRDRIL LKALEDESSN YLIITSYGQV RSATLSFVQS DCCFDYVVLD EGHQIKNPTS ATSRACRRIC RSRETHRLLL TGTPILNNLK ELWALFDWAT SGQILNKLKT FTNYFARPIE DARNKNATTH AIKLGQRVNK ELQEKLKPYF LQRLKVDFLI DKLPSKNELV VWTHLSSKQR TMYSDFVDSK ESVVSSILSG ETRSPLEAVT WLKKLCGHPI LAEELAINVG RLLATASPDD LVQQSAKLCI LLSLIENFRQ NGHRTLIFSQ STKMLDIIEK TLLSEGVELL RIDGSSKEQD RQRFVDDFNS NTSTTDAMLL STKAAGVGLT LVGADRVIIY DPSWYVSEAR FALQEQRHLE CAIAVTLLLF SLLSNRTPAE DSQAVDRCYR IGQTRDVVVY RLIAAGTVEE KMYEKQVHKD GIRRTVFTED TSVERYFDKL ELRKLFALGA PGVCEVMEKV QKATQGVESK WDQHEFVLSQ SGVVGLSRHD GFYSQAAEEI SDNEEPHEPL FSGKAAGAQV FGRAQRILEK ESYSQVRARR QARQHLSNQV AVEGDKENAA VQVSVTDSGC NPNGAVGKDT PTELPVRTET EVAPCDVLQH VEELLTNGQP KRAMEIMVEL LEGRYDELSK DERMQLHQQC SDTAVLLGIS FS
|
| |