Gene Information |
Locus tag | PHATRDRAFT_49415 |
Symbol | |
ID | 7195793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 197978 |
End bp | 200613 |
Gene Length | 2636 bp |
Protein Length | 586 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184080 |
Protein GI | 219127725 |
COG category | |
COG ID | |
TIGRFAM ID | |
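
The reported Gene Length can be cross-checked against Start bp and End bp. The record does not state its coordinate convention, so the sketch below assumes 1-based, inclusive positions; `span_length` is a hypothetical helper name, not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Values copied from the Start bp / End bp / Gene Length fields above.
gene_length = span_length(197978, 200613)
print(gene_length == 2636)
```

Under that assumption the span matches the 2636 bp listed in the Gene Length field.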
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTCTTCGTA CCATGACGAC CGCAGCCTTT GGCTATCTAT CTCTCTTTCG AAAATCAGCC ATGGCGTTTG GGTCTCTCCC AGTATCACGT CTTGCGTTTA CAGAATCGGC GAGGCACATG GCCACCAACA CCAGACCTCT CCATCCGCCA ATGGGCATTC CAGGATCTTT ACGGCATTTT ACAAGTCTCC GAGCCTCGTC GAGCGACAAC GTCAAGATCG AAACAACAAA ATCTCCAGTT CCCATAACGC TCCTTAGTGG ATTTCTGGGA ACCGGGAAGA CGACAGCGTT GAAGCACTTG CTCGAAACGA ACGACAACAA AAAAATCGGG GTCATCGTCA ACGACGTTGC CGCCGTCAAT ATTGACGCCA AGCTGATTCA GTCACAGAGC TCGGATATGG TGGAACTGCA AAATGGATGT GCATGCTGCT CCTTGGCGGA TGAACTGTTC TTCTCGGTGG AGAAGATCTT GTTGGGTCGC GACCTCGATG CTATTGTCGT GGAACTTTCC GGAGTCGCGG ACCCAATGGC CATTCGAAAC AATTGGAAAA TGGCCCCATC GGAAGTTCGC GACATGGCAG ATATTGCACG AGTCGTGACG TTGGTAGACT CACAAACCTT TGGGACCGAC TACATGACGT GGGATACTGC CGCCGACCGA CCGGGATGGA CCAACCCAGT CGATCCGTGT GCAGGTAACG CCAAGGTTGC CGAGCTCTTG GCCGAACAGG TTGAAGCCGC CAACCTTGTA CTCATCAATA AATGCGATCT TTCAAATGCG GAAGAAGTGC TGGTAGCAGA AAAGGTCACG CGGGCACTGA ACGGCAAAAC CAATGTCGAA AAAGTAGTCT TTGGAAAGAT TGCCCCGAAG CTGATACTTG GAACGGTAGA AGAGATTACC GCAGCCTGCA CGGACCCGGC TTGCGACGAC GAATCACACA GCCATTCACA CACCCAGGAT CACGGTTGCA TGGAATCGGG ATGCAACGAT AGCTCGCATT CACACGCGCA CGATCACGCG TGCGTGGATG CCGGCTGCAC CGACGTGTCA CACGAACACA GCCATGCGCA TTCGGAGAGT ACCTGTGCCG ATCCAGATTG TACCGATGCA ACTCATGAGC ACACTCATTC CCATAGTACG TCTACGGATC AGTTGGGTAT TGTCAACTTC GTCTATAAGG CCGCTGTTCC CTTCGAACCG AAGCGATTGA TGACGATGTT GGAGACGTGG CCCATACCTC TTAAGGATAC ACTTGATCTA GGTTTCTTGC AAGAAGAGCA GACCAAAGTC TTGTTCGAGG ACGGTATGGA CGAAAGCCCT TTTAGCGGCG TGCTGCGGAG CAAGGGATTT TGCTGGTTCG GGCCGTCGAA ATGGAGTGGG GCCAACAGTG ATGCATGGCG ACACGAAACT GCCATGTACT GGTCACATGC CGGTAAACAC TTTAGCATCA CGTCCGGTGG CAAATGGTGG GGAACCATGC CACGGGAAAA GATGACAAAA TTCTTTGATG AAAATATGGC AGAGTTTGAT CGCATTGTGC GGGATGATTT TGTCTCAGAG GAGTTCGGCG ACCGTCGGCA GGAGATTGTA TTTATAGGAA TCGGATTGAA CGAAAACGAG ATACGAGCGG CTATGGACGA ATGTCTAATG ACCGAAAGTG AAATGGCCAA GTACAGGCAA AATCTGCAGA ATCTTTTGGC CACAACTATG GCGACATCGT CGGGTCCGAG TCTCTTTGAT GTGGGTACGA TTGACCATGC TGATACGAAG TAAACTTAGA GTGTATGCAT TGGATGTAAT 
TCGATTTTCT AAAATTTACA GCGATTCTCC TTTTTCGTAT GCATTAATAC AACTTACACT CTCTAATGTA GCATCTGCTT TCAAAAATGA TTAGTTTCGA CAATAAATAC CTCCCGTTTA CAACCCTTTC GGATCAGATA TATCCGCCGA GCTACTGCCC GCACTAGCTC GTCGCAAGCC TCGAAAGATA CTGGGAACAA AAGATCGTTT ATCTGAGCTG TCGGATTTTC GACGGCTGAG AAAGGAGATA CGCCGCTGAG GCTCTTCGTC GCTCTCTTCG TCGATGCTCA TGCTGCTGAC ATCGGACCGT TTTCTTCCAA AAAACAACGA CATCCGTTGT GCCGTAGGGG TTTTTATATC TTCGCTGCTT TCCAACGGCT TCTTGCGTGG AAAAGCAAAG GGCATCTTAG AACTAAAGTG ATCGCTTTCA TCACTATCTC TGTCGGCAAT TCCTGGTATG CCAGGAAATA TCGACTTCGG CCAATCGCCT TTCCCTTCAA TTCCAGCCTT GCGACGTCGT TCTCTCATGC CTTCAACTAG GTCCTGTGCC GACATCTCGT CCACGTCATC GCACTCATGG CTGTTCTCGG CTATAACAAC AGGGAATTCA GGCGACTGAA ACGACTCTGA GCTCCTGTTG AAGGATTCGT TGCCGCTGAA CGAGCCGTTG TACGAGCTGT GACCATTCCA AGAAGTATGC CCATGCTTTT CGAAGCTGTT TTTCCTGTTA ATTTTTTCTT GACGATCGGA AAATGAGCTG GATTTGCCGA TCGAACCGAT TGATAAACGT TGACGAGCGG CTTGGAAGCC CTCTTGGAGT TGTTTCGAAG CGGACT
|
Protein sequence | MTTAAFGYLS LFRKSAMAFG SLPVSRLAFT ESARHMATNT RPLHPPMGIP GSLRHFTSLR ASSSDNVKIE TTKSPVPITL LSGFLGTGKT TALKHLLETN DNKKIGVIVN DVAAVNIDAK LIQSQSSDMV ELQNGCACCS LADELFFSVE KILLGRDLDA IVVELSGVAD PMAIRNNWKM APSEVRDMAD IARVVTLVDS QTFGTDYMTW DTAADRPGWT NPVDPCAGNA KVAELLAEQV EAANLVLINK CDLSNAEEVL VAEKVTRALN GKTNVEKVVF GKIAPKLILG TVEEITAACT DPACDDESHS HSHTQDHGCM ESGCNDSSHS HAHDHACVDA GCTDVSHEHS HAHSESTCAD PDCTDATHEH THSHSTSTDQ LGIVNFVYKA AVPFEPKRLM TMLETWPIPL KDTLDLGFLQ EEQTKVLFED GMDESPFSGV LRSKGFCWFG PSKWSGANSD AWRHETAMYW SHAGKHFSIT SGGKWWGTMP REKMTKFFDE NMAEFDRIVR DDFVSEEFGD RRQEIVFIGI GLNENEIRAA MDECLMTESE MAKYRQNLQN LLATTMATSS GPSLFDVGTI DHADTK
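
The space-grouped sequences above can be compared directly once the spacing is removed. The sketch below translates the first 50 bases of the gene sequence from its first ATG and compares the result with the first ten residues of the protein sequence. The Translation table field is blank in this record, so the standard genetic code is an assumption here, and the helper names (`clean`, `translate_from_first_atg`) are hypothetical.

```python
from typing import Dict

# Standard genetic code (ASSUMED; the record leaves "Translation table" blank).
# Codons are ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE: Dict[str, str] = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the 10-character grouping spaces and normalise case."""
    return "".join(seq.split()).upper()


def translate_from_first_atg(dna: str, max_codons: int) -> str:
    """Translate up to max_codons codons starting at the first ATG."""
    dna = clean(dna)
    start = dna.find("ATG")
    if start < 0:
        return ""
    residues = []
    for pos in range(start, len(dna) - 2, 3):
        if len(residues) >= max_codons:
            break
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X for unknown codons
        if aa == "*":  # stop codon
            break
        residues.append(aa)
    return "".join(residues)


# First five blocks of the gene sequence and first block of the protein
# sequence, copied from the record above.
gene_prefix = "CTTCTTCGTA CCATGACGAC CGCAGCCTTT GGCTATCTAT CTCTCTTTCG"
protein_prefix = "MTTAAFGYLS"

translated = translate_from_first_atg(gene_prefix, 10)
print(translated == clean(protein_prefix))
```

The same functions can be applied to the full sequences. The listed gene sequence begins 12 bases upstream of the first ATG, so it is not a bare coding sequence.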