Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40779 |
Symbol | |
ID | 7198544 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 373434 |
End bp | 375597 |
Gene Length | 2164 bp |
Protein Length | 685 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184698 |
Protein GI | 219129023 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGGGA CACACGCTAG TACCGCGACG GTACGACTAC TCCGGTGTGC GACGGTACCA CCAGTCGGTC GATCGTCACC TTCTTTGAAA CGGCGGTACC AATCGTACGC TCCCCGAGTG GTGGCGGCGA CTACGGCAAC GGCAGATATT CTGCGGAATG CCAACGGATC CTTCGATAGT CCTAATTACC GGACACTGCT GATCCGCCGG TGGGCTTCCT CCACGACTAC TACAACCAGT ACTACTGCGA CTACTACAAC GGCCACGTCC ACAACGAGTG GGGCTCCGGA TACTACTTTG TCAACTTCTG CACTCGGTCG GATAGAATCC CTAACGTCGA CGACTTCTCC ATTGGCCCTG ATCGAACCCA TGTCCATGCC ACGTATTTAC CGCTATTTGA ATTTAGCAAC GTACCAACGA GATGAGCTGG AAAGCGTGTT TGACAGAATA CGAAACGGAT ACGCTCTCCA AACTACACAC GACCAAAAGA CTGCCGTCGG GACTGGGCCC GAAGCCACGG ACTCCGACTC GGTCGATCCA GAGGAAACCA TTACGGACTC TCAGATCCAG CGGTATCTAC TGTCGCGCAT TTACGAGCTC GAAGAAGAAA GTGACGAAGT CATTGAAGAA GGTCCAGTTA CTCAGACCTT GCGCGAACAG TACGTACAGC ACGAAAGTCA ACGCTTTCTC CGAGCCTTTG CGGATTACGC GACGAGGGCA CCCGGGAATG GTACGCTTCT GACCACAACG ACCATCAACA AACCCGAATT TTGTGACCTT TTGACAACCA AAGCCAGTCA AGTTGATCTA CAGCGAACCT GGCCCATTAC GGTGAGTATG CTCTTGGTAG GCTCGTCTGT GGGAGTCATC ACGCCCGCCA TGCCTTTTGT TGTTGAGCAA TTGAGTTTGA CGGCGAGTCA GTACGGTATG GTGGTCTCGG CCTTTGGATT GGCCAAAATG CTGGGCAACA TCCCGTCCGC GATTGCCGTG GAACGACACG GACGGAAACC CTTCATGACC TGGAGTCTAC TCATCATTGC CTGCGGCGTG GGCGGCATTG GTTTAGCCAA CAGTTTTGAA GAACTCTATA TTTGTCGATT ACTGACGGGT ACGTGACAAA AGGGTCGCGG CAACCTCCTT TATTTGTTTC TAGTTTACAC ACAACACTCA TCGCTAGTAC GTTTCACATC TTGCCTCTGA CTGTGCGACT GCAGGTCTTG GTGTGAGTTT CTTGTCCACG GCCGGCACGC TCATGATTTC AGACTTGTCC ACGCCGTTGA ATCGCGCGTC CACGTACGCG CCGATTATGA GCGCCTTTTC GGCAGGCACC GCACTCGGAC CAGCCCTCGG TGGGATACTA GTCGACCAGG TGGGCCTGCA TCCAACATTT TACATGGTCG GGGTTTCGTA TTTGGGAGTC GCAGCCTTGA ATCGAGCCAT TTTGAACGAA ACCAAAACAC ATGCCGTCCA TTTTCCGTGG CAACAACGGC GGTCCGGCGA TGATGTCGCG GGCGACAGTT TGTCGAGCTC AGTCCAGGAC GCAGTGGGTC AATGGGTACC GTTGTTGCAG AATTCGTCCG TCCGCAACAT TATGATTATG AACGGGATGT ACTGGATTGC ATTGGCGGGC TCACAGATGA CATTGTTGCC GTTAATGCTG ACCAATTCTG GAGGCTTGGC CATGTCCGCG ACTCAAGTTG GTCAGGTCTA CATGTCCATG AGTCTCGTCC AAATCGTCGG CAACCCGCTA TTTGCCAAGG TTATGGATAA GACGGGCAAG GCGCCGGCGA TTGTGACGGG TTGCACATTG ATCAGTACAG CTATGGTTGG ACTCGCCTAC TGCGACGATT ATACACAATT AGCAGCAGCG CTGGGATTAT GGAGTATTGG ATCGAGCATG CTCAGTACAG CCCCTCTTGC TCACCTTTCG GATAAGGTAG ATGATGCCAA GCGGGCCCAG GCAATTGCAC TACTAAGAAC CTGCGGGGAC GTTGGATTTT TGATAGGGGC CACTGGAATC GGGGCGTTGG CCGACTGGAC TGGGAGCCTG GAAACAGCTA TGCAAAGCAG TGCCGGTTTG TTGTTCACAG CAACGGCGTG GTATGCAACC AGACAGGTAC TGGATTCACG GATAGGAGCG CCTGCGAGAA AGTCGACCTC ATAG
|
Protein sequence | MPGTHASTAT VRLLRCATVP PVGRSSPSLK RRYQSYAPRV VAATTATADI LRNANGSFDS PNYRTLLIRR WASSTTTTTS TTATTTTATS TTSGAPDTTL STSALGRIES LTSTTSPLAL IEPMSMPRIY RYLNLATYQR DELESVFDRI RNGYALQTTH DQKTAVGTGP EATDSDSVDP EETITDSQIQ RYLLSRIYEL EEESDEVIEE GPVTQTLREQ YVQHESQRFL RAFADYATRA PGNGTLLTTT TINKPEFCDL LTTKASQVDL QRTWPITVSM LLVGSSVGVI TPAMPFVVEQ LSLTASQYGM VVSAFGLAKM LGNIPSAIAV ERHGRKPFMT WSLLIIACGV GGIGLANSFE ELYICRLLTG LGVSFLSTAG TLMISDLSTP LNRASTYAPI MSAFSAGTAL GPALGGILVD QVGLHPTFYM VGVSYLGVAA LNRAILNETK THAVHFPWQQ RRSGDDVAGD SLSSSVQDAV GQWVPLLQNS SVRNIMIMNG MYWIALAGSQ MTLLPLMLTN SGGLAMSATQ VGQVYMSMSL VQIVGNPLFA KVMDKTGKAP AIVTGCTLIS TAMVGLAYCD DYTQLAAALG LWSIGSSMLS TAPLAHLSDK VDDAKRAQAI ALLRTCGDVG FLIGATGIGA LADWTGSLET AMQSSAGLLF TATAWYATRQ VLDSRIGAPA RKSTS
|
| |