Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_42051 |
Symbol | |
ID | 7204413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 168869 |
End bp | 170608 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185644 |
Protein GI | 219120825 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGAGTG CGAAAGCCGA GTACAAATTC TTGACGGGGC AGGACTTTGG TCCTCCACCC AAGGAAGAAA AGAAAAAGAA ACAGGACAGT CCCGCTGAAG CAGACGAGGT TCCTTCCGAA AAGAATAAGG AAAAGCGTGC AGCCAAAGCA GCAGCCAAGG CTGAAAAGGA GGCGAAAAAG GTTGCACAGC GGGAAGAGCG AGCCCGTCGT GAAGCCGAGA AGACCGCCAA ACTCGCCGGC ATTGGGCAGG ACAATTTTGG CGATGCGCCC TTGATCCAGT CGCAGATCAT CACGGACAAA GTGTGGACTC CCATTCAGAA TTTGAAACCG TCCCTCGCGG GGGAAAGTGT TCTCGTTCGG GGGCACCTGC AAACAGCTCG AGCCGTTGGG AAGGGAGCTT TTGTCCTCGT GCGTTCCAGT CTTTACTCCG TACAAGGAGT CGCCTTTGAA TCGAAAGACG ACGATGGCGC CGTCAGTTCC GCTATGATCA AGTACATGGC TGGACTGCCA GCAGAGTCGG TCGTGGACAT GCGAGGGATC GTTACGGTCC CCGACCAGCC CGTAGATTCC GCGACGCAAA AAATGGTCGA AATTAAAATT GAATCGTTTC ATTGCGTCGC CAAGGCCAAG AAAGCCTTGC CCTTTCAAAT GGAGGACGCC TGTCGACCTG ATTCCGGAAA GGAAACCGAC ATTGGTGCCT ACAATGAAGA TGATCCGGAA GTTGTCGACT CCGAAGACGG TCTCATCCAT ATCGGTCAAA AGATGCGTCT CGATTATCGT TGGATCGATT TGCGGACCCC CGCTAACCAA TCCATCTTCC GCATCGAGAG TATGGTCGGC TGCTTGTTCC GTGAGTTTTT GCTTCAACGC GGGTTTGTAG AAATTCACAC ACCTAAGCTC ATTGGAGGAG CCTCCGAAGG TGGCTCCGAC GTCTTTACGC TAGACTATTT CGGACAGTCT GCCTGTTTGG CCATGAGCCC GCAGCTCCAC AAACAGATGA CGGCCGCCTG CTCTGGCTTT GAACGCGTTT TCGAAACCGG TCCTGTATTT CGGGCCGAAA ATTCGAATAC CCGTCGGCAC CTTTGCGAAT TTACCGGACT CGATCTGGAA ATGGTCATTC ACGAGCATTA TGATGAAGTG CTGGCGGTCA TGAGCGAGCT CTTTATTTAC ATATTCGATG GCGTTAACGA GCGCTGCAAG CCGGAACTGG AACGTGTTCG GGAGCAGCAT CCGTTTGAGG ACTTGCAGTA CCTGAGCCCG ACGCTCAAAC TGACCTTTGC CGAAGGCTGC GCCTTGCTCC GTGAAGCTGG CATCGATCAA GACAATTACG AAGACTTGAG TACCGAAAAC GAAAAGAAAC TCGGCGACAT TGTCAAGCAA AAGTACGGGA CGGACTTTTT CTTCTTGGAC AAGTTTCCTT TGGCGGTGCG GCCATTCTAC ACCATGCCCG ACCCGAACGA CCCCAAACTT TCCAACAGCT ACGATTTTTT TATTCGTGGT CAAGAGATTG TGTCCGGTGC CCAGCGTGTT CACGATCCTG ACTTGATTGA AGAGCGCGCC AAGGCTTTGG GTATTGATGT CGAGAGCATC GCCGACTACG TTGAATCCTT TCGTCACGGT GCCCTACCGC ACGGTGGCGG CGGTATCGGA CTGGAGCGTG TCGTCATGTT GTTTTTGGGC CTACCCAATA TTCGCAAGGC GGCCTGGTTT CCCCGTGACC CAAAGCGTAT TTCTCCGTAA
|
Protein sequence | MLSAKAEYKF LTGQDFGPPP KEEKKKKQDS PAEADEVPSE KNKEKRAAKA AAKAEKEAKK VAQREERARR EAEKTAKLAG IGQDNFGDAP LIQSQIITDK VWTPIQNLKP SLAGESVLVR GHLQTARAVG KGAFVLVRSS LYSVQGVAFE SKDDDGAVSS AMIKYMAGLP AESVVDMRGI VTVPDQPVDS ATQKMVEIKI ESFHCVAKAK KALPFQMEDA CRPDSGKETD IGAYNEDDPE VVDSEDGLIH IGQKMRLDYR WIDLRTPANQ SIFRIESMVG CLFREFLLQR GFVEIHTPKL IGGASEGGSD VFTLDYFGQS ACLAMSPQLH KQMTAACSGF ERVFETGPVF RAENSNTRRH LCEFTGLDLE MVIHEHYDEV LAVMSELFIY IFDGVNERCK PELERVREQH PFEDLQYLSP TLKLTFAEGC ALLREAGIDQ DNYEDLSTEN EKKLGDIVKQ KYGTDFFFLD KFPLAVRPFY TMPDPNDPKL SNSYDFFIRG QEIVSGAQRV HDPDLIEERA KALGIDVESI ADYVESFRHG ALPHGGGGIG LERVVMLFLG LPNIRKAAWF PRDPKRISP
|
| |