Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46903 |
Symbol | |
ID | 7204445 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | - |
Start bp | 757373 |
End bp | 759600 |
Gene Length | 2228 bp |
Protein Length | 682 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185946 |
Protein GI | 219121444 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCCT TTGCAACGTT AGAGTCTGCC TCCCGGAATC TATGGACTAG ATCCACGGTC TCGCGTACAG ATGGTGTATT GCGTAGACGT TCGACTCGGT CGACTCTGGC GAGGCTGTCT TCCCTACCGC ATGGGTCTGG TGCAACGGAG CATCGGCCGC CTCCGAAATG GCTTGTCAAT CGGCGAAAAG TGAACAGGCA AACTTGGTCG TCGGCTTGGC AAGCGTTTTC GTCGGCTGCC CAACCCACTG ATTCAGACGA TGTCAATTTA TCAAAAGACT CCGACTCTGC ACTGAAGACG CCGATAGATT TTAGCAAGGA AGACGAGCAC GTCGAAGAGC GCAAACGCAA GCGGCTTTCT GAGGTAAGGC CCAGTGTCAC CGTGTGCAGA TGGACATTTA CGCTTGAAGA AGGCTACTGT CATTTTCACT ATTGAACTCA CCGGAATAGC TAATCGCCAC CGTGTCATTT CAAACATGCT TTCTCACTCT ACTACTCCCC TGCTCGGACC CAAGGTACGC ATTTCTGAGG TGCTTCAAGC AAAACACTCT TATCGCTGGG TGGATCCAGT TATTCCCCGA ACTGCAACAG TTCAAGAAGC GATTGTCACT ACGATTGAAG GTGGTTTGTC GGGTGCCATG GTCCTCGAAG ACGAGCACCA CAATCGAGTC TGCGGTCTCA TTACCTCCCG CGATTTACTC CGCATCATGG CTTCAGGAGT CAAGGAAGGA GACACCCCGG AAGAAATCAT GAACCGGTTG GTGGGTGATT ATATGACGCC CATTTCGCAA GTTGTTTATG GTCGTCCCGA CGAAACGGTT GGCATGTGTC GAACAATCAT GGCCAGACTG GGCATTAAAT GCTTGCCAAT CTTATCACGT GAGGGTCGGG TGGAAGGACT CGTGACGGCA AGAGACATGT CCGATTTTGG ACTTTCGGCT AAAGATCGGG GTGGCAAGAA ATCGTACCTA AATGATGTTT CTGAACGTGT CGGCTTGAGC AGTAACACCA GTATGGCGGA ACCACCAATT TACCTGAAAG CACAGTTGGC ATTGGGACCC ACACCTCTGT TTATCAATGT AGGCGTCGCC GAATTGCCGC ATCCATTTAA AACAGCAGAT GGTGGTCTCG TTCAAGGAAT GCGAGGTAAG TCCTTTTCCG GTAAACGACA CAGTCCACCC GAATGAATCC AGACCTAACA GGGTTTACTC CAACCCAGAC CTTGGATTTA ACGATCTATC TACGGATCCC ACTTTGTCGG AAGACTCATG TTTTGTCACT AACGTCAAAC TGCCGGATGG AAAGAAAAAG ACATTGCGTG ACTTTACGTA CATGGGAGTT GCCGACGGTG TTGGAAGTTG GCGTGAGTAC GGCGTAGATC CGCGACTTTT TGCAAGAAGA CTGATGGAGG AATGTGAAAA TATCCTTCTT GAGGCACAGC GAAACGGTCA AATGGACGGC AATAACTTTC GACAAGTCAC GGCCCCCTCC GATATTATGG CACAAGCATT CGAACGGGTG AAGGCGGAAA ATGTCATTGG ATCAAGTACA GCCTGCATAG GAGTCTTTGA TCAGATTCGG CATCAGCTCC ACTTTAGCAA CTTGGGCGAT TCTGGGATCA TTGTTCTGCG CCACATTGAT TCGGATGTGG CGGGTTCGTT GAAGCGTGAT CGGGTGACAC CGAGGACCGA AAGAACGTCT GATATAAGGG TAGCTTTTGT AAGTCAGCAG CAACTGAAAT CCTTCAACCA TCCATTTCAA ATTGGCTGGA CGGGCGAGGA ACTGAAAGAA GGGGAAAGCT CATCCTTTCG GAACGCAGGC GAATCGTGTA CATCTTCCAT TCATTTACGA CGTGGCGATG TTGTTATTAT GGCGACGGAT GGTTTATTCG ACAACGTGGA GTTGGACGAC ATTTGTACCA TGGTGCTGGA GTGGGAGCAG CAGAATGGTT TTGTCCGTGC TGGCGATACC CAGGCTCGCG AAAAGCGATG GCAGATGGGC AATTCGTTGA CTCTTTTGTC AGCTGGCCGA ATAAATGATT TGGCCCAGCG ACTGGTTAAG AAAGCTCGTG AAAATTCCTT GGACTCGTCC CTGGATTCTC CTTTTGCCAT TCTGGCAAAG GAAAATGACA TTATGTGGTC AGGTGGAATG CCAGACGATT GCATTGTGAT TGCAATGCAT GTTGTAGGAC GAGACGCGAA CGATACAATG GACTCCACAA AGATGTAA
|
Protein sequence | MAAFATLESA SRNLWTRSTV SRTDGVLRRR STRSTLARLS SLPHGSGATE HRPPPKWLVN RRKVNRQTWS SAWQAFSSAA QPTDSDDVNL SKDSDSALKT PIDFSKEDEH VEERKRKRLS EVRISEVLQA KHSYRWVDPV IPRTATVQEA IVTTIEGGLS GAMVLEDEHH NRVCGLITSR DLLRIMASGV KEGDTPEEIM NRLVGDYMTP ISQVVYGRPD ETVGMCRTIM ARLGIKCLPI LSREGRVEGL VTARDMSDFG LSAKDRGGKK SYLNDVSERV GLSSNTSMAE PPIYLKAQLA LGPTPLFINV GVAELPHPFK TADGGLVQGM RVHPNESRPN RVYSNPDLGF NDLSTDPTLS EDSCFVTNVK LPDGKKKTLR DFTYMGVADG VGSWREYGVD PRLFARRLME ECENILLEAQ RNGQMDGNNF RQVTAPSDIM AQAFERVKAE NVIGSSTACI GVFDQIRHQL HFSNLGDSGI IVLRHIDSDV AGSLKRDRVT PRTERTSDIR VAFVSQQQLK SFNHPFQIGW TGEELKEGES SSFRNAGESC TSSIHLRRGD VVIMATDGLF DNVELDDICT MVLEWEQQNG FVRAGDTQAR EKRWQMGNSL TLLSAGRIND LAQRLVKKAR ENSLDSSLDS PFAILAKEND IMWSGGMPDD CIVIAMHVVG RDANDTMDST KM
|
| |