Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32557 |
Symbol | |
ID | 7197109 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 240508 |
End bp | 242352 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177581 |
Protein GI | 219111659 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGCGG AAGACTTTAT GCTAACTTTT ATCCGAAACG CAGGTTCCAT CAAAAACAAT GTCGTCGTCA ATGGCAAAGT GCCTTGGACG GAAGCCTTTG CCCGGGATTT CTGGGGCACG CCGATGTTGG ACGTACAGTC ACACAAGAGC TTCCGACCGC TAACGACTCT TTCGTTCAAA CTCAACTGGA TTGCGGCAGA ATTCGGCACC GCCACCAATA ATGTCACCGA AACCACATCC GCTATCGCCA ATTTGCAGCC TTCTACATTC GGCTTTCACG TCGTCAACGT TCTTTTGCAC GGTCTCGTGA CGGCTCTCGT CACCGAGGCA TCCAAATTTC TGTGGGACAA CTGTAAATCT GAGGGATGTA TTGTTGGTCA ACTCCTGACG GGATTTCTGT TCGCCCTGCA TCCTGTGCAT GCCGAAGCCG TCAGCAACCT GACTAGTCGA GGTGAATTGC TCATGTCGGT CTTCTTCTTG CTGGCGTTTC TGTCGTTCGC CAATTACATT CATGCTGGTT ATACGTGGAA ACGCATGTTC TTCGTGTACA TTGTTCCATG GACTTGCATG ACGGCCTCGC TCTTTTCGAA AGAGCAAGGG GCGACAACGT TGATTGCTTT GGTCATCTAT GAGTTTGTCC AGTTCCACGG ATCCCTACAC GAGTTTTGGA TTTCTCTTGT AAAGCGACGC GAGGCGACAG CCGTGGAGTT CTGTCAAAGA ACAGCAGTCT TAGCGATGCA GACAATTGCA GTCTGTACAT GGCGGTATTG GCTCAATGGA GAAACGAGTC CAGATTTTAT TCCCGATCAG AACCCAGCAG CATTTGCCAA AGATCGTTTC ACTCGAGTCT TTTCGGTCAG CTGGGTATAC TGCCTCTATG TAAAAGATGC ACTGTATCCG CGTCATTTGT CTCCGGACTG GTCCGGTATC AGTATTGACC TCATTGAGAG ATGGGATGAC CCGAGGGTTG CTGTTGTGCT TTTGCTTTGG ACGTTTGCCG CCGCTCTCTT GGCCTCTTTG ATGTGGGAAA TGCCAATCGG GACGCGAAAA GAGTATCAAG GATTCCGCAA ATCACTGCTT ATTGGTTTCT GGGGTTTTTT ATTCACACCT TTCTTCTTGT CATCGAATTT GTTGGTCGTT ATTGGGCTGA TGAAGGCTGA TCGCGTAATT TATTTGCCGC TAATGGGGTT TTGTCTCCTG GAGGCACAAC TATTTACCAT GCTTTGTACT GCTGCTGACG AAGCTACATC ACAAACGACA AGACGATCTC GGATTGGGTA CATTCTTGTC ATGCTCCAGC TATTCCTCTT TGCCTGCAAA CTTCACGAAC GCAATCTTGC TTGGGAAAGC TCGCTGAGGC TCTGGATGCT GGCCTACGAA ACCAATTCCA AAAGTCACCA TACCATTTAC AACTGTGGAT ACGAGCTATC GTTGCAAAAG CGATACTCTG AGGCGGAAAC TGTCTTGCGT CCAATCGCCG ATCCACACGT CGACGGCCCG AGCAACACGT TCGTCTACGC AATGACACTG TACAATATGG GGCGCTGTGA TATTGCTGAA CGGTACATCG ATAAAGCCAT GGAAGTTTTG CGAGAAAAGC GATCGGAAGG TGGAGTCCGC AATCGTCCCA AGGCATTGTC GAGAGTGGAG AGTAATCTCC TAGTGGCCCG ATCGTTTTGC CATAGTAAAG ATAGTATACC AATGGCGGGA CAAATCATGT ACGAAGCCGT AAAGACAGAT CCTACCAATG AGTACGCGAT TCAACAAGCA CAAGTCATGA TGAAGCAAGT CGAAGCATAC CGAAAGTTAG AAGAGCACAA ACAACGAATA GGATTGAAAT ATTAG
|
Protein sequence | MVAEDFMLTF IRNAGSIKNN VVVNGKVPWT EAFARDFWGT PMLDVQSHKS FRPLTTLSFK LNWIAAEFGT ATNNVTETTS AIANLQPSTF GFHVVNVLLH GLVTALVTEA SKFLWDNCKS EGCIVGQLLT GFLFALHPVH AEAVSNLTSR GELLMSVFFL LAFLSFANYI HAGYTWKRMF FVYIVPWTCM TASLFSKEQG ATTLIALVIY EFVQFHGSLH EFWISLVKRR EATAVEFCQR TAVLAMQTIA VCTWRYWLNG ETSPDFIPDQ NPAAFAKDRF TRVFSVSWVY CLYVKDALYP RHLSPDWSGI SIDLIERWDD PRVAVVLLLW TFAAALLASL MWEMPIGTRK EYQGFRKSLL IGFWGFLFTP FFLSSNLLVV IGLMKADRVI YLPLMGFCLL EAQLFTMLCT AADEATSQTT RRSRIGYILV MLQLFLFACK LHERNLAWES SLRLWMLAYE TNSKSHHTIY NCGYELSLQK RYSEAETVLR PIADPHVDGP SNTFVYAMTL YNMGRCDIAE RYIDKAMEVL REKRSEGGVR NRPKALSRVE SNLLVARSFC HSKDSIPMAG QIMYEAVKTD PTNEYAIQQA QVMMKQVEAY RKLEEHKQRI GLKY
|
| |