Gene Information |
Locus tag | PHATRDRAFT_40716 |
Symbol | |
ID | 7198521 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | + |
Start bp | 220793 |
End bp | 222958 |
Gene Length | 2166 bp |
Protein Length | 542 aa |
Translation table | |
GC content | 44% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184675 |
Protein GI | 219128975 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCAGG CAGACAAGCT TTTGTCCTAT CTTGAGAACA CTCCCGAGAT CAGCTTTTGT GCTATATACG ATGACCCAGA TTCACCTCTC TTCACAGTTT ACAAGCAGAG AGCAAAAAAA GACCACAGGC ATCTACACAC AAGTATTAGA GTTAACTCCG GGGTATCTGC TGAAACGGGA ATTCTTGACA ACACTACACT AGATGCAATG GATCCCAATG GAGAACTTGA TGACTATGTT GACCGCACAC GTTGCGCATT CAAGCTCCTT GGCTCGCAAA ACATGCTGTT AGGAGTTGCC TGGACAAATA ATGAGAGCAG AAATGTATTC TCTCGTTTTT CGGAGATAAT GGTAGTAGAT GTGACGGAAG GCACAAACAA CGCCAAGCGA CCGCTATTTC TATTCTCTGG GAAGACGTCA AACCAAAACA CTTTCACAGC ACTGTGGGCT TTTTTACCAC AGCAGGCTTG TTGGGCCTTC CATTGGGTGT GGACTCGATG CATACCGCAA CTACTCCCAA AGCAAGGAAT TCAACATGCG CCTGACAATA ACAGATGGAG ATCCAAAACA GTACAGGACC TTTGTAGATG CAATACCAAC TTTCTACCCT CTTTGCAAAC ACAAGCTTTG TCACTGGCAT CTACTGTATC GCAGTAATCT CATGAAGGTG CAGACTGGAA AATGTGGAGT TAAAGCTACT ATTCTATTCC GCGTAGTTGT TCTTTGGATT GAGAGCTGGA TGACCAAAAT TGAGACACAA GAGGAATACG AACTTTCTAA AAGGCTCTTG GCTGATTGGC TTGCAACCCC CGAAGCTATT GATGTCAAAT TGGGTGGTAT GGGGCAAACT ATTGTATCGC AAATTAATGC GTACATGACA CTGTCACTTT TTCCTCATGA ACAGCGCTGG GCTAGATATC GCTATTTATA CACACAAGCA TTCAACACAT CTGCAAGCTC GTATGCCAAG GCAGAAATAG TGCTTTAAAA CGACGGGGCG ACGGGGTCAG GCCAAGCTTT TCCGTACCAA AAGCAACTCA GGTTATAAAC AAAGGGACAC AACCTAGGTC AAAGAAGAGG CATCAAAAAG CTGTTTACAA TTTAAATGCT GCCAAGACGA GAAAGCCTGC CTACTACGCA AACATTCGGG ATTTAGTGGA TTACATTCAA GATTCTCTTT CCAAAGATTT TGAAGCAGTT GCTTCATTTG TGCTCTTCTG TCCAAATGCA GACCAGTTTT GGGTCAAGCA AGCCACTCGC AAAAGCAAAA ACACGGACAT TCGGAAAATC AACGACAGTA GCTATTACAA GTACATGATT CCGCAGTTTG AACGCACACG AATTGTGGAG CTTGTTAATA TTGATGGTAC ATTCTATTTG GTGTGTAGCT GCGGAAAATT TCAGCGACAA GCTTCCCCAT GTGCCCATCT TTACAAGGTT CTTGGTCAAT CACCCACATC AACCGATGTC TCTGTACGCT GGACAAAGCA CTGGGATGTG TATTTGCACC GAAGTGGCCA CAGTGACCTG TCAAAGCATT TGGAAGACCT GTACAAACAG GAGCGACCAG GTCCAGTATT TGTTGATAGT GGTCAGTGGG TGATCGGAAA AGGTGAAAAA GGGTCAAATT TTTTCGAAAC TTCGCTTCCG TACAAGCCCC CTGTCATACG AGATTTTAAT CGATGGGCAG TGTCTTCGCA AACGACTGGA GCTGATTTGA GTGGGACCAA AAATACCACA AATATGTATT TTTCGAGTGG AATGGTGCAA GAATCAACAA GCCTGTCCAG AGAGCATGCA 
TTCCAGGATT CATTGCATTA AACTTCCACA AATTCGCAAA ATTCGGATTG TTTTGATGAT ACACAGGCAG TGACAACGGA AAGCGTATCT TCTACGAAAA GAATGTTTGG TTCAAGTGCT TTTGTGCACA ATTTCCATTT CTACCAGGAA ATGTCCAAGC TGGCAGGATT TGATATGGAA GCTGCTGAGT CGATGAATAT AGCTATGCAA GAAGCATTGG AAAAGGTACA AGCCAATGTT GCAAAAAGGG CTGGGAAGAT GGATTACACT ATAGGACCAG CCATTACAAA GGACCATGTA GGTCTAAGAT TGAAGCCAAG CTACAGTCCA AAGAAGAGGA GGAAACCAAA CTTACAAAGG AAATGA
|
Protein sequence | MTQADKLLSY LENTPEISFC AIYDDPDSPL FTHCGLFYHS RLVGPSIGCG LDAYRNYSQS KEFNMRLTIT DGDPKQYRTF VDAIPTFYPL CKHKLCHWHL LYRSNLMKVQ TGKCGVKATI LFRVVVLWIE SWMTKIETQE EYELSKRLLA DWLATPEAID VKLGGMGQTI VSQINAYMTL NSALKRRGDG VRPSFSVPKA TQVINKGTQP RSKKRHQKAV YNLNAAKTRK PAYYANIRDL VDYIQDSLSK DFEAVASFVL FCPNADQFWV KQATRKSKNT DIRKINDSSY YKYMIPQFER TRIVELVNID GTFYLVCSCG KFQRQASPCA HLYKVLGQSP TSTDVSVRWT KHWDVYLHRS GHSDLSKHLE DLYKQERPGP VFVDSGQWVI GKGEKGSNFF ETSLPYKPPV IRDFNRWAVS SQTTGADLSG TKNTTNMYFS SGMAVTTESV SSTKRMFGSS AFVHNFHFYQ EMSKLAGFDM EAAESMNIAM QEALEKVQAN VAKRAGKMDY TIGPAITKDH VGLRLKPSYS PKKRRKPNLQ RK
|
| |
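The Gene Length and GC content fields in this record can be rechecked from the Start bp and End bp coordinates and the Gene sequence field. The following is a minimal sketch, not the database's own method. It assumes the coordinates are 1-based and inclusive (consistent with 222958 - 220793 + 1 = 2166) and that the sequence is stored in whitespace-separated blocks as shown above. The function names `gene_length` and `gc_percent` are hypothetical, introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    # The record prints the sequence in space-separated blocks; join them first.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(gene_length(220793, 222958))  # 2166
```

Passing the full Gene sequence value to `gc_percent` and rounding the result gives a figure that can be compared with the GC content field. How the database itself rounds that value is not stated in the record.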