Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45250 |
Symbol | |
ID | 7200120 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 608518 |
End bp | 610821 |
Gene Length | 2304 bp |
Protein Length | 743 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179468 |
Protein GI | 219117347 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.325781 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAAAAAGAA CCATCAAAGA ACAAGCTATC TGTAAATGCG CGTACCATGA AGAGACTGCG GGGGAAGAGA ATAACTTTAC TGCTTGCCAG TGTGATTCCT TTGCTTGCGG TTGACGATGT TCCAATGTGG CAGTCCACGA GGGACTCCGA TGAGGAACAT CCTGCTGATC AGGAGAAGGA ACTGTCTTGT AGGGAACAAA CCCAAATCGA CCGTGGAACA AAGCAATTCC GCGAACCGTT GCACAGACCT AACGGCGACC GGGACTCTGC ACTGCTGCGG CAGTCACTAA ATCCGGATAT TTTATTTGAA GGAAGCGAAT TCGGTCAACA CTACAACATT GTAGTGCCGT TTCGGATGGG ATATCCCCAT AAACTTGTAG GATCATACGT TGAAAATCGT TACAGGCCAC CAGAAAACGA TACGTCATTG ACCTCTATTA GCACCAGATT GCGAATGGAC GTGGTCCGCG ATAAATTTGT GTCAAAAGAC GAAAAAAGCG CGGCAGAAAT TGGTGTTGTA GAAAGGAGTA GAGAGACAAT AAAGAATGAG AAAGTATCCG CATATATCGA TGACTCGCGG GCTGGCGAAG ATCTCCCAGG ACCAGAAACG ATGCATGCCT CTTCGGGTGA CTCCAAAGGT GATAAATTTT CTGAAGAGGA CGGACTGAAA CGTGTCTTGG TAGACTATGC CAGCAAATCT GCTGGTGCTT TAATTTTGGA GAAATCGTCG AGTTGGAACG GGATTTCCAA CGTCCTGAAC GGTGACAAGG ACAAGTACGC AATTATTCCC TGTGAAGAGC CCCAAAAGTC CGTCGTCATT GGACTCTCCG AAGACATTCT TGTGAAACAA ATAGTACTTT CCTATTATGA ACGATACAGT TCGCATATTG GAACCTTTCA AGTGATGGGT TCGCCCCAGA CAATGGGTAA TTGGGTTGAT TTGGGTACAT ACACATCACC ACGAGGGAAT GGCAAACACG CATTTGATTT ACACGAGCCG TCTTGGGCAC GGTATTTGAA GTTTCGATTT GTATCGCATT ACGGAGATGA GCATTACTGC ACCGTGAGTC AGATCAGCGT CCATGGGAGT ACCATGTTGC AAGGTTTTCA CGAGCAATGG GCCGAAACGG TTGAAGAGCA GCCAAACGAC AAAAACGAGA GGGACGTAGA CGTATCAGGG TCGAAAATAG ATCCTACGTT TTCGGCTACG GATCAAGAAA ATGGCAATGA CGGCAGCGTC CAAGGGACTG TATCCACTAT TGGACAATGC TACACCAGAT TGGATGCTGT ATGTCAGATG GATTACAGTT TTGAACGGAG TGCATTTTTG TTCGCATCCG GTAGGAGCTC TACACCTGAC TTTGATTTAT TGAGTGCGCT TTCATCCGCG TCCTTTTGTC AACTTGGCAG GCAATCGGCA AGAACAAATT ATTCTCATTT TGTTGAGCTA GGCCGTCGTG CGCTTACCAT CTCTCCGAAA CGTGGTCGTA GTACCAAGAG TAAATTTGTT GCCGATTTAT CCGATCAAGC TTTGTTTCAC TCACTTACGG AATCTGTTGT TGTTAAACAC ATTCAGAGTC TCATCTCTCG AACGACAGGG ATAGACATAC ATGTGGAACG TTTTGGTGTA TTGGCAACAG TTGATAGGAC ACCCGACAGA ATTTCTGTCG ACGATTCGAA TCCCCCTGCA ACTGTTTCAT CAGCTTCTGG GGTGATCGCC GGCACCAAGC TGGTTACGTC TGAAGTCGAA GCCATTGAGC GTATCACGGA TGAGATTGAA TCTCAGCCGT TATTGCAAGC CATTCAACAG ATGGAAGAAA AGATTCCTTT CGATACCGCG TTTCATGCAT CTGGATTTTC ATGGAGCAAG ATATTGGAAC AGCTTCCTAG CGCAGCTTGT CTGGAAAAAC TCGATTTCGC TGATTTTAGA TCCGGCAAAA AATTGAACTT GCGCAATGGG GGGCCGGGGT CGCACGGCAA CGCCCAAGGC GGAGGTGGTA TGGAGCCAAT CTTTAAAAAG TTCACTGACG AGATAAAAGC GCTGCAGACT AGTGTTTCGA TTCATGATCA ATTTTCCAAG GCACTGGCCT CTTGTTACCA ACAAGTATTC CTGGAATTAT TGGTGGAAAT GGATGTCAAA CGCAGTGACA TTGATAACCG GATTTTCCAG TTGGAGAGGA AAATGCAGAG TGGTTTATTT TTTTTCTCTG CAGTATCTCA ATGGATGTCT CCAATTATTG GTGGTGTTGT AACGATTTCA AAGCTACCGA TATCGCTTTC TTTCCAAAAC CGCACAATCA TTGA
|
Protein sequence | MKRLRGKRIT LLLASVIPLL AVDDVPMWQS TRDSDEEHPA DQEKELSCRE QTQIDRGTKQ FREPLHRPNG DRDSALLRQS LNPDILFEGS EFGQHYNIVV PFRMGYPHKL VGSYVENRYR PPENDTSLTS ISTRLRMDVV RDKFVSKDEK SAAEIGVVER SRETIKNEKV SAYIDDSRAG EDLPGPETMH ASSGDSKGDK FSEEDGLKRV LVDYASKSAG ALILEKSSSW NGISNVLNGD KDKYAIIPCE EPQKSVVIGL SEDILVKQIV LSYYERYSSH IGTFQVMGSP QTMGNWVDLG TYTSPRGNGK HAFDLHEPSW ARYLKFRFVS HYGDEHYCTV SQISVHGSTM LQGFHEQWAE TVEEQPNDKN ERDVDVSGSK IDPTFSATDQ ENGNDGSVQG TVSTIGQCYT RLDAVCQMDY SFERSAFLFA SGRSSTPDFD LLSALSSASF CQLGRQSART NYSHFVELGR RALTISPKRG RSTKSKFVAD LSDQALFHSL TESVVVKHIQ SLISRTTGID IHVERFGVLA TVDRTPDRIS VDDSNPPATV SSASGVIAGT KLVTSEVEAI ERITDEIESQ PLLQAIQQME EKIPFDTAFH ASGFSWSKIL EQLPSAACLE KLDFADFRSG KKLNLRNGGP GSHGNAQGGG GMEPIFKKFT DEIKALQTSV SIHDQFSKAL ASCYQQVFLE LLVEMDVKRI GEENAEWFIF FLCSISMDVS NYWWCCNDFK ATDIAFFPKP HNH
|
| |