Gene Information |
Locus tag | PHATRDRAFT_44773 |
Symbol | |
ID | 7199889 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 202282 |
End bp | 206306 |
Gene Length | 4025 bp |
Protein Length | 1153 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178950 |
Protein GI | 219116310 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.749806 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTCCT ATACGGATCG ACGTGACTAC ACGTCGGGTG GTCATCGTGG AGGTGGCGGG GGATACCGCG GTTCTGGTGG ATACGGGGGC GGTGGTGCAC GGGGTGGAGG ACGCGGTGGT GGGTATCGCG GAGGACGGGG TGGATATCAC CGTGGAGGAC GTCAGGCCCA TACCGGAACA TCTCCCTACT CGCGTCACAG TGGCGGGACT ACGAACGGTG GACCCCGACG CTCCGGAAAT CGTTTCGCCG TGGAATCCAC GCGGCGAGAT CCTCAGCAAG AACTCTTACG CAATCTACTC GTCCTTTTGC ATCAAGTAGG AGATCTGCAG TCCACGAATC GCAACAACAA CAACAACAAC AACAGTAACA GTAACGACAA TACCTCCCCG GTGCGTACAA TAGTCACAAC GCAGAAGCAA AACATTCAAG GATTGACGCA AGTCTTGTGT GGCAACAATC CCGAACTCTT TCTCCAACAC GAGGCCGAAG ACAGTCCCGC AAACATTCAG TACGCGGAGC AATTGGCCGG ACCGTTGGCG GCGGGATTGG TCCACGCTAT TATGGCGGCG CCACTGCAAA CACCCTGCTA CGTGGGGTTG ACTCTGGCGG TTCACGTCAC GGCGCACGCC CGGGACGCAA CCTTGTTTGG TGGATTCGCA TCCCGCTGTG TCCGCTACGC CACCAGGGCC ATGGCCCGTG ATCTCGACGC TCTGCTGCTA GATTCGGACC CGAATAGCAG TACTAGTAGC ACTACTCATA GCGGTATTGG AGACAGTGGC AGCGGTAGTA CTACTACTCG ACAGTCCCGC GTCCCATCCC ATCGATTGGT GGTTCGGGTG CGTATGTTGC TCCGCTACCT GGTTCTACTG GCACGCGCGG GTATTCTGCA ACTCGACCAC GATACCGCAT ACGCCGTCAC GGGCCCCGCA CACTCCAACC CGGTCAGTGT CTTGGGGCTG TTACAAACAA TGGTACAGGC GGCACGGCAA GCCAAGGAGC GTGACGGCAA CCAGTCCGTG GCCGTAGTCT TGGCGTCGCT CGTGTTGAGT ACGGTACCCT ACCTGGCGAC CCTTCTTCCT TCCGAGACGG TCCGCCAGAC TTTGGTCCAA CCATTGGAAG CAAGCGTCCA ATCATACCGG TCCACGTTTG CACCAGGCTT TGGTTGTACC GCAATTCTTC TTCGAGAGGA ACAAATTGAG GATATTGGGG CTTCCATTGA AGAAGAAGAC GACGAAGAGG AGGATGACGA CGAAGAAGAA GAGGGGTCGG GACAAGTGTG CGACAACTTT CAAGACCTGA TGCGGACCGT ACAGTACTGG GTAAAGCAAG AGGACACTAC CGTTGCGTCA CGCCTGGCTC TCTTTCGCGA CGCTCCGTGG GAAGGCTTGG AGGCAAAAGT CTCCTCGCTT ACTTCCACAA CCTTGGACGC CAGCGAGATT GAGGAATCCG CTCACACTCC GTTAGTCTAT ACGGAGACGC CCTTGACCAT TCCAATATTC ACGGATTGCC AGTCCCTGTC GGCTTTGGTG GGTGGTCTTC ACTCGGCGCC CGAGGACGAC AGTCTGTTCT GGGCCGAAAT TGATCTCGAC GGAATTTTTG TGGGTCGCTT GCCCATTTTC GGACCTCCTC CCGAAGTTGC GGACAATGAC GACGACGAAG ACGACGACCA AGACATGGAG GCAGCAGCAC CGGTTAACGA ACGTCTACAA GCGTATCGCT CCGGATACGG CATGGTGGAT CGCTACTTTA TACACGAAGC CATCCGCGAT TGCTTGACTA GCCACGAAAG TTACGTGACG 
GATACCGGTG TCGAAATTGG CAACGCCAAG ACGGCTGCGG AGCAGGTGTG GTCTATCATT CAAATGGTGA CGGGTGACAG CACCAACGGG TTGGAATATG CAGTTCTGGA AGCGATCTTT TCCTTAATTG TACAATCAAA TGCGGTCTCA TCGTTCCGTT TTGTATACCT TTCGCGTGTG TTATTGGAAC TAACGAGGCT GGAACCCGCC ATCATGTCGC CCGCCATTGC CATTGCGGTC TCGACTTTAT TTCAGGACTA CATGCCAACT CTGGTCCCCA TGGCACGATA CAATCTGAGT CGATGGTTCG CCTTTCACTT GGTTAACACC GATTACCAGT GGCCCGCAGC GTACTGGAAA CATTGGGAAC CCTTTGTGCA GTACGGTTGG AAGAATAGTC GCGGAGCTTT TGTGAAGGGT GCACTGGCAA TTCTGCTGGA AAACGAAAGC GATGCGGGTA TGTTGGTGAA GGAATGCCTG CCCAAGAACA GCCTTTTGGT CGACCATTTA CTTCCCGGGC TCACGACGTC GGCTTTACCA GACGACAGCG CTTTGGCGTC CTTTGCAAAG GATGTCTCTT CACGTATATG GGATAATCGC GAGGACGAAC ATTCGCTGTT GCAATATATT GTAGGGGACG AACTCTCCGA AAGTGTGACG ACCGACTTGG CTGGTCTGCC CGTAGGAGAG AGGACGTGGT GGCGGACACA TGCCGTAGCT CGAGCTTTGC TGTCGGTGAC CAAGCAAGAG CACACGTATT TGGCCCTATC TATCGCACAA GAGCGTACCG CCAACGAGGA CGCAATGGAT GCGACCATTG AAGCACCTGC AGACATTTTG TCCTTGCTGC TAGATGCATT GGTGCAGTAC ACGCCACTCC TGCTGGGAGT TCTCGCCAAA GATCTCGACG GTCAAGCGAG CGCGGATGCT GCTCCGGTTC AGGGCGAGTT ACATGTCCTC CAAGAAATAT CGAATCATGT ATTGTATTCT CGTACCACAC TGGATGCAGT TGTGAGTTCC CTGCTTCACC ACAAGGTTGT TTCACCCGAT GCCGTAGTTC GCTGGTCCTT GGGCGATATG GGACAGGAGA CTCCCGGGGT CTTGGCAATT CACTGGTGGG ATATGTCAAC TATGGCAATA CACTACGGGC TCACCAATCT GTTTGCGGTG ACACCTGCTT CAAACCAGGG CGAGATGCAA GTCGAAGACA ATCAAGACGA AAGCCCGACC ATGAAAAATG CACGTTTGTT TCTGGAGCCG ATCATCGAGT ACACTGTGGG TCGCATTTGC CACCTTTTAT CGTCGGCGAG TCATACTGTA GAAAGCAGCA AACTGACAAA TACACAGGTT GATCTGGTTG AAGGCTTCAA GTGTTTGGTT CGGCAAACAA AAAGGTGTTT GCTGCATGTA CTGCTCAGCT CTTCCGTCAT TGGGCAACAG CTGCGGCCCG CTACGGTGCG AAAGTACCTC ACGAATTCTT GTCTTTCGGG TTCCAACCTA CTGTCTATGT GTCAGGAAAC CGACGGATCG TCGGCCATGA ATACGTTCCG GACAAGTTTG AAATTTATGG CATAATACTT CATTCCAGGA GTTTTGTAAT GTCAAGGACG CTGTCTTACT GTTAATGATC GATCGCTGGT TTGCTGTAAA TTGCTGATCG TTTTGTTTCG TAGCTAGAGT GCTTCTAGTT CCAGTAAGAC CCCGTCTTCG ACGTCTCTAC ATCCGTATTA GTCGGCTGTA ATATTCTTGC CTTTACATAA ATTACCTACT GCGCTGCGTA CGTACCTGAC TTCAACCCAA TCTGCCAGTA 
TGCTTTCTGA GACCGACGTT AATCCTCGCG TCGCCCACCG ACTCCGTCAG CTTGTTCACC GAGCGCAGCC GCCGACGCCG CCATCGCCTT TGCCGTTGCG TCTCTCCCCG AACTCCTCCG CTCATCGTCT AAGCACAGCG CTACAAAAGT TTTCGAACTC TGCAAGCGCC TCTCCGAACG CAAAGCCACC GCTTCAAAAT TCAAAGATGA CTCCTCCCTT CCGAAATCGG TGCGGATCCG ACCGTCTCCG CACGGTTCGC GGGCAGCCAT GAAGACCGCG GAATTTACTG AAGCTGCTCA AAGTATTCAA GAACTTCACC GGACTTACCG CCTTGGGATG AAAGCTCAAT TCTTGAAACT TGTCAATCTT GAAGTCAAGG TTCTCCGTGA CGAACTGTCA ACCCTTTTTG CCTAG
Protein sequence | MSSYTDRRDY TSGGHRGGGG GYRGSGGYGG GGARGGGRGG GYRGGRGGYH RGGRQAHTGT SPYSRHSGGT TNGGPRRSGN RFAVESTRRD PQQELLRNLL VLLHQVGDLQ STNRNNNNNN NSNSNDNTSP VRTIVTTQKQ NIQGLTQVLC GNNPELFLQH EAEDSPANIQ YAEQLAGPLA AGLVHAIMAA PLQTPCYVGL TLAVHVTAHA RDATLFGGFA SRCVRYATRA MARDLDALLL DSDPNSSTSS TTHSGIGDSG SGSTTTRQSR VPSHRLVVRV RMLLRYLVLL ARAGILQLDH DTAYAVTGPA HSNPVSVLGL LQTMVQAARQ AKERDGNQSV AVVLASLVLS TVPYLATLLP SETVRQTLVQ PLEASVQSYR STFAPGFGCT AILLREEQIE DIGASIEEED DEEEDDDEEE EGSGQVCDNF QDLMRTVQYW VKQEDTTVAS RLALFRDAPW EGLEAKVSSL TSTTLDASEI EESAHTPLVY TETPLTIPIF TDCQSLSALV GGLHSAPEDD SLFWAEIDLD GIFVGRLPIF GPPPEVADND DDEDDDQDME AAAPVNERLQ AYRSGYGMVD RYFIHEAIRD CLTSHESYVT DTGVEIGNAK TAAEQVWSII QMVTGDSTNG LEYAVLEAIF SLIVQSNAVS SFRFVYLSRV LLELTRLEPA IMSPAIAIAV STLFQDYMPT LVPMARYNLS RWFAFHLVNT DYQWPAAYWK HWEPFVQYGW KNSRGAFVKG ALAILLENES DAGMLVKECL PKNSLLVDHL LPGLTTSALP DDSALASFAK DVSSRIWDNR EDEHSLLQYI VGDELSESVT TDLAGLPVGE RTWWRTHAVA RALLSVTKQE HTYLALSIAQ ERTANEDAMD ATIEAPADIL SLLLDALVQY TPLLLGVLAK DLDGQASADA APVQGELHVL QEISNHVLYS RTTLDAVVSS LLHHKVVSPD AVVRWSLGDM GQETPGVLAI HWWDMSTMAI HYGLTNLFAV TPASNQGEMQ VEDNQDESPT MKNARLFLEP IIELIWLKAS SVWFGKQKAA DAAIAFAVAS LPELLRSSSK HSATKVFELC KRLSERKATA SKFKDDSSLP KSVRIRPSPH GSRAAMKTAE FTEAAQSIQE LHRTYRLGMK AQFLKLVNLE VKVLRDELST LFA