Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_46741 |
Symbol | |
ID | 7204517 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011679 |
Strand | + |
Start bp | 338185 |
End bp | 340748 |
Gene Length | 2564 bp |
Protein Length | 822 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185686 |
Protein GI | 219120909 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGCC AACGGACACT TCCCACGTCG GCTTGGACGA TACTGTGGGG AATGGGACAA TCAACCGGGG TTTATCTGGG TACCGTGCCG ACGCCAGTCC ACGCCTGGGT GCCCCCTCTC ACGAGACCGC ATATTCGACC CATACCACCC TCCGTACGTG CCGTTCTCGT TGACGCCCAA CACACCTCCC AACCATCGCC CGTACCTAAC GACGTGCGAC ACAGTCTACG TACCTGTGTA TCTCTCGTGG AGCTACAGAG ACGTTACGAC GAACTACTCG ATAACCTTCC CGTACGCTCG CGTCCACGAT GGACGGCGGC GGCGTTGAAA CGCACGGTAC AGCTGTTGGC CGTCGACCCC AACAACAACA ACAACAACAA TCAACACACT ACCCGAACGG GACACCCATT CCTATCGACG CTCCTTAAGA CGGTAGACAC GGCTGTCGTG TCTTCCGCAG AACTCACAGG CAATAACGGT CGGCCGAATC CATGTTTTTC CACAAATCTG TCGCTCTACA CACTCTGTGA TGTCTTGACC TCGCTTTCTA TACTCGCCAT TCGCACGGAA TGCGGATACG GCAACACTCA CGTTCACGAA TTCCACAATA GACTCCAAAC ACTGGCACAG AAAATATGGA ACCACTTGGC CGAACGCTCC GATATTGTTC TCCGTCAAGC GGGTCCCAAA CGATGGGTGG ACTGCCTCCG ATCGGTTCAC TCGTTGCAAC TCGACCACGC CACCCATCCC ACTACAACAA CAACAACAAC AATAAAGACG CATCCACCAA ATCTCCCTGT TTCTACCAGT CACTTTGCCA TCGATTGACC CGCGGCGATG CCCTCGCGCA ACTTTCGGCC CGTGACCTTA CCGCCGTGTT ACAAGCGCTC GCGGCCTCCC GATTGGACCA CGCCGCCGAA CGGAACCTCC TTCGCGCCGT GGCGCGTCGT TTGCGCAAAA AATCCGTACG AGTGAACGCC TCGGATGCCA CGCTTGCCCG GGCCGCCGCC GCGGCAGTAC TACTCGCTTC ACGGAATGTT GACGACGATA TCGCCACGAC CGACGTGCAC GAACTACAAA CCCTCGTGTA TACCGTTACC AAAGATCTGG TCGCCCACGC CAAGGAATCT AACGAGGCGA GAGCGGTCTC GGAATTGGTA GCGACACTAG TTTCCTTGTC GGCCTTTGCC GTCCCAACCG ACGATCCGTT GGTACGGGAT GCGTACCAAT GGCTCACGCA ACGTGCCTTG CAAAGTCCCG AGACGTGGAC GGTCACGGAC CTGGCACGCT TTGCGGAAGC ATCCGTATCC TGGAATCTAC CCGCAACCAA CGAACTGGCA CAAGTGCTGG CTACACACTT TGGTAGTCTT GTTGCCGTAG CGGCCCCGAA TTTTCCTTGC CAGCCGTCGC ACGTGAATGA TATTTTGCGC TGCGCCGTAT TGCTCCACGG CCGTGACGAC TCGATTATGA AAATGTACCG GACGACAGCC CGGACGCTCT TTCTAGACGA AAAGTTTTTG GGAAAATGTG AGATGCGGGA ACTATCGAAC TATGCGTGGT TCATGAGCGC TGCAAAATGG AACGATGACC AAGTGTGGGA GGCTTTGGGG AACGAAATTC TACGCCTGGA ACATACGGAG GAATGTTCGC CCAAAACGGC CTGCCGAGTA CTGCGATCGT TCACCAACCA AGCCTCGTAC GCTGAAAGTG CACCCCAGCA AGTCTCACGT CGGTCGGAAA TGCTCTTTTG TTTGTTCCGA AACCTCGGTG AACCCTTGCT CTCCACACAA ATTTCTACCC GCGATGTGAC GTCGGCCATC TACACGTACG CCAAGGCGCT GTACGTTGAC GATATGGGGA TTTTTGACCA TCTAGTGGAA GTAATGGTTG CCCGCTTGGA CAATTGCAGC AATCGTCAGG TCGCCCAGAG TCTCTGGGCT TGCGGAAAAA TGATGGCTTG GGAAGGGGAA CTCGAAATCG ACGACCACCC GGAACCGCCT TACTTGAGCA ATTCCTGGGC GTTGGCTTCC TTTCTCTCTA GCCACGCGGA AGAGCTGTCG ACCAAGGACG TGGCACAGGC ACTATGGGCC ATGGCGTATC TGGGTATTCG TGACGCTTCC ATTGTGAGCC CCATCGCCAG TCGCGCCCAT GCATTATCTG CGGAACTAAC GTCCCAAGAG GTGGCGAACA TCGTCTGGGC ACTCTGCAAA CTTCAGAGTC AAGACTATAA GGTAATTTTT GTTCTGACAA GACGCTTCGG GTTGGACAGT AAGCTGCAAC CGACGCCACA GGAAGCTGCC AATGTATTAT ACGCTCTGGG ACGCATGAAC ATTCGAGACG AAGAAGTCTT TCGTAATTTA TCCTCGGGTA TGATTCGCCA GATCAACATG GCCAGTTCCC AAGCCCTCAC GAACGCGATG TGGGCGCACC GCGCCGTGCA CATTGTTCCA CCCCGACTTT TGCTCGATAG CTGGGCCACC CAAAGGCTCG GACTTGTAGG CGTCCAATCG CATTTTAAAG ACGACGACCT GAGCGACTAC ACTGTAGAAT ACGACTTGAT TTGA
|
Protein sequence | MKRQRTLPTS AWTILWGMGQ STGVYLGTVP TPVHAWVPPL TRPHIRPIPP SVRAVLVDAQ HTSQPSPVPN DVRHSLRTCV SLVELQRRYD ELLDNLPVRS RPRWTAAALK RTVQLLAVDP NNNNNNNQHT TRTGHPFLST LLKTVDTAVV SSAELTGNNG RPNPCFSTNL SLYTLCDVLT SLSILAIRTE CGYGNTHVHE FHNRLQTLAQ KIWNHLAERS DIVLRQAGPK RWVDCLRSSL CHRLTRGDAL AQLSARDLTA VLQALAASRL DHAAERNLLR AVARRLRKKS VRVNASDATL ARAAAAAVLL ASRNVDDDIA TTDVHELQTL VYTVTKDLVA HAKESNEARA VSELVATLVS LSAFAVPTDD PLVRDAYQWL TQRALQSPET WTVTDLARFA EASVSWNLPA TNELAQVLAT HFGSLVAVAA PNFPCQPSHV NDILRCAVLL HGRDDSIMKM YRTTARTLFL DEKFLGKCEM RELSNYAWFM SAAKWNDDQV WEALGNEILR LEHTEECSPK TACRVLRSFT NQASYAESAP QQVSRRSEML FCLFRNLGEP LLSTQISTRD VTSAIYTYAK ALYVDDMGIF DHLVEVMVAR LDNCSNRQVA QSLWACGKMM AWEGELEIDD HPEPPYLSNS WALASFLSSH AEELSTKDVA QALWAMAYLG IRDASIVSPI ASRAHALSAE LTSQEVANIV WALCKLQSQD YKVIFVLTRR FGLDSKLQPT PQEAANVLYA LGRMNIRDEE VFRNLSSGMI RQINMASSQA LTNAMWAHRA VHIVPPRLLL DSWATQRLGL VGVQSHFKDD DLSDYTVEYD LI
|
| |