Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_22659 |
Symbol | |
ID | 7194989 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | + |
Start bp | 25650 |
End bp | 27716 |
Gene Length | 2067 bp |
Protein Length | 638 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183274 |
Protein GI | 219126041 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTAAGTACAG TCCATTCGTA ATGGTCGATA CTCATGCTTC GTAATAAGTG TTTCACTTTC GGCATCGGAT TGCTCTGCTG TACTATTTTT GGGGCAACTT TGGACTTGCC GCCGCTCTCC CACACCGCTT ACGGGACGTC GACGGGGCCG AACGAGACGC CAATCAGTGA GAGTCTCCTC TGGAACTCCC TCGTGGATGC GTTGGACAAC ACCTGGCTTC GACATTTGTC ATCGGAAACT CGAGAGAATC TGGAACGCTC CCGTGAGATT GCGCACTTGT CGTCCTTGGA CGAGAACCGC ACGGCACGAC CCGTCTACAA CGGTCACTAT GTAGCCGTCC GACCCACGGG CTTGCCGCAA CCTCGGTTGC TCCTGTACAG CCCGGACGTG GCCCATCGAG AGTTAACTAT TACTCCAGAA CAGATTGAGT CGGAGGAATT CCTGGCCTGG ATCTCGGGAA ACCAAGTGTA CGGGCCAACT TGGGCCACAC CGTACGCGCT TTCCATCATG GGCGAACGTT ATACCAGCAA CTGTCCTTTC CGAACCGGTG ACGGCTACGG GGATGGACGA GCCATTAGCA TCGGCGAATT CTGGGGACAG GAACTGCAGC TCAAAGGTGC GGGAACCACG CCCTTTTCGC GCAGTGGAGA TGGGAGGGCG GTACTCCGGT CCAGTGTGCG CGAGTTTCTT GCTAGCGAAG CCATGTACGC GTTGGGTGTT GACACAACCC GCGCCTTGTC ACTCGTCATC AGTGATTCCG AAACCATCTC TCGGCCGTGG TACAATCAGG CCTCTAGCGA TAGAATCGAC CAGTCTCTAC CGTTTACTAA TGATTCTTCT CGACAGCAGT CGTCCGAAAC GGAACGACAA GATAGTATTG AACGCGTACG CCATCAAGAT AAACAAGATC CGAATATGTT GGTTCAAGAG AAAACTGCCA TTACCTGCAG AGTCTCAATG AGCTTCATTC GTATTGGCCA TTTCGACTTG TATGCACGAA GATCGGAAAA GAAAAGCTTG GAGAATTTTG AGTATACGGG TCTTCGTTTC GATACCACAA CTGCTGAATG GCACGAATTA GAGCAATTGA TATGGCATAC ATGTTTTCGT GAATTTCGAA CGGAATGTTA CGATCCTTTT TATCCTCGTC GCAACATCGC CGCGGCTGCC GCGCTGCTGC TCGACCTGGC GGCCGAAAAA ATAGCCACTA TGGTAGCCGG GTGGATTCGC GTTGGTTTTG CGCAGGGAAA TTTCAACGCC GACAATTGTC TCGTGGCGGG CAGGACAGTT GACTACGGGC CGTTTGGATT CGTCGAGGAA TTTGATCCAA CATTTTCGAA GTGGACCGGA AGTGGAACAC ATTTCGGATT CATGAATCAA CCATCCGCAG CCTTGGCTAA TTATAAAATT TTGGTGGAAA GCGTTGTTCC AGTTATCGCC GCGCAAACGA TGGAAGATAT GGAACGCATT CGGACTTCCT TTCTCGAAAG GGCACAAATT CTTTTCGAGA AAGCAGTGTC AGAGGTCTTT CGGATTAAGC TCGGATTCTC GAGAGATCAA AAAGAAGGAG ACAGACTTTG GAATTCACTA CAGTATATGC TGAGACATTC TAGAACAGAT TGGACTATTT TCTTTCGCCA ACTATCTTAC ATCACAAGGA GTTTGTCTCA AGATGCAGGC CGAAATTACA CGGAAATGAT GACAATGCTG GAAGGTGGTC AGCAAACCAG CTGGGGACAA GGGGCCTTCT ACGATGCGTT AACTCCATCA ATTCGTCAGC AGTGGATAGC TTGGCTGGAA GAATGGAGTA GTGCTTTAAG CGCAAATAGG ATGTCGTCTG ATGCGTTTGA GCAAATGATT TCTGTAAATC CCAAGTTCGT CCTTCGGGAA TGGATGCTGG TAAAAGCATA CCGTTCTGCT GAGATCGGTG AAGATGCCGA ACTCTTCTAT TTACATGATC TGATCCAAGC TCCTTACAGC GAAGGGAGTC CTCAACAGCA GACGCAATAC TATCGAAGGG CTTCCGAGCA TGCTCTCACG GCTGGTGGAA CAGCATTTAT GTCCTGA
|
Protein sequence | MLRNKCFTFG IGLLCCTIFG ATLDLPPLSH TAYGTSTGPN ETPISESLLW NSLVDALDNT WLRHLSSETR ENLERSREIA HLSSLDENRT ARPVYNGHYV AVRPTGLPQP RLLLYSPDVA HRELTITPEQ IESEEFLAWI SGNQVYGPTW ATPYALSIMG ERYTSNCPFR TGDGYGDGRA ISIGEFWGQE LQLKGAGTTP FSRSGDGRAV LRSSVREFLA SEAMYALGVD TTRALSLVIS DSETISRPWY NQASSDRIDH IERVRHQDKQ DPNMLVQEKT AITCRVSMSF IRIGHFDLYA RRSEKKSLEN FEYTGLRFDT TTAEWHELEQ LIWHTCFREF RTECYDPFYP RRNIAAAAAL LLDLAAEKIA TMVAGWIRVG FAQGNFNADN CLVAGRTVDY GPFGFVEEFD PTFSKWTGSG THFGFMNQPS AALANYKILV ESVVPVIAAQ TMEDMERIRT SFLERAQILF EKAVSEVFRI KLGFSRDQKE GDRLWNSLQY MLRHSRTDWT IFFRQLSYIT RSGQQTSWGQ GAFYDALTPS IRQQWIAWLE EWSSALSANR MSSDAFEQMI SVNPKFVLRE WMLVKAYRSA EIGEDAELFY LHDLIQAPYS EGSPQQQTQY YRRASEHALT AGGTAFMS
|
| |