Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43615 |
Symbol | |
ID | 7197335 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 986625 |
End bp | 989134 |
Gene Length | 2510 bp |
Protein Length | 811 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177732 |
Protein GI | 219111961 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0555808 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGCGAA CAATAGCTTT TTGGGCCTTA ATCGTTGTGC TCTTATTTAT TTCTGTGCAC GCCGAGCAAG TGCCGGTTGG ACCACCTGAA GCGCTTTCAT CCAAAACGGA GCATCAACAT TTCCAAAGAC TGCTTCAAAG TAGCTGGGAT GAGATCGACT TTACAGATAT CGATTCTCAC AAGGCTGCCC GGAGTGGGTT AGATCAGCTC GTCAAAGATG GCAGTCATAT GGAATTGCGC TGGGCCAAAA TGTTTGAAAA GGCCAAGACA CCTGAATGTC GAGCAAAGAT TGCAACCCAT TTTGGGTATT TCTACAACGC AATTGCATCA GAACAATCGA TGCCATTTAG TACTGCGAAA TTTGAAAACA AGTGCCCGGA ACCGTTCTAC GATTGGGAAA ATTTACCACC CGATATGCAC GTTGGGCATG TTCAGAACCG AACTTACCAG CCGCCCCGAG AAAACGCCAC ATACATTGAC GATCCGAAAG ATCTGCGTTT CTTGTACGCA ATTTTGACTC ACGGCGAGTG GCATTCTACC ATTCGACTTA TTGAAACTCT CTACGAGGAT GGACATGTTT TTGTTGTGCA CGTAGACGGC AAGGAAAATT CTGATGAGAC TTACAAAGCT CTGCAAAAGT ACGCTGCCAC AAGAGATCAT GTCCACGTAC TTGGATCATC CTTCCGTGTT CGCGTAAACT GGGGAGGCTT TTCAATGGTC AACGCGACGC TGCAAATCTT ACAATATTCG TTCAACGTGA ACGGGCACTG TTCACGACAA CGAGACCCAC TGGTATTCGA CAAGGTTATC CACTTAGCTT CCTCCTCGTA TCCCTTGGCG ACACGATCTG AAATTCGTCA GCGTATAGCG TCATTTCCTT TGGATGCCAA TTTCTTGCAC GTAATTATGA AACCAACTCG CCCCAGCCCT GATGTTTGGC ATTACTTTGT TGAATGTGAC GACAGTTTAC ACCGTATTTA CCGTCTCAAC CCGTTGAACA ACCACACGAA CGGTATGGAG CTTTTCACTT CATCTCAATG GTTCATCATT TCTCGCGAAT TTGCTGAATA TTTAGCTCGT GCCGAAGCAG GAACCTTCGT ACACCAGTAT CTCGACTACA TTGAGCATGT TGTAGTCGCC GACGAAACCT TCTTTGGTAC CGTCCTTCGG CACACTCCTT TCTGTTTGAA GCATCATAAT CGTAACTTTT TACACTTGCA GTTTGATCGG TGGGAGTCAG AGCTTCCGTC GAATGATCGC GATCCTCGGA AGTGCATGAT GCTTGATCCC AATCATTGCG GGCGGTCACC TACCACGCTT ACGGCCGACT ATGCAGACAT ATTGGAGCTC AGCGACGATT TGTTTGCTCG AAAATTTGTG GAGCACATAT CGGACTTCGA AGGCAAATCG GAAGAAGAGG TACCTGAGCA TAATGTCAAA GACATTGTTG ATGATTGGCG AAAGCGTCGG GGCACCAGCA AGCAAGGAAG TAACTCTTCG ACTACACTTT CGGGACAACC GCAAATGACC TTCGAAGGAC ACGGTGTTTT GCTGGTTGCT AGAGAGACGC TCGGGATTGA CGGTGGCGAT AGACAACCGG TTCCCTTGTG CTTGGGTTTG GGAGAAACTG GCAACAACTT ACATCTCGTG CCGTGTTTTC ACGATTGGGT GATACCGACG CTGGCGCCCA ACTGGGAATT CGGTGCCGTA ATTGAAGCCG AAACAATACC TCACAATCGC TGGGAGATGC AACCCTGTAC ATCGGATGGT CACCTAGAAC GATTGTGAGT AACATTGCAT AGTTGGCTTC CGTTTGTACG TTCGATCTTT CAATTAACCC AAGATTTTTC TGCGACAGAG ATTCCGGTGA AATTGAAGTG ACACCCGGTA ATTACTCAAT TACGGGACCG CGATGCATGT TGAAAATGAT GGAAGGTATA CGTGCGGGGC GATGTTTTGA CGGGGATTCT GGTAATTCGC AACCTGGAGG AGAAGTGCAA GTATTTCCAT GTGTTCACCG CTGGGTACAG TTCCTGTCTG TTGGCGACGG GAGACTAGCA CCCAAGGGAA GCCTATTCTT CACCATTCCG CTCCACATCG TTCGACAGAT CCATAGGATG GGACATGAAC AAAGTCCTCA CATGTGCTTG GGGGTGTGGG GGCGTGGAAA TAAAGACGAA GTGGATTGGA AAGACGAATC GCAAGCCTTT TCACAAGAAA GGAAAGAAAA CCCAGTAGAT GGGTGGAAGC CGTTGTCGGA GTGGGAAGGA GAAGAACTTT TCTCAACACA ATGTAGCAAT GTCGGGGCAG TCGTAGAATG GATTTTTGTG CCGTTCATAG TAGAAGATAA TCTTTCCGCG GACGACGGTA CCGACATCGA CGCTACTACA GACGCAGACG AAGTTATGTT TGGACCTGCC ACAACAACTG AAGACACAAT GAATACAGCC CACGAAACCG TGATTGACGA TAATGGTGCA GCTCCAGAAC GAATTGGCGA TGAGCTATGA
|
Protein sequence | MQRTIAFWAL IVVLLFISVH AEQVPVGPPE ALSSKTEHQH FQRLLQSSWD EIDFTDIDSH KAARSGLDQL VKDGSHMELR WAKMFEKAKT PECRAKIATH FGYFYNAIAS EQSMPFSTAK FENKCPEPFY DWENLPPDMH VGHVQNRTYQ PPRENATYID DPKDLRFLYA ILTHGEWHST IRLIETLYED GHVFVVHVDG KENSDETYKA LQKYAATRDH VHVLGSSFRV RVNWGGFSMV NATLQILQYS FNVNGHCSRQ RDPLVFDKVI HLASSSYPLA TRSEIRQRIA SFPLDANFLH VIMKPTRPSP DVWHYFVECD DSLHRIYRLN PLNNHTNGME LFTSSQWFII SREFAEYLAR AEAGTFVHQY LDYIEHVVVA DETFFGTVLR HTPFCLKHHN RNFLHLQFDR WESELPSNDR DPRKCMMLDP NHCGRSPTTL TADYADILEL SDDLFARKFV EHISDFEGKS EEEVPEHNVK DIVDDWRKRR GTSKQGSNSS TTLSGQPQMT FEGHGVLLVA RETLGIDGGD RQPVPLCLGL GETGNNLHLV PCFHDWVIPT LAPNWEFGAV IEAETIPHNR WEMQPCTSDG HLERLDSGEI EVTPGNYSIT GPRCMLKMME GIRAGRCFDG DSGNSQPGGE VQVFPCVHRW VQFLSVGDGR LAPKGSLFFT IPLHIVRQIH RMGHEQSPHM CLGVWGRGNK DEVDWKDESQ AFSQERKENP VDGWKPLSEW EGEELFSTQC SNVGAVVEWI FVPFIVEDNL SADDGTDIDA TTDADEVMFG PATTTEDTMN TAHETVIDDN GAAPERIGDE L
|
| |