Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50497 |
Symbol | |
ID | 7199331 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | + |
Start bp | 219236 |
End bp | 223093 |
Gene Length | 3858 bp |
Protein Length | 1165 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185401 |
Protein GI | 219130498 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTCTT TGGTCTTTGG CAGTCCAGAG GGAAGGACCA CTGAATTGAA AAGACAACTC CATTGGGACG AGTCGTCCAC GGATATTTAT TCCGCCGCGT TTCCTCCTGA GGCAAGTAAG TCTCATCCTA AATCAAGTGA ACATTCAAAA AAGGATGGCT GGGACCTCGA GAATCTGGCA CCAAATATCA TCTTTCGGAT TCGCTTGTTT CGGTTGGTAG TGGTGTCCAC CTTGCTCATA GCAGGTGTGA TATGCTCGAC TCTCACATAT AACTTTCTCC GTCAGGAGGA GCAAGACAAC TTCGCCAATA CCGTAAGTAA ACGAAATGAA ATATATGCTT CTTACGAGCA CGCCGTCCGT CGTTTTTCAT CAGATCTTCG TTTCCTCTGC AGTACTTCTC CTTTTGCGAT CATATGCAAG ACACTACTCG TTTTCGTGTT CAGAACACTA TCCAGGCTTC TCGTGGCCTG TCGGAAACCT TGACTGCCTA TGCAAAGTCT TCCAAATCGA CCTTTCCGTT TGTGACTATG CCTATGTTTG AGGTCGCTGG TAAAAATGCC AGACATCTAT CGGGTATCGA TGTTTTTGGC TTTCTACCGT TTGTCGATGA AGATCAGAAA AATTCTTGGG AAGTATATGC TCTCGAAAAA TCCAGTTGGC TAGACGAGAG TAGAGTAAGT CAGCCTCTAG TGCTTTAGCC GGCCGGAATA AATGAGTTGC GGAGCTCATA TGCATTGTTT ATTTGTTGTG ATTAGGATCT TTACAACAAC GAAAGTGGGA ATGAAACTTC CGGATTCACA GAAGCTCCAT TCCCTCTGAC AATAACCCAA TTTACATCCG ACTTCGGAAC ATTGAAACCG GCTGTGCTCG GGCAAAAGGT GAGACTTGTA TTCAACTTCT GTGCAGATTT TTTTCAATCT TTACTGATTT TTGCTCCTTT GAGTGTAGCT TTACACTCCA GTCCATCAAG TTACTCCGCC GCCTTTGTCG AATAGTTTGC AGAATTTAGA TTTTATCACC CTTCCGTCCT TTGCTCGATC GCTCGAAGCT ATTACAGCTT TAAAAGGTAA GCAGGTGAAA AATAATGGTC TAAACCGCCG AAGCATCTTT GTACTGATAA TTGACAATAC TTCTTTTTTT TCAGATGTAG TTTTCTCTGA TTACGAAGAT GTGGACCTTC TTGCTAGGAT GGCATACCCT CCGGACGCTC AAGACACACT TGCTGGAGGC AGAGGGAGTG TTGACTACGA TGCAAGTCCT CGTGTATATA TGCATACTCC AGTTTACTCA CAGCTTAGTA CAGATCACGA AGAGATCGTT GGAGTATTGA GTGCGCCATT TGCTTTCGAT CTCTACCTCC AGGATCTACT TCCCAAAGAC ATAAAAGGAA TACACGTTAT TCTCGAAAGC AGTTGTAACA AGTGGGCTAC CTTTGAGCTG ATTGGGGAAC GAGCAGTCTA CTTAGGGCTT GGAGACAGGC ATGACCCTCA GTACGATCAC ATGGAGGTTG CTGTAGAGTT CACCGGCTAC GGAGGTATAA AAACTGAGAA TATTCCTGGG CAATGTGTGT ATACTTTGCG CTTCTATCCT ACTCGATACT TCGAGATCAC CTTTGATCGA AACACCAAGA TTATCGCTCC TGCTGTCGCG GCAGCCACAT TCTTTTGGCT GACCCTGGTT TTCTTTCTGT ATGATCGGTA TGTCCAACAA CGCACCGTCA AGGCTATTGA CACTGCGGCT CGGTCAGGGG CTGCAGTAGC TTCGCGATAC CCGTCTAGGA TTCTTACCCG CCTTTTCGAA GAAGCCAGCG ATCGAAAGGG CTGTCATGCT TCTCAAAGTA CCAGTTTAAA TCACAAACCC ACTCTAGACT GGACCGATGG CTGCAGCATT GACCACCCTT GTACTTTACC TTTCAAAACA AAGCCAATCG CAGAATTCTT CCCCACCTCT ACAGTAATGT TTGCGGATAT TGTTGGTTTC ACCGTATGGA GCTCAGGCAG GAAACCAGAG GATGTTTTTA CGGTATTGGA GACTCTGTAT CACGCGTTTG ATAAGATATC CAAGAATCGT GGGGTATTGA AGGTTGAAAC TGTGGGGGAC TGTTATGTTG CTGTGAGCGG CTTGACTAAC CAGCAAGAAA ATCACGCGGT GATTATGAGT CGCCTCGCGC GTGAGTGCAT GCATACAGCT CACGCTCTGA CGAAGCAATT GTCTTCAACC CTGGGGTCCG ACACTGCCGA ACTTTCGCTA CGGATTGGTA TACACAGCGG ACCGGTGACG GCGGGTGTCC TGCGCTGTGA GCGGTCTCCT TTCCAACTCT TCGGTGATAC CATTAATGTG GCATCAAGAA TGGAACGAAG TGGTCGAAGC GGAAGAATTC AGATATCGGA AGAAACTGCT GACAAATTGA CTCAAGCCGG AAAGACCGAC TGGATCATTC CGAGGGAAGA CGCACTCTTG ATCGATGGCA AGGGCGAGCT CAAAACTTAT TGGCTTTTCC TTGGAAATGA TGATCAACAA TGGTGCGGCA GCTCTTCGAA TTTGACAAGA TCGGTCTCGT ACACCACGCG AGATGAAGAT ACTGGTGACA GTGTCTATGA TGACCTCTTT GGTACCAGTG TTGGAGGCGC TTCTGAGTCC TTGAGCAAAG TCGGTAGCCT GACCAGCGAT AAGATGACCC GTCTCATCAG TTGGAATGTC GACGTTCTTA CGCTTTTGTT AAAAGAAATA GCGGCTAGCC GTCGTCCAGA TCCCGAGCCT GGAAGAAGCA CTGCAAGCAG CCGTCCGCCC CTTCCGGTTC GGACACAAAC TAAAGCTAGT CCAGCGGCGG CTGTCCCCGG TCGTGTTCTG GATGAAGTGA AGGAGATTCT TTCCTTTCCG CTCTCTGATG CCGCGAGTTC TGAGAAAAAG TTTAAGGAAG GAGACGTAGA GCTTGAGGAA ACCGTTCTCT CGCAGTTGCG CGTCTACGTT ACCAACATTG CTGCCATGTA CCGCAACCAT CACTTTCACA ATTTTGCGCA CGCCTCGCAT GTTGTCTTGT GCATGACCAA GATGCTTTCG AACATCGTAG CGCCCGGTGA TGCGGTGAAT GGAGCAAAAC ATCCTCAGCA AATCTCGCAG CACGATCATA CGTTTGGTAT CACATCCGAT CCTCTGACGC GGTTTGCTTG CGTTTTCTCG GCACTCATCC ACGATGTGGA CCACTGCGGT GTACCGAATA CGCAGTTGGT CCAAGAGAAG GCCCGTATCG CTTCTTTTTA CCGGAATAAG TCTGTCGCGG AACAAAACTC TGTTGATTTG GCCTGGGAAT TGATGCTGGA CGAGAATTTT GCCGAGTTGC GGGCGGCGAT TTGTATGACT CGTGGGGAGC AGACCCGCTT CCGTCAATTG GTAGTAAACG CCGTTATGGC AACCGACGTG ATGGACCCAG ATTTTAAGCT ACTCCGCAAC GCCCGTTGGG AACGCGCTTT TTGGGAAAGC CGAGCGGATG CCACCAGTCG GGAGACGATA AATCGCAAGG CCACGCTGGT TCTCGAGTAC TTGATGCAGG CGGCGGATGT GGCGCACACC ATGCAGCATT GGCACGTGTA TCGCCAATGG AGCGAGCGCT TCTTTCAGGA ATGCTACACG GCTTTCCAGG ATGGCCGTGC GGAACACAAC CCTGCCGAAA TTTGGTACTA TGGCGAGTTG GATTTTTTTG ACTTTTACAT CATTCCGTTG GCCAAAAAGC TAAAGGTTTG CGGAGTGTTT GGTGGTTCTG GAGGCGAATA CTTGAACTAC GCGCTGAAGA ACCGGAAGAA TTGGGAGAGG ACCGGTCGGG AAGTGGTCCA GGAAATGGTG AGGTTGGCTT CTGCTTGA
|
Protein sequence | MGSLVFGSPE GRTTELKRQL HWDESSTDIY SAAFPPEASK SHPKSSEHSK KDGWDLENLA PNIIFRIRLF RLVVVSTLLI AGVICSTLTY NFLRQEEQDN FANTNTIQAS RGLSETLTAY AKSSKSTFPF VTMPMFEVAG KNARHLSGID VFGFLPFVDE DQKNSWEVYA LEKSSWLDES RDLYNNESGN ETSGFTEAPF PLTITQFTSD FGTLKPAVLG QKLYTPVHQV TPPPLSNSLQ NLDFITLPSF ARSLEAITAL KDVVFSDYED VDLLARMAYP PDAQDTLAGG RGSVDYDASP RVYMHTPVYS QLSTDHEEIV GVLSAPFAFD LYLQDLLPKD IKGIHVILES SCNKWATFEL IGERAVYLGL GDRHDPQYDH MEVAVEFTGY GGIKTENIPG QCVYTLRFYP TRYFEITFDR NTKIIAPAVA AATFFWLTLV FFLYDRYVQQ RTVKAIDTAA RSGAAVASRY PSRILTRLFE EASDRKGCHA SQSTSLNHKP TLDWTDGCSI DHPCTLPFKT KPIAEFFPTS TVMFADIVGF TVWSSGRKPE DVFTVLETLY HAFDKISKNR GVLKVETVGD CYVAVSGLTN QQENHAVIMS RLARECMHTA HALTKQLSST LGSDTAELSL RIGIHSGPVT AGVLRCERSP FQLFGDTINV ASRMERSGRS GRIQISEETA DKLTQAGKTD WIIPREDALL IDGKGELKTY WLFLGNDDQQ WCGSSSNLTR SVSYTTRDED TGDSVYDDLF GTSVGGASES LSKVGSLTSD KMTRLISWNV DVLTLLLKEI AASRRPDPEP GRSTASSRPP LPVRTQTKAS PAAAVPGRVL DEVKEILSFP LSDAASSEKK FKEGDVELEE TVLSQLRVYV TNIAAMYRNH HFHNFAHASH VVLCMTKMLS NIVAPGDAVN GAKHPQQISQ HDHTFGITSD PLTRFACVFS ALIHDVDHCG VPNTQLVQEK ARIASFYRNK SVAEQNSVDL AWELMLDENF AELRAAICMT RGEQTRFRQL VVNAVMATDV MDPDFKLLRN ARWERAFWES RADATSRETI NRKATLVLEY LMQAADVAHT MQHWHVYRQW SERFFQECYT AFQDGRAEHN PAEIWYYGEL DFFDFYIIPL AKKLKVCGVF GGSGGEYLNY ALKNRKNWER TGREVVQEMV RLASA
|
| |