Gene Information |
Locus tag | PHATRDRAFT_49710 |
Symbol | |
ID | 7198395 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 20368 |
End bp | 24327 |
Gene Length | 3960 bp |
Protein Length | 1200 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184554 |
Protein GI | 219128720 |
COG category | |
COG ID | |
TIGRFAM ID | |
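The coordinates and lengths in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (the record does not state its coordinate convention) and that the protein is followed by a single stop codon. The helper names `span_length` and `cds_length` are hypothetical and are not part of any database tooling.

```python
# Values copied from the Gene Information table.
START_BP = 20368
END_BP = 24327
PROTEIN_LENGTH_AA = 1200


def span_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def cds_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return 3 * (protein_aa + 1)


gene_bp = span_length(START_BP, END_BP)
coding_bp = cds_length(PROTEIN_LENGTH_AA)

# Prints: 3960 3603 357
print(gene_bp, coding_bp, gene_bp - coding_bp)
```

Under these assumptions the span matches the listed Gene Length of 3960 bp, while a 1200 aa protein accounts for 3603 bp including the stop codon. The remaining 357 bp would then lie outside the coding region, which is consistent with the bases that follow the first in-frame TAG in the gene sequence shown on this page.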
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCGAGCT CGTCGTTACC GACGACGAAG GTCGTTTCGG GTCACAGTGG ATTTCACGCG TACTCTCCCA TTAGCACATT AGAGCGCCAG TACGCGGGCT CCCCACGCGT CTTTTTATTT GTCAAAATCA ATGCCAACTG CTGTTTTCCG GGAGGTGTCG AGGCCATCCT CAAGGTCAAG AACAACGCTC TCTCCACTCT CTCGACGCAA CGAGATGGAT CCTACGAAGA AAACTCCCGA GAAAGAACAG GCAGTGACTA TAGTGATCCC AATGCGAACT GGGACCCTTT TTCGTTCCTG AGCGATCATT CGGAAGACGC TTCTGCGAAC GGCTCCGCCC TTGCCGATGG CAGCACAGCA GCTAATAAAA AAGGTCCCAA CCTCGGAAGA TTCCTGAAAA AAGTGGCGAA ATCTACCACG CAATCACTTG AACGAGGATT TCACAATATT GCTATCCGAG CGGACCAAGG AAGGAATGCA GACCTAATGG TGTTGGGTCT TTACGATGAG CAGGACGGAC TGCTCCACAT GACGGAATCG CAACCGCTAC CCGATGACCA CGCCCGTCTT TCGGGCGTCC GCTTTCTTGT TCCTCTCATT CTTCCTGCAC ATGTTGATGG AAATAATCGT GTCGTGATCA AGCTGTGGAT CCGGAGTGGG GCCGCCTTTT TGCAAGGAAC AAAGTCAGCG AGGAGCTACC TTATCGGTTC GGTACATTTG TCAGCGGCAA GGCTTCGATC TATTGGCACG TCAGGCGCCT TTCTTGATTG TAATGTCCAG TCTACATTGG TTGCAGACGG TCAGTTGAAT ATTTGTGTTG TCCCGGACCT CAAGTTTTCA CCCTTGGGCG GCCGCGGGTG GTCATTGGCG GACCCTGATG CAAATACCGC CTACCAGAGC CACAGCAGCT TGTTTAATTT GCCCCTCGAC ATGTCTTACG GTTTCACTTT CCCGCCCCGT CCCCACGCAT GTTTGGTCGC TAGCGAACGC GCTGTTGAGT CGACTGTGGT TTTGCCCATT GCGGCAGCCT TCGCGACATT GGCGTCCCAA GCGGCACAAG TTTCCCTCCA TCACGCCGTG ACGGTTCGCG ATCGCGTCTT TTACATTCGT CACGACAGTG CCGTTGGGGA ATATGCCGAC GTAAATGTTG GAATCGGCGT GCTGCAGACG GATCCGGAAA TGTTAGCCCA CACAACACCG TTCGTGTCGG CGTCGTGGCA GCGGGCGGAC TCTATTTTTG ATGTGGAACT CTTGCATCCG ACCAAAGTGC CAACAGCTTC GTCGCAGCCC ACCGATTTTC GGCCCGCCAT CGCGTTTCGA TTTTTCCCCA AACCAAGTCG GACACGCATT TTACCGGCTT TGCTGCACGC CAACGGTGGA CGCTTACCAA ACTGCGGCTT CATGCTCGGA TCCTTGAGAC TGCTGATTGT CATTCCCAAG CCCAGATCCA CGAATGGTAC AATCCCCGAA AATTCCTATG GAGGACCTGC TTCGTCTTTG GCACCACCCG ATCAGGAAGT TTGGGAGTGC ATGATTTCGC TTGATTCACA TGTTTTGCAA GCCTCTGGTG GCAATTCAGT CTCTCTGTAT CCGGTTCATC ATGTTCCCTC ACGTCGAGTC ATGGGAACGA TTTGCCTTTC TTTGTCACTT CAAATGCAAC AGGGTCCTAC CGTTCCTACT GAAGCTATCC CAGCGCGTGG TGGACTCGTT TCGTTGGTCG GCATGGATGC CATGATGGAT CATGTGTCAC CTTCGTTGGA CTTTGATCCG CAACCAACTA GCCTGGAGCC CGCTTTCCAG 
CGCCGAGAGC AACAACTCGC CACAATGGGC GTTTTTGCAA CACACGCGTA CGTGGATCAA CACGTGAAGA ATACGCGATC AACAGATGTT TTGATTATCC AAGAAAGGGC CAATCAATAT CAGGCCGCCT TGACGATGAA GCGCAGCGGC AAAAAGCCTC CCACCCACGA GGATCGTTCA CCTAAGCCAT TTAGGCCTTC ATCAAGTCGA CCCGAGATTT TGCTGAGTGG CATTCCGTTC AATTGTCATA CTGCCACGTT GGCACTGAAC TTGACCGATC CCGAGCAACC CCGTGATAGC AACATGACCG GAGCCCTGTT TTACGATGTA ACATGTGGCG CGCCGGCGGA CCATGCTCGT GGATTTGGTA ACGTGTTTCC GTCCAAGAAG GATACTACCA CCTTCGTCAA CACTGGTCTC ACCTCGCCGA TTGGAATAGT TACCGGTGGA TTGCGTCGGA TCGAGTCGCG CCGACAGGAA CTCGCTAAGC TTGTGTACGA TTTGCAAACA ACGTTAACCA TGAATGTTCA AAATTACTTT GGAAAGGAAC GTCAACAAAA GAATTTTGTG AATCACGTGA CCTCATCGTG TTCGGAGCTG CAGGATTTGA GGTGGCAATT GTTCGAAGCA ATTCAAGCAT TGCATCACGT TACTTGGCAC TGTGCCGTAC GTCGGGCCAG CGTATTTCCG CAAGCCCTCG GTTTGGCGGT CACATCCTAC ATGGCCTCCT TAAGCGACTC CAACAAGTAC CAGTCGACGT GGCCAGATGC TTGGGCGCAA CATGGATATC TTGTATCCTT CGAAGGTCTT CTCAGTGCCG CAGGAAAGGA GCTTGGTATG ATTGAAGACG CTTCGGTTGG AATAGGTATG TTGAATCGCG TCCAGATCCA GATCCAATCC GACGACGGCT CTTCGAACAA AGACCGAACT CCAGTTCCAC ACTCGCCGTA TTTGAAATGG TTGACATTTT CAGCTATAGG CGATGGTGCA AGGACCGAAT ATGTGTTGAA ACTTGGCGTT TTGCCTTCTT ACTTCAACGA GCGCATCCCG AATTCGCTCA AAGGAGGGAC GATGGTTCGG CTCTATCCAC TTTTGTTCGA AGTCGGTGTT GATATTCGCC AGTGGGGAGC AAACACGGGT TCGAGCGTCA AAAGTCAAAT CAGTAGTCGT TCTACAAATT CGACTTTTGC GCCAGAAGAA ATGACAAAGG AGCCTACCGG AGGCCTATTG GACGAAGAAG ACGATGATGT TGGTGTTTCC GACGACGATG TGCTGGTGCA ACTGAACTAC GAAGCTTTTC AAAAGATGAA TACGTATGCC TTTTCCATAT TTCCAGTTAG TGCCGAGCAA GGGGGACAAC CACGCACCCA CCCTCAACTG GAAACGTTAT ACCAGCACAT TGTGAGCTCA GCCGGTAAGA TGAACCACGA TATCTTGGAT GAAGCAGCGT CTTTGTCGAA GCAGTTAGGC GGGGGTGGTG TGGTCTTTTG CAAATCGGGT AAGGATCGTA CGGCTATGCA TATCACATAC AAGCAAGCGC AGTTTGCGTG CCAATTTCGT CAGCGACATC CGCTTCCCGA TCAAAACGCT TCGCTCCCAG ATACAACGTT GGCGGACGCC ATGATGATGC GCGTTTACGG AACACGCTTG CCAATTTGTG AAAAGAATGT GGGCCAATCC AAGTATGCCT TTAACAGCTT GCAAGTGAAG TTTATGCCGG ATGCCCTCAA GCCTCCCATG AACACACTTG CTGGTTTTCT CAAGGGCGGC AAAGTCTTTG CTGGTGGTGG AATTGAGAGC TAGAATGAGT 
TTGTGCATAA TCGGCTCAAA AAATGTTTGA TGGACCAAAA TTGAATACTG CTTCCCTTCT AGTCCTAATA CAGGCCCTTG TTTTGAACTT GACTGTACTT TTCACTCTTG TGTTGCGTTA CAGTGGGAAC AACGAAAGGG GAGCAGCTGA TCCATACTGC CTTAGGATTT TGAGGATTGA ATGGTATATG GCAGTCTTCC ATGCATTGGG TAACCAGGCT TGTCTTCCTC TGTTCTAGAG TCTATTCTAC TTTACGCATC AATTGAGGAG CTACTTCTCT ATTGAGGTGC ACTTTCAACC AAAGTAAACA CAACAAAGCT GTCATAATCA ACGCTTGCCG TTATTTTCAT
Protein sequence | MASSSLPTTK VVSGHSGFHA YSPISTLERQ YAGSPRVFLF VKINANCCFP GGVEAILKVK NNALSTLSTQ RDGSYEENSR ERTGSDYSDP NANWDPFSFL SDHSEDASAN GSALADGSTA ANKKGPNLGR FLKKVAKSTT QSLERGFHNI AIRADQGRNA DLMVLGLYDE QDGLLHMTES QPLPDDHARL SGVRFLVPLI LPAHVDGNNR VVIKLWIRSG AAFLQGTKSA RSYLIGSVHL SAARLRSIGT SGAFLDCNVQ STLVADGQLN ICVVPDLKFS PLGGRGWSLA DPDANTAYQS HSSLFNLPLD MSYGFTFPPR PHACLVASER AVESTVVLPI AAAFATLASQ AAQVSLHHAV TVRDRVFYIR HDSAVGEYAD VNVGIGVLQT DPEMLAHTTP FVSASWQRAD SIFDVELLHP TKVPTASSQP TDFRPAIAFR FFPKPSRTRI LPALLHANGG RLPNCGFMLG SLRLLIVIPK PRSTNGTIPE NSYGGPASSL APPDQEVWEC MISLDSHVLQ ASGGNSVSLY PVHHVPSRRV MGTICLSLSL QMQQGPTVPT EAIPARGGLV SLVGMDAMMD HVSPSLDFDP QPTSLEPAFQ RREQQLATMG VFATHAYVDQ HVKNTRSTDV LIIQERANQY QAALTMKRSG KKPPTHEDRS PKPFRPSSSR PEILLSGIPF NCHTATLALN LTDPEQPRDS NMTGALFYDV TCGAPADHAR GFGNVFPSKK DTTTFVNTGL TSPIGIVTGG LRRIESRRQE LAKLVYDLQT TLTMNVQNYF GKERQQKNFV NHVTSSCSEL QDLRWQLFEA IQALHHVTWH CAVRRASVFP QALGLAVTSY MASLSDSNKY QSTWPDAWAQ HGYLVSFEGL LSAAGKELGM IEDASVGIGM LNRVQIQIQS DDGSSNKDRT PVPHSPYLKW LTFSAIGDGA RTEYVLKLGV LPSYFNERIP NSLKGGTMVR LYPLLFEVGV DIRQWGANTG SSVKSQISSR STNSTFAPEE MTKEPTGGLL DEEDDDVGVS DDDVLVQLNY EAFQKMNTYA FSIFPVSAEQ GGQPRTHPQL ETLYQHIVSS AGKMNHDILD EAASLSKQLG GGGVVFCKSG KDRTAMHITY KQAQFACQFR QRHPLPDQNA SLPDTTLADA MMMRVYGTRL PICEKNVGQS KYAFNSLQVK FMPDALKPPM NTLAGFLKGG KVFAGGGIES