Gene PHATRDRAFT_49710 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49710 
Symbol 
ID7198395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011692 
Strand
Start bp20368 
End bp24327 
Gene Length3960 bp 
Protein Length1200 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184554 
Protein GI219128720 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAGCT CGTCGTTACC GACGACGAAG GTCGTTTCGG GTCACAGTGG ATTTCACGCG 
TACTCTCCCA TTAGCACATT AGAGCGCCAG TACGCGGGCT CCCCACGCGT CTTTTTATTT
GTCAAAATCA ATGCCAACTG CTGTTTTCCG GGAGGTGTCG AGGCCATCCT CAAGGTCAAG
AACAACGCTC TCTCCACTCT CTCGACGCAA CGAGATGGAT CCTACGAAGA AAACTCCCGA
GAAAGAACAG GCAGTGACTA TAGTGATCCC AATGCGAACT GGGACCCTTT TTCGTTCCTG
AGCGATCATT CGGAAGACGC TTCTGCGAAC GGCTCCGCCC TTGCCGATGG CAGCACAGCA
GCTAATAAAA AAGGTCCCAA CCTCGGAAGA TTCCTGAAAA AAGTGGCGAA ATCTACCACG
CAATCACTTG AACGAGGATT TCACAATATT GCTATCCGAG CGGACCAAGG AAGGAATGCA
GACCTAATGG TGTTGGGTCT TTACGATGAG CAGGACGGAC TGCTCCACAT GACGGAATCG
CAACCGCTAC CCGATGACCA CGCCCGTCTT TCGGGCGTCC GCTTTCTTGT TCCTCTCATT
CTTCCTGCAC ATGTTGATGG AAATAATCGT GTCGTGATCA AGCTGTGGAT CCGGAGTGGG
GCCGCCTTTT TGCAAGGAAC AAAGTCAGCG AGGAGCTACC TTATCGGTTC GGTACATTTG
TCAGCGGCAA GGCTTCGATC TATTGGCACG TCAGGCGCCT TTCTTGATTG TAATGTCCAG
TCTACATTGG TTGCAGACGG TCAGTTGAAT ATTTGTGTTG TCCCGGACCT CAAGTTTTCA
CCCTTGGGCG GCCGCGGGTG GTCATTGGCG GACCCTGATG CAAATACCGC CTACCAGAGC
CACAGCAGCT TGTTTAATTT GCCCCTCGAC ATGTCTTACG GTTTCACTTT CCCGCCCCGT
CCCCACGCAT GTTTGGTCGC TAGCGAACGC GCTGTTGAGT CGACTGTGGT TTTGCCCATT
GCGGCAGCCT TCGCGACATT GGCGTCCCAA GCGGCACAAG TTTCCCTCCA TCACGCCGTG
ACGGTTCGCG ATCGCGTCTT TTACATTCGT CACGACAGTG CCGTTGGGGA ATATGCCGAC
GTAAATGTTG GAATCGGCGT GCTGCAGACG GATCCGGAAA TGTTAGCCCA CACAACACCG
TTCGTGTCGG CGTCGTGGCA GCGGGCGGAC TCTATTTTTG ATGTGGAACT CTTGCATCCG
ACCAAAGTGC CAACAGCTTC GTCGCAGCCC ACCGATTTTC GGCCCGCCAT CGCGTTTCGA
TTTTTCCCCA AACCAAGTCG GACACGCATT TTACCGGCTT TGCTGCACGC CAACGGTGGA
CGCTTACCAA ACTGCGGCTT CATGCTCGGA TCCTTGAGAC TGCTGATTGT CATTCCCAAG
CCCAGATCCA CGAATGGTAC AATCCCCGAA AATTCCTATG GAGGACCTGC TTCGTCTTTG
GCACCACCCG ATCAGGAAGT TTGGGAGTGC ATGATTTCGC TTGATTCACA TGTTTTGCAA
GCCTCTGGTG GCAATTCAGT CTCTCTGTAT CCGGTTCATC ATGTTCCCTC ACGTCGAGTC
ATGGGAACGA TTTGCCTTTC TTTGTCACTT CAAATGCAAC AGGGTCCTAC CGTTCCTACT
GAAGCTATCC CAGCGCGTGG TGGACTCGTT TCGTTGGTCG GCATGGATGC CATGATGGAT
CATGTGTCAC CTTCGTTGGA CTTTGATCCG CAACCAACTA GCCTGGAGCC CGCTTTCCAG
CGCCGAGAGC AACAACTCGC CACAATGGGC GTTTTTGCAA CACACGCGTA CGTGGATCAA
CACGTGAAGA ATACGCGATC AACAGATGTT TTGATTATCC AAGAAAGGGC CAATCAATAT
CAGGCCGCCT TGACGATGAA GCGCAGCGGC AAAAAGCCTC CCACCCACGA GGATCGTTCA
CCTAAGCCAT TTAGGCCTTC ATCAAGTCGA CCCGAGATTT TGCTGAGTGG CATTCCGTTC
AATTGTCATA CTGCCACGTT GGCACTGAAC TTGACCGATC CCGAGCAACC CCGTGATAGC
AACATGACCG GAGCCCTGTT TTACGATGTA ACATGTGGCG CGCCGGCGGA CCATGCTCGT
GGATTTGGTA ACGTGTTTCC GTCCAAGAAG GATACTACCA CCTTCGTCAA CACTGGTCTC
ACCTCGCCGA TTGGAATAGT TACCGGTGGA TTGCGTCGGA TCGAGTCGCG CCGACAGGAA
CTCGCTAAGC TTGTGTACGA TTTGCAAACA ACGTTAACCA TGAATGTTCA AAATTACTTT
GGAAAGGAAC GTCAACAAAA GAATTTTGTG AATCACGTGA CCTCATCGTG TTCGGAGCTG
CAGGATTTGA GGTGGCAATT GTTCGAAGCA ATTCAAGCAT TGCATCACGT TACTTGGCAC
TGTGCCGTAC GTCGGGCCAG CGTATTTCCG CAAGCCCTCG GTTTGGCGGT CACATCCTAC
ATGGCCTCCT TAAGCGACTC CAACAAGTAC CAGTCGACGT GGCCAGATGC TTGGGCGCAA
CATGGATATC TTGTATCCTT CGAAGGTCTT CTCAGTGCCG CAGGAAAGGA GCTTGGTATG
ATTGAAGACG CTTCGGTTGG AATAGGTATG TTGAATCGCG TCCAGATCCA GATCCAATCC
GACGACGGCT CTTCGAACAA AGACCGAACT CCAGTTCCAC ACTCGCCGTA TTTGAAATGG
TTGACATTTT CAGCTATAGG CGATGGTGCA AGGACCGAAT ATGTGTTGAA ACTTGGCGTT
TTGCCTTCTT ACTTCAACGA GCGCATCCCG AATTCGCTCA AAGGAGGGAC GATGGTTCGG
CTCTATCCAC TTTTGTTCGA AGTCGGTGTT GATATTCGCC AGTGGGGAGC AAACACGGGT
TCGAGCGTCA AAAGTCAAAT CAGTAGTCGT TCTACAAATT CGACTTTTGC GCCAGAAGAA
ATGACAAAGG AGCCTACCGG AGGCCTATTG GACGAAGAAG ACGATGATGT TGGTGTTTCC
GACGACGATG TGCTGGTGCA ACTGAACTAC GAAGCTTTTC AAAAGATGAA TACGTATGCC
TTTTCCATAT TTCCAGTTAG TGCCGAGCAA GGGGGACAAC CACGCACCCA CCCTCAACTG
GAAACGTTAT ACCAGCACAT TGTGAGCTCA GCCGGTAAGA TGAACCACGA TATCTTGGAT
GAAGCAGCGT CTTTGTCGAA GCAGTTAGGC GGGGGTGGTG TGGTCTTTTG CAAATCGGGT
AAGGATCGTA CGGCTATGCA TATCACATAC AAGCAAGCGC AGTTTGCGTG CCAATTTCGT
CAGCGACATC CGCTTCCCGA TCAAAACGCT TCGCTCCCAG ATACAACGTT GGCGGACGCC
ATGATGATGC GCGTTTACGG AACACGCTTG CCAATTTGTG AAAAGAATGT GGGCCAATCC
AAGTATGCCT TTAACAGCTT GCAAGTGAAG TTTATGCCGG ATGCCCTCAA GCCTCCCATG
AACACACTTG CTGGTTTTCT CAAGGGCGGC AAAGTCTTTG CTGGTGGTGG AATTGAGAGC
TAGAATGAGT TTGTGCATAA TCGGCTCAAA AAATGTTTGA TGGACCAAAA TTGAATACTG
CTTCCCTTCT AGTCCTAATA CAGGCCCTTG TTTTGAACTT GACTGTACTT TTCACTCTTG
TGTTGCGTTA CAGTGGGAAC AACGAAAGGG GAGCAGCTGA TCCATACTGC CTTAGGATTT
TGAGGATTGA ATGGTATATG GCAGTCTTCC ATGCATTGGG TAACCAGGCT TGTCTTCCTC
TGTTCTAGAG TCTATTCTAC TTTACGCATC AATTGAGGAG CTACTTCTCT ATTGAGGTGC
ACTTTCAACC AAAGTAAACA CAACAAAGCT GTCATAATCA ACGCTTGCCG TTATTTTCAT
 
Protein sequence
MASSSLPTTK VVSGHSGFHA YSPISTLERQ YAGSPRVFLF VKINANCCFP GGVEAILKVK 
NNALSTLSTQ RDGSYEENSR ERTGSDYSDP NANWDPFSFL SDHSEDASAN GSALADGSTA
ANKKGPNLGR FLKKVAKSTT QSLERGFHNI AIRADQGRNA DLMVLGLYDE QDGLLHMTES
QPLPDDHARL SGVRFLVPLI LPAHVDGNNR VVIKLWIRSG AAFLQGTKSA RSYLIGSVHL
SAARLRSIGT SGAFLDCNVQ STLVADGQLN ICVVPDLKFS PLGGRGWSLA DPDANTAYQS
HSSLFNLPLD MSYGFTFPPR PHACLVASER AVESTVVLPI AAAFATLASQ AAQVSLHHAV
TVRDRVFYIR HDSAVGEYAD VNVGIGVLQT DPEMLAHTTP FVSASWQRAD SIFDVELLHP
TKVPTASSQP TDFRPAIAFR FFPKPSRTRI LPALLHANGG RLPNCGFMLG SLRLLIVIPK
PRSTNGTIPE NSYGGPASSL APPDQEVWEC MISLDSHVLQ ASGGNSVSLY PVHHVPSRRV
MGTICLSLSL QMQQGPTVPT EAIPARGGLV SLVGMDAMMD HVSPSLDFDP QPTSLEPAFQ
RREQQLATMG VFATHAYVDQ HVKNTRSTDV LIIQERANQY QAALTMKRSG KKPPTHEDRS
PKPFRPSSSR PEILLSGIPF NCHTATLALN LTDPEQPRDS NMTGALFYDV TCGAPADHAR
GFGNVFPSKK DTTTFVNTGL TSPIGIVTGG LRRIESRRQE LAKLVYDLQT TLTMNVQNYF
GKERQQKNFV NHVTSSCSEL QDLRWQLFEA IQALHHVTWH CAVRRASVFP QALGLAVTSY
MASLSDSNKY QSTWPDAWAQ HGYLVSFEGL LSAAGKELGM IEDASVGIGM LNRVQIQIQS
DDGSSNKDRT PVPHSPYLKW LTFSAIGDGA RTEYVLKLGV LPSYFNERIP NSLKGGTMVR
LYPLLFEVGV DIRQWGANTG SSVKSQISSR STNSTFAPEE MTKEPTGGLL DEEDDDVGVS
DDDVLVQLNY EAFQKMNTYA FSIFPVSAEQ GGQPRTHPQL ETLYQHIVSS AGKMNHDILD
EAASLSKQLG GGGVVFCKSG KDRTAMHITY KQAQFACQFR QRHPLPDQNA SLPDTTLADA
MMMRVYGTRL PICEKNVGQS KYAFNSLQVK FMPDALKPPM NTLAGFLKGG KVFAGGGIES