Gene PHATRDRAFT_39943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39943 
Symbol 
ID7195558 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp421546 
End bp423969 
Gene Length2424 bp 
Protein Length807 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183993 
Protein GI219127544 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCTACA CAAAGCTATT AGGCCGCACG ATTGGTACTT TGATTTTTCT GGTATCGATG 
GGGCAGATAT ACAACCACGA GCGGTTCAAC CGAACGTCAG ACCTTTCCAT CCATGCAGCT
CGTTTGCCCT TTCTCAATGA TTCCTACGAT AAAGGCCAAA GAAACATCAG CCAAGTCATC
CCATATACGG AAGAGACGGC TGATTTCGAA AAGAGCTCCA ATAATTCTGG GCATCGACAG
AAGTCAGCGC CACCGTTGCC GTCGTGGGTC ACCGAATATT TTCACTGGCA TGCAGATGAA
CGACAAAAGA TGACTCCCGA AAGCTGGGAA GAGAGACGCT ATCTCGTCTT GCGCTGCCTC
GAATCTGACA AAAGGTGCGG CGGCACGGCC GACCGCTTAC ATAACTTACC CGTCTTGCTG
CGATTGGCGC AAAAGTCCCA ACGAATTCTG TTTATTCATT GGGAAAAACC AGCACCGTTG
GAAGACTTTT TACTTCCTCA ACCTCCGGAA ACTTCGGAAT TGCGTCTCGA CTGGAGACTT
CCATCGTGGT TAAAAGAGCC CATGCAGCTG GGTCAGATAC CGATTTCGCG GCTTATGAAT
GGTACGGACG GAACAGGCAT CATCGACTCC GATGAGCGAG TAGTGGCAAT GCGGTCCCTA
CACGGATCGC TTTTTTACGA TGAATTGAAA GGGCCCGACG AACCATCGTA CAACAATATT
TTAAAGTTGC TCTGGTATGC CGTCTTCGTA CCAGCTCCAA TTGTTCGCGT TCGTGTTCAA
CTCCAGCTCG CCAGGCTAGG ATTGACACCT GGAGAGTATG CGTCGGCTCA CATTCGTTCC
TTGTACATTG GAGACGAGAC CCATGTTGAT ACCTTGTATG TACACGCTGT CACATGCGCA
GCGCAAGCTA CCAACAATGC TTCTTTCCCC ATTTTTGTGA CGTCGGATTC TCCGTTGGTG
GAGGAACAGG CCGTTGTCTT CGGATCAGCC GTCCACCATC TACGTATAGT AACGCATAAC
AGATCGGAGA CCTTACACTT GGATAGAGGC CGCGATTTTT TAACGCAAGG CCGATCTGAT
AGCTGGAAGA GCATTGATAT GGACGGCTTC TACGATGTTT TTGTGGATCT TTATTTGGTT
GCAAATAGTC GGTGCGTATC ATTTGGAGTG GGCGGATTCG GAAGGCTTGG AGCACTACTG
AGTGCAGATC CATCCTGCGC TTATAAGTAT GCCCCGGCTA CTCGTAGTGA CCAATGCACG
GTTCCAAAAC CAATACGAAA CATCTCTGCA GCAAAAGCTG TATTCACGTC GAGCTCGCTG
TTTTCCTCTC CTTCTATTCT ACCAATTTAC AATTCCACAA CAATTCACTC GTGGAACAAT
ACAAAGATGA TTCCGAAGTG GATGAAACAG TACTTTTGTT GGCACCAGGA TGCCCGTCGA
TTGCTTCAGA ATGGAGAAAA GTCAGCTTCA GACTACAAGT ACTTGGTACT GCGATGCTTA
ACGAAGAATA AGAAATGTTC AGGCGCCGCA GATCGGCTAA AGTCAATTCC AACTGCGATA
CGAATGGCAT ACGACTCACA GCGATTGCTT TTTCTCAAGT GGGAAAGACC GTGTGCGCTA
GAACACTTTC TAGTCCCACC TCGCGGTGGA CTGGATTGGC GAATTCCTTC CACTTTAGAG
CTGGATTTTG AAGAAAAGTT CAGCTGGCGT GACAAGGCTA TTGTTCTGAC GGAAAGCAAT
AATGTCGAAA AGGCTCTCAA ATCCGAAGAA GTGATCGTCA GTTTGAAGTC AGTACGCGAC
AGAAAGTATT TTGAGGAACA AAGAGAACCA GGAGACTACA GTTTCGAAGA AGTATATCGA
GAGGTTTGGT CTTCTGTTTT TGAACCATCC CCACCTGTGG CTCGCTTAGT CAGCACGGTC
ATGGAGGAGC TAGGTTTGCG GCCTGGTGAA TATGTTGCCG CCCATGTCCG AGCCCTCTAT
GTGCAAAATA CTGTCAAGAA TCGAGAAGAG ATCAACGCAC TGAACTGTGC TTCGCAATTG
GGACCACGAG CGACGATCTT TTTCGCCTCA GATTCCGCTG AAACAACCCG ACTTGCACTT
CAGTATGGCA GGGGAAAAGA AGCCACCATC GTAGCGCGTA TCGGCGAGAG TGAGCCACTT
CACCTCGACC GTGGGCACGT CTTTTTGGAG CAGCATGGGG TAGTTGCCGG TGAACATGAG
CCTCAAGACT TTTACGATAC ATTTGTGGAT CTTTATATTC TAGCCGAGAG TCGCTGTATA
ACTTACGGGG CAGGAGGGTT CGGAAGCTGG GCGAGCCTCA TTTCCAGAAA CTCACTGTGC
TCTATTCGAC ATCGCACGAC TAATTGCGTT TGGTTTGACG ATCCCATACT GGGATCCCCA
TCTTTATCAG CTATTCGACC CTGA
 
Protein sequence
MSYTKLLGRT IGTLIFLVSM GQIYNHERFN RTSDLSIHAA RLPFLNDSYD KGQRNISQVI 
PYTEETADFE KSSNNSGHRQ KSAPPLPSWV TEYFHWHADE RQKMTPESWE ERRYLVLRCL
ESDKRCGGTA DRLHNLPVLL RLAQKSQRIL FIHWEKPAPL EDFLLPQPPE TSELRLDWRL
PSWLKEPMQL GQIPISRLMN GTDGTGIIDS DERVVAMRSL HGSLFYDELK GPDEPSYNNI
LKLLWYAVFV PAPIVRVRVQ LQLARLGLTP GEYASAHIRS LYIGDETHVD TLYVHAVTCA
AQATNNASFP IFVTSDSPLV EEQAVVFGSA VHHLRIVTHN RSETLHLDRG RDFLTQGRSD
SWKSIDMDGF YDVFVDLYLV ANSRCVSFGV GGFGRLGALL SADPSCAYKY APATRSDQCT
VPKPIRNISA AKAVFTSSSL FSSPSILPIY NSTTIHSWNN TKMIPKWMKQ YFCWHQDARR
LLQNGEKSAS DYKYLVLRCL TKNKKCSGAA DRLKSIPTAI RMAYDSQRLL FLKWERPCAL
EHFLVPPRGG LDWRIPSTLE LDFEEKFSWR DKAIVLTESN NVEKALKSEE VIVSLKSVRD
RKYFEEQREP GDYSFEEVYR EVWSSVFEPS PPVARLVSTV MEELGLRPGE YVAAHVRALY
VQNTVKNREE INALNCASQL GPRATIFFAS DSAETTRLAL QYGRGKEATI VARIGESEPL
HLDRGHVFLE QHGVVAGEHE PQDFYDTFVD LYILAESRCI TYGAGGFGSW ASLISRNSLC
SIRHRTTNCV WFDDPILGSP SLSAIRP