Gene PHATRDRAFT_26387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_26387 
Symbol 
ID7199857 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp42771 
End bp46028 
Gene Length3258 bp 
Protein Length1053 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178916 
Protein GI219116242 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.421794 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGTCG AACAGCTGCA CGTTGTCTTG CAGCAATCGT TTTCGGCCGA TGCGTCGATT 
CGAAACCCTG CCGAACAAAC CATTAAAAAC CTCAAAAACT TGCCCGGTGC CGTCAATCTA
CTCTTGCAGG TCGCTACGGA AAAGCAGGTA TGCTGAACAC AGCTTAACGG AGCTCTGCGA
CGAGTGTAGC CCTCATCCGT CTCACGTAAT CGTACATAAT GTACGCGTTT CTTCCTCATT
GGCCAGGTCC GTTTCGAAGT CCGACAAGCC GCTGCCATTC AACTCAAAAA TATTTGCCGC
GAAGGCTGGG CGGAACGTAT TCATTACGCT CCGTATGCTG AAGAAGCCAC GAAACCAGCT
CTGCTCGCCG ATGAAGACAA AGCAGTTGTG AGGGTCGGCC TGCTCAAGAC GCTCCTCGAC
GAACCAGAAA AGAGTATCCG AGATTTGCTC GCGGAAACCT TACACACGGT GGTGATCCAC
GACTTTCCCG AAAAATGGCC TCAGCTCATT CCCACGCTCC TCGCGAGTAT TCAAACGGGT
GTCGGTGACA TGGGAAAACA CGGATTGCAG GTACACAATG CTCTCCTTGC ACTCCGCAAA
GTTTGCAAGC GATACGAATA CAAAAGCAAG GAGCAACGCG GACCCCTCAA CGAAATTGTC
CAATCGAGTT TTCCGCTCTT GCTCCCGTTA GCGCAGCAGC TATCTGCCGA AAACGAAAAC
TCGCTGGAAG CCGCCATGAT GCTCAAACAG ATTCTCAAAA TTTTTTGGTC CAGTACGCAG
TTCTATTTGC CCGGTGGCGA CGGATCGGAA ACGTCCTCCA TTGGGTTGGC ACGACCGGAA
CAGCTGCAGC CTTGGTTTGA TGTTGTGAGA AGCGCTTTAC AAAAGCCCCT ACCAGAAGCG
TCGACGGGAC TTGAACCACG TAACCAGCCA GTCGATGTCG ATGCTCGCAA CGCGTGGCCT
TGGTGGAAGG TTAAAAAGTG GTCAGTGCAA ATTATGAGCC GACTGTTTTC TCGCTACGGT
ATTCCAAGCT ACGCGGACGA TCAGGAAGCC AAGGATTTTG CCGTTTTCTT CAGTCAAAAC
GTGGCGCCGC AATTCTTGGG GCCTGTCTGC GAAACACTAA ATCTAAGACC CTCGGGAAGT
TTCTGTACCG ACCGGGTAAT CCACTTGTGT TTGACCTTTG TGGACTTGGC GGTCGAGCTA
GCCAGTACCT ACAAGTTGCT GAAGCCACAT TTGGACTTTC TCTTGTATCA GGTGTGCTTT
CCAACAATGT GTTTGACTCA AGAGGACATT GACTGTTTCG ACAACGATCC GGTGGAATTT
GTGCACAAGC AGAACAGTCC CTTGGCCGAC TTTTACGACC CGCGCATGTC CGCGGTCACT
CTCGTCACCG ATCTAGTCAA ACATCGTGGA CAAGACGTAA CTCAGAATTT GTTGGGACGT
ATGACGGCCA TTTTGCACAC TTACAGCCAA GCAGCCCCTG ACCAAAAGAA TCATGTGGAA
AAGGACGGTG CCTTATTGGT GTTTGGCTCG TTGTCGAAGA ATTTGTTGGC AAAAGAAAAG
TATGCTGCCG AGCTTGAAGG CTTATTGGTA TCCTCAGTTT TTCCGGATTT TGGGTCACCG
GTCGCCTTCT TGCGATATCG TGCGTGCTGG ATGGTACAGC AATATAGCAC TGTCCAATGG
TCCGACGATG GAGCTCATTT GCGAACTTTA CTCGAAATGG TTCTAAACCG CTTGAGCGAT
CCCGCTCTCC CCGTACAGAT TGAGGCCTCC AAGGCCCTGC GATTTTTGGT AGAAGCTGAT
GGCGCGGAAG AAACTCTTCT TCCCGTCCTA CCTCAGCTAT TGACAGAGTA TTTTCGTATC
ATGAACGAAA TTGGCAACGA CGAAGTTGTG TCTGCCTTAC AGGCTTTGCT CGATAAGTTT
GGCCGTCACA TTGAACCACA CGCAGTCGCT CTTGTAACAC AATTGACGAG TGCCTTTTCA
CAATATTGTA CAGCCGGGGA AGACGACGAT GATGACGACG CCGCCATGGC CGCAGCGCAG
TGTCTTGAGT GCGTTGCGAC GGTTCTAAAA GGCGTTTGTG GGAAAGCTTC CATGCTGAAA
ACTCTCGAAC CACTACTGAT GCCGCTGGTC TTGAAAATTC TAGGGAGCGA CGGTGATTTT
ATTGAATATT TGGAATGTGG ACTCGATATC TTGACTTTTT TAACTTTCTT TCAAGAACAT
ATTTCGCCAG AAGTCTGGCA AGCCTTTCCT TTGATATATT TGGCTTTTGA TCAATTCGCC
TACGATTATC TGAACATGAT GGTACCTTGC TTGCAGAGTT ATATTGGCAA GTCAACCAAT
ATTTTTTTAA CCGGTACTGC CCAGCTCCCT GAAGGAGACA TTCCGTATAT TGATTTGATC
ATCAGCATAG CCGCCAAGAC AGTCACGAAC GACCGCGCTT CTGAATCAGA ATGCCGGTAC
GCGCTTAGTC TGTTCATGAC GATTCTTCAC AATTGTCCTG GCAAGGTAGA TGGATACATT
CCATTTATGA ACGAGATTGC GCTCGGCAAG CTCGGACAGC AAGTCAATAC CGAGATTCCT
TTGACTCGGT TTTCAATATT TCAGGTTCTC GGGTCTGCGC TCTACTACCA GCCTCAGCTT
GAGTTGATGG AGCTCGAAAA GCGAAGCGTC ACACAACAAG TTTTCACGCA ATGGATAATT
GATGCGGACA AAATGGAGCG ATGGCTCCCA AGAAAGTTGA CCGTGCTTGG TTTGTCTTCC
ATTTTGAGCC TGCCCACGTC GACCTTGCCT GCATCAATCA TCAGCTTGCT ACCGCAACTA
ATTCACATGG CGTGTAAATT GGCACTCGTC CTCAAAGCTG AGGCCGAGCA AACCGAGAAG
GATGCCGACC AACTAATCGA GGAAGCACCT GAAAGGGATG ATGGCGTTGG CGACGTTGAT
CTAGGATTCG ACGAAAGCCA AGATGTGACA AACGAGGTAG ACGAAGCTTA CAGAAAAGCG
CTGCAAGGAG TCTCAGGCTG GGACGATGAC ATGGCAAAAT TCTTACTCGG TGGTTGGGAG
GACGAAGGTG ATGACATTGA CGAAGACTAC AGCTCGCCAA TAGATAAAAT TGACGAGCTC
ATTCTGCTGA ATGACACCAT CAAAATGGCT TTTCAAAGAG AACCTGAAGC CTATCAACAG
ATTCAGTCCG CCCTTCCGCC GGAACCTGTT GCGGTGGTTC AGAATTTATT TGCCAGCGCC
GATATCGTAC GAGCGCAA
 
Protein sequence
MDVEQLHVVL QQSFSADASI RNPAEQTIKN LKNLPGAVNL LLQVATEKQV RFEVRQAAAI 
QLKNICREGW AERIHYAPYA EEATKPALLA DEDKAVVRVG LLKTLLDEPE KSIRDLLAET
LHTVVIHDFP EKWPQLIPTL LASIQTGVGD MGKHGLQVHN ALLALRKVCK RYEYKSKEQR
GPLNEIVQSS FPLLLPLAQQ LSAENENSLE AAMMLKQILK IFWSSTQFYL PGGDGSETSS
IGLARPEQLQ PWFDVVRSAL QKPLPEASTG LEPRNQPVDV DARNAWPWWK VKKWSVQIMS
RLFSRYGIPS YADDQEAKDF AVFFSQNVAP QFLGPVCETL NLRPSGSFCT DRVIHLCLTF
VDLAVELAST YKLLKPHLDF LLYQVCFPTM CLTQEDIDCF DNDPVEFVHK QNSPLADFYD
PRMSAVTLVT DLVKHRGQDV TQNLLGRMTA ILHTYSQAAP DQKNHVEKDG ALLVFGSLSK
NLLAKEKYAA ELEGLLVSSV FPDFGSPVAF LRYRACWMVQ QYSTVQWSDD GAHLRTLLEM
VLNRLSDPAL PVQIEASKAL RFLVEADGAE ETLLPVLPQL LTEYFRIMNE IGNDEVVSAL
QALLDKFGRH IEPHAVALVT QLTSAFSQYC TAGEDDDDDD AAMAAAQCLE CVATVLKGVC
GKASMLKTLE PLLMPLVLKI LGSDGDFIEY LECGLDILTF LTFFQEHISP EVWQAFPLIY
LAFDQFAYDY LNMMVPCLQS YIGKSTNIFL TGTAQLPEGD IPYIDLIISI AAKTVTNDRA
SESECRYALS LFMTILHNCP GKVDGYIPFM NEIALGKLGQ QVNTEIPLTR FSIFQVLGSA
LYYQPQLELM ELEKRSVTQQ VFTQWIIDAD KMERWLPRKL TVLGLSSILS LPTSTLPASI
ISLLPQLIHM ACKLALVLKA EAEQTEKDAD QLIEEAPERD DGVGDVDLGF DESQDVTNEV
DEAYRKALQG VSGWDDDMAK FLLGGWEDEG DDIDEDYSSP IDKIDELILL NDTIKMAFQR
EPEAYQQIQS ALPPEPVAVV QNLFASADIV RAQ