Gene PHATR_33493 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33493 
Symbol 
ID7203883 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1065809 
End bp1067937 
Gene Length2129 bp 
Protein Length684 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186178 
Protein GI219113189 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGTCT TTGCGGTCTT GGCTTCCACG AGCGCCCGGA GGGCGGCTGC CTTTATCCCA 
GGTCGATCGC GGTCGGCAGC TATCACAAAA GTATTTGGTG GCCTTCCTCT TCAAAGATCA
CACTCGTTTC ACTTTTTGGA ACAAAAGCAA TCGTCGCGTC TCTTCGCGAC CATCGCTGAC
GAAGCTGCAA CTGTCGAGGA AGAAGAGCTT GTGGAACTAC CCACCAACGA CAACGATGAA
GAGTTATTAC GTATCCGACA TTCATCGGCT CATGTTATGG CAATGGCGGT TCAGCGGATT
TTCCCGGAGG CCCAAGTGAC AATTGGTCCC TGGATCGACA GCGGCTTCTA CTACGACTTT
TTCTTTCCCG AAAGGACCGA TGAAGAATCG GGAGAAACTA TTCCGGCACG GAAGCTTTCC
GAGCAGGATT TGAAAGACGT TAAAAAGGCA ATGGATAAGA TCATCGGGAA AGACTATCCG
ATTTCTCGTG AGGATGTTAG TCGTGAAGAA GCGAGACGGC GCATTGAGAA AATCGGTGAA
CCCTTCAAGC TAGAAATTCT AGACAGTATC AAGACCGAAC CTATCACGCT TTATCACATT
GGCGACGAAT GGTGGGACCT GTGCGCTGGT CCGCACGTGG ATTCAACGGG AAAGCTTCCC
AAAAAAGCTA TACAATTGCA AAGCGTGGCG GGAGCCTACT GGCGTGGCGA CGAGAATCGC
GAAATGCTCC AACGCATCTA CGCCACGGCC TGGAAGGATC CGCAACAGCT CAAACAATAC
AAGAAAATGC TCGAAGAAGC GAAGAAACGC GATCATCGTA TGCTGGGGAA GAAACTGGAT
CTATTCTCGA TCCAAGAAGA CGCCGGTGGA GGCCTCGTAT TTTGGCATCC CAAGGGGTCC
AAGGTTCGCA GGATGATTGA AGATTTCTGG AAAGACGCAC ATATCCAAGA TGGCTACGAC
GTTGTCTATT CGCCACACAT CGCAAATATT GATTTGTGGA AAACTTCTGG CCATTTCGAC
TTTTACCGTG CGGACATGTT CGATCAAATG GATGTTGAGA ATGAACAGTA TCAGATCAAG
CCAATGAACT GCCCGTTCCA TTGTCTAATG TATAAGGATG AGCTTCGATC ATATCGTGAT
CTGCCCTTTC GGTGGGCGGA GCTAGGAACG GTATATCGGT AAGTGTAGAA AATTTGGGCA
CAGACATTTA TCAAACTGTA TAACTTCTGT GCTAACGTCT CTGGTGCATT AGGTACGAGC
GTTCTGGAAC TCTGCACGGC CTCATGCGTG TCCGTGGTTT CACTCAGGAC GACGCGCATA
TTTTTTGTCT TCCGGAACAA TTGCAGGATG AGATTGTTGG TGTTCTCGAC TTGACTGAAA
CGATCTTGTC ACGTTTTGGC TTTGACAAGT ACGATGTTAT GCTCTCGACT CGTCCCGAAA
AGTCGGTAGG TTCGGACGAG ATTTGGGACG CAGCTACTGT TGCGTTAGAA GGAGCCCTCA
AAGTCAAAGG CTGGGATTAC AATGTTGACG AGGGAGGCGG CGCCTTTTAT GGGCCAAAAA
TTGACCTCAA AATCCGAGAT GCGATTGGCC GGCAGTGGCA GTGCTCTACA GTCCAATGCG
ATTTCAATTT GCCAGAACGC TTCGGACTCG AGTATGCAAG CGCCGATGGC ACACGAGAAC
GCCCTATCAT GGTGCATCGT GCTATTTTTG GTTCTATCGA GCGGTTTTTT GGAATCCTAA
TAGAGAATTG TTCAGGCGAC TTCCCACTTT GGTTGGCTCC GACGCAATTA AAGCTGTTAC
CAGTCACGGA TGCTGTGCAT GAGTATTGTA ATGAGATTGC AGCGAAGGCT CTAAAAATGG
GACTCCGTAT CGAAGTAGAT CGCGGAAGCG AACGATTGGC TAAGCAAATT CGCAACGCTG
AACAGGCTCG TATTCCAGTT ATGGCGGTGG TTGGTATGAA AGAGATGGAA TCAAATTCTC
TTGCTGTGCG TAGTCGGAAG CTGGGAGATC TCGGGTCTTT CGAAGTGGAA GATCTGTTGA
GCGAATTGAA ACGTTGCGCT GCAGCTGCAG AGGAGATGAC TCTGGTCGGC GAAAAGGAAG
GAAAGATTGA CAGCGAGTCC GTCAACTAA
 
Protein sequence
MSVFAVLAST SARRAAAFIP GRSRSAAITK VFGGLPLQRS HSFHFLEQKQ SSRLFATIAD 
EAATVEEEEL VELPTNDNDE ELLRIRHSSA HVMAMAVQRI FPEAQVTIGP WIDSGFYYDF
FFPERTDEES GETIPARKLS EQDLKDVKKA MDKIIGKDYP ISREDVSREE ARRRIEKIGE
PFKLEILDSI KTEPITLYHI GDEWWDLCAG PHVDSTGKLP KKAIQLQSVA GAYWRGDENR
EMLQRIYATA WKDPQQLKQY KKMLEEAKKR DHRMLGKKLD LFSIQEDAGG GLVFWHPKGS
KVRRMIEDFW KDAHIQDGYD VVYSPHIANI DLWKTSGHFD FYRADMFDQM DVENEQYQIK
PMNCPFHCLM YKDELRSYRD LPFRWAELGT VYRYERSGTL HGLMRVRGFT QDDAHIFCLP
EQLQDEIVGV LDLTETILSR FGFDKYDVML STRPEKSVGS DEIWDAATVA LEGALKVKGW
DYNVDEGGGA FYGPKIDLKI RDAIGRQWQC STVQCDFNLP ERFGLEYASA DGTRERPIMV
HRAIFGSIER FFGILIENCS GDFPLWLAPT QLKLLPVTDA VHEYCNEIAA KALKMGLRIE
VDRGSERLAK QIRNAEQARI PVMAVVGMKE MESNSLAVRS RKLGDLGSFE VEDLLSELKR
CAAAAEEMTL VGEKEGKIDS ESVN