Gene PHATR_33150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_33150 
Symbol 
ID7204270 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp187832 
End bp189168 
Gene Length1337 bp 
Protein Length362 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186295 
Protein GI219113423 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.461728 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCAA CTAAGGGAGT TGCCGGTCTC TTTGCACTTA CGGCTTCTGC TGCCTTCCTA 
TCCATTGCAG GAATCTTTAC TGCTCAGGGA ACAGTCCAGA ACATTCAAGG TACCACTGCC
CAGCTCAAGG ATTCACTAAT GAATATGGAC ACTTTATTTC GGTCCGAGGC CAATTTCCAG
ATTATTTCCG CTCATGATCC GTTCGCTTTC AGCTCCAACA AATCCGCAGA GCAATACATT
GAAACATCGG ATGATAGCAT GGCGGAATAT CAGACAGGGC CCATTCGAAG CGATACTAGA
CTGGAAAGGA ACGGTAAAGA GCCGGAAAAA GACTTTCGCA AACCAGTCTG GAGTAACTCT
TCTGATTATA GCATCGACTC CCCGACAATT CTTGTCCAGC TTGGGGGTGA GCTAGCAAAT
AATCTGGGCC ATATGGCTTG AGGGTTTGGA CTGGCATGGT GGCTGGAGCG GGAATTTGGT
TTAAACGCCA CCGTCATGCT GAGACACGGA GTGATACATG CTAAGTGGAC GAGCGCAGAA
GGGATGCGAC ACACTGTTTT CCATACTTGC AAGACTTCAA TTTTTCGGCC GGAAATACTG
ACAGTATCAC TAAGGAGTTG CAGTCTTTGA AACAGCGTGG GCCAGACAGA ATCATTGCCG
AGAAGAGAGT TTTTCAAATA GACAAGAAGG GACCGCTTGA CCAAGGTTTG AAATCTTTTG
TGAGTCTTTA CGCTGCCAAT CACACAAACA TGGGGCAAAG GAATGGTTAC ATCACCATTC
CTTTCCTAAC TACACATCAA ATGAGCGCAA AAGACCTCAT TGTGGACAAG TATTATAACG
ACATTCGCAG AATATACCAT TTTGATAAAA GCTGCTGCAT GGATGTACCC AACCCGGATG
AATCCGTTTT TGTAAGTGGT ATTTCAGCTA ACAAGCATGT ACAGGAAAAA GATTTCGCAA
GACTTTCTCA CTGAGTCTTT TTGTATTTCA AGCATTTTCG CAATTTTATT AGGGAAAGTG
GTAGACTACG ACATCGTGCA GGCTACGAAG AGCTAGCTCC TGAGCAAGTG GCAAATGAGT
TGTTTGCGCA TTTGAATCCT GGCGACAAAG TAGCTATTGC ATCACGATTC TCCGATGATT
TTCGAACGCA AATGATTGTG GACGCTCTTG AGAAGCGACA GCTTCGGGTT CGAGTCACGG
AGCCACGATC GGGGGTTGCG GATTTTTGCT TCTTGCTGTA TGCCCAAAAG GAGTTGGTTG
GCACGGCCAA ATCTCTTTTT TTATTTGGGC TGGCCTACTT GGAAATGCCA CAAGGGTTCG
ACCTTATACA GCAATGA
 
Protein sequence
MASTKGVAGL FALTASAAFL SIAGIFTAQG TVQNIQGTTA QLKDSLMNMD TLFRSEANFQ 
IISAHDPFAF SSNKSAEQYI ETSDDSMAEY QTGPIRSDTR LERNGKEPEK DFRKPVWSNS
SDYSIDSPTI LVQLGDFNFS AGNTDSITKE LQSLKQRGPD RIIAEKRVFQ IDKKGPLDQG
LKSFVSLYAA NHTNMGQRNG YITIPFLTTH QMSAKDLIVD KYYNDIRRIY HFDKSCCMDV
PNPDESVFHF RNFIRESGRL RHRAGYEELA PEQVANELFA HLNPGDKVAI ASRFSDDFRT
QMIVDALEKR QLRVRVTEPR SGVADFCFLL YAQKELVGTA KSLFLFGLAY LEMPQGFDLI
QQ