Gene PHATRDRAFT_46381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46381 
Symbol 
ID7201763 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp166999 
End bp168199 
Gene Length1201 bp 
Protein Length201 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180769 
Protein GI219120043 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.38852 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAGACCGACC TACAAACTAG AAACATCGAC CATGCCGCCT TACGGAGAAC CAGACTGGGC 
CACCCCCGGA AACACATCCA ATGTTGCTAC GCAGAATGCA GGAACACCTA CTGCAGCGAC
AGCTTCTTCA GGAATGAACG GCAACAGCAG TGAATCGCGG TACGTTCGAC TGTCATTGTG
CAGTGTTCCG TTGAGCTTTG GTTGAGTACC GGATTCCGTT TTTGTCAGAT TCGACAAGGT
TTTGCCGCCA GACGAGGGAA TTTTGCTCTA CCGCAAGGAG ATCTATATCT CTGTGGTCCA
TTCCACCACC GTTTTACTCA TCCTGCTGAC TTTTGTCACT TCTTTCTGCA GGCAAAAGGC
TCGATGGGCG ATTTCATTGC TGTCGTTTCT TAATTTCGGG CTGGCTGCTA TGATGGGAAC
TCTAGGTGTT CTCTCCCTCA TCCATTTCAA CCCTGGGAGT TCTTCGGACT ATTCAGCAGC
ATTTCTTTCG TCCTACATGG TCATTTTTGC TGTGATCTTG TTCCTCTACG AACTTATTTG
GTGGACACCA ATTGCCGCAT TGAACAAAAT GTTCCGAATG AATTTCGGTT TCATGTATGG
ATTGCGAGGG AAAGGTCTTT TCTTGGTTTT TATTGCGTTT CTTTGCCTAG GTCTTCGAGA
TGAAAATGCC TCTGGGGTGA AAGGATTGGA CTGGGCAACC GGTCTCGCTT GGTTGGGCGC
AGGATGTTTC AATATTTTTA TTTGGATGAC CTGGTCGGAA GCGTCTGCGG CTTACAAGCC
ACCGACAGCT GGTCTGACTG GACCCAGCGA CAGTAACACC GTTGTGTAGA TCAAGTGAAA
GGATCAAGAG AAGGGATGGT CAGCCAACCG CACTACATCT TAAAGCCGAT TTTCGTGTAC
TTAGCTCTAC TCAAAGTCTA CCTACAGTCT ATGTATGTCG GGCTCACGAC GTCGTCCCCA
TCATTGTTTA CAGGCGGGGA TCCGAGCCAT ACAATGTAGA TCACCCTTTA GTTCAAAATC
GATGAAGGAG TTAGCAATTC ACAGTGATGC GGTACACCGT TTCTGGCAAG CTGGAGCGAG
CATAGCGAAT GTGAATGTAC CAATTTTATG TTGGGTTGTA ATCCTACAAT GCGGCTCCTA
CAGCTGCAAA TTGTTGACCA TGCACATCGA TAACTTATCT AGACGTGGGA CGACGTCTAT
T
 
Protein sequence
MPPYGEPDWA TPGNTSNVAT QNAGTPTAAT ASSGMNGNSS ESRQKARWAI SLLSFLNFGL 
AAMMGTLGVL SLIHFNPGSS SDYSAAFLSS YMVIFAVILF LYELIWWTPI AALNKMFRMN
FGFMYGLRGK GLFLVFIAFL CLGLRDENAS GVKGLDWATG LAWLGAGCFN IFIWMTWSEA
SAAYKPPTAG LTGPSDSNTV V