Gene PHATRDRAFT_46447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46447 
Symbol 
ID7201548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp368238 
End bp369517 
Gene Length1280 bp 
Protein Length275 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180813 
Protein GI219120136 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTTACCTTA CCTCCCACCA ACGGCAAGAG ACCGAATTGG AACATTGTTC GAAGCAGCTG 
TAAAATTGTT TCATGCTGCT GACCAGTCAG AGAAATTTGA AACGTGCACT TGGCGCTATG
TACAATGTAG TGTTCGGATC ATCATTTCAT CCCGACCTGA CTAGGCAGCA GTCTTGAACT
CCATGAAATG AGAAAATCGA ACCGAAGTGG CCTTTTCTCT ATCAGCAACA ACGCGAAGGA
GGCTTTTAGT TGCTCGACGT AAATATTCTC GAACACATGC ATTACATGAT AGGTTGCGCG
ATCGCCTGCG AGAGAGGCTA CAATTGCATA GATGCGAAGA CAGGCCTATC AAGAACGGAA
TCTGAACAAG GACAATCAAG GTCACAAAAA ACGGAAGCAC AGACGATTAC TTTGAGATTA
TTACGAAAGC CATGAAACTA TCCTATGTTA CTTCACTACC AGCAATGACT TCGGCTTGCA
ACACCGGGCA ATACCGGATG GGACGCTCTT GCATGAATGG TAGTAACTCC ATTCGGAATT
GTGCTAGACG TCACCATTGC GGAGGCAATC GCCACGGCTG GGGTCATGCG TGCCGGAATG
GGGTGGTTCA TGAGGATGCA GCGGAAAAGT TGCTTGTGTC TAGCTGGGGT CCTCGTAGTC
TTGGTCACCA CAACTGTGCG CCGGGAATCG GACCCCATGA AGGACAGCAA GATACGAAAA
TGTGCTTTGG GCAGGGACAC CCAGGCAGTC AAAAAGGCCA AGGCTGGAAC TTCTGGCAAT
CTAGTTCCAC TACGCAGGGT CAAGTGCTTA GCCCGGCTCA AGGAAGAGGA CGCCGGCAGG
GTTTAGGCAT GGGTTGGCGC TCGACGGAGG AAAGTCAGCC ATGTGAATTG AGAAACATGG
TTGGACAGAG TTCCCATGCG CCGTTGGTTG ATATCGTTAC AGATGATGAC CATGTATTTC
AAATAGCTTT TGACTTGCCC TATGCAAAAC CATCGGACAT TGAAATCTCT GTCAATAGAC
AGGATAGAGT CTTGACTGTT TCGGGTATGC GTCAAATTGG ATTTGGAAGT GAAACTTCTA
TGATCCCTTT CTTGGAGCGC ATTTCCATAG ATTCATGGAT CAGTATGGAT CGATTCTCGG
CGAAGCTAAG CAACGGATTA TTGCTGGTTA CAGCCCCAAA AGAATTTGAT GCGAAAGACA
GCTTTGTCCA GAAGATTTCC ATCCAGGATG TCGACACTAA AGAGCAAGCT ACGGATTAGA
AAATGTAAGT TATAAAAACC
 
Protein sequence
MKLSYVTSLP AMTSACNTGQ YRMGRSCMNG SNSIRNCARR HHCGGNRHGW GHACRNGVVH 
EDAAEKLLVS SWGPRSLGHH NCAPGIGPHE GQQDTKMCFG QGHPGSQKGQ GWNFWQSSST
TQGQVLSPAQ GRGRRQGLGM GWRSTEESQP CELRNMVGQS SHAPLVDIVT DDDHVFQIAF
DLPYAKPSDI EISVNRQDRV LTVSGMRQIG FGSETSMIPF LERISIDSWI SMDRFSAKLS
NGLLLVTAPK EFDAKDSFVQ KISIQDVDTK EQATD