Gene PHATRDRAFT_49158 details


Gene Information

Locus tag: PHATRDRAFT_49158
Symbol:
ID: 7195655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 95076
End bp: 96559
Gene Length: 1484 bp
Protein Length: 462 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002183926
Protein GI: 219127404
COG category:
COG ID:
TIGRFAM ID:
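
The listed gene length matches the start and end positions if the coordinates are read as 1-based and inclusive. The page does not state this convention, so it is an assumption here, and the function name `gene_length` is hypothetical. A minimal sketch of the arithmetic:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Start bp and End bp as listed in the record above.
length = gene_length(95076, 96559)
print(length)  # 1484
```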


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.472791
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCTAGCTACT ACTGATTTAG TTACGACTAC TACTACGCTT GTCAGTTCCC TCCATTGCTT 
CATCGACTCA TTCAAATCAA AGGAGTAAAC CATAGAATGA GCAGAAATCG TTCGCGAATC
CCGCGGAGAC GGTACACGTG GCCAAAGGCA GCGGTAGTGA TGACGGTAAT GGGTTGGGGA
CTCTTGCTCC TCGTATGCAC CGAAGCGGCC CAGTCGTCGT CGTCTCCTTC GGGCACCAAC
ACTACCCGTA GTCGGAGTAC GGCTACCGGT ACTACCGACG AATCGTCCAC GAGCGTTACG
CGGGTGGGGA ATCTGGATTA TCTCGACGCC GCCACCATGG CGTACTACTT GGATGCCCCG
CGGTGGACCG CCGATCATCC GGAACACGAC GTCGTCGTTC TCTTTTACGC ACAGTGGTGT
CGCAACTGTC ACGCCTTTGC CCCTCTGTAC GATCAAATGT CGAAATTACT ACACGCCGGT
ACCAAAGATT CGCAACTAGT AATGGGATTG TTCGATTGTG AACAGGACAA GGCGCATTCT
CGAGTTTGCA GTGACGCTGG CGTTACACAC TATCCCACCA TCATGTTCCT CTCCTCCAGC
GGACAAGTCC TCCAACGCGG ACGACGGTCT CCGAAAGTCC CACTGCCCAA ACACATCACC
ACCTTTCGTG GTAATTGGCA GTACGGCGAC GCCGTCATGG ATTGGATCAA GACACTGCGA
GGGTTGAGTC ACTGGCATCG CGCTGGATGG GGGAAAACGC TCCGGAATCT TCTCTTTGGA
CGACGCCACC GTGATCCCGC CCGCGAACGA CTCCCCACCG GAATTCCGTC GGGTACAAAC
CGGCGGGACG GAACCACGAC GGCCGGGAAC GGGGCCACCC ACGCCTCGCA CGAACAACAA
GAATTGCGAG ACGAAATCCG ATCCTTGTCC GATCTCGTCA TTCGCTCCAG TACCATTGTG
GACGCCCTAT TGTTCCCCGT CACCGCTGCC GCAAACAAGA CTCTATCCAC CAATGTGATA
CGCGACGAAA ACGCCAAGAA CTACACCGAC GTCTTTGCCT TTTTGCGAGA CGCCTGGCAC
AGCAACCGCA CCTCCCACCA AGTAATACGA ACCTGTGCCA TGGAAGTGGC GTTAGACTAC
TGTGGACGCT TGAATACGCA CGTGACGGAG GATTGGTTGA CGGCATTTCC CTCCATCGAC
CGCATTACGG AAGCCGATTT GAATCTGTTC CGGAACGAAT TACCGAAACT TGTGGCCAAG
CAGGAACCCT ACTGTGCGGT AGTGGAAGAC TGCATCGTTG GTGATTTTGC CGAAGAGCAT
TGTCGCCCCG CTGCTTGTCC CTTTGTCGAT CCCGCCGCGT GCCGGTATCT GACGTGCTGT
CTCACCGAGC AAGTCTACGA AGAATACGCC GTGGCCATGG ATTTGGTTGA AAACGTCACG
GCGGGCACTT CCGCAAACAT TGACGCAGCC GACAAAGATA CGCC
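
The sequence is displayed in space-separated groups of ten bases, so the whitespace has to be removed before any computation. As a rough sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical, not part of any database tool), the GC content reported in the record could be recomputed like this:

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence; paste the full block to
# compare against the GC content listed in the record.
first_line = "TCTAGCTACT ACTGATTTAG TTACGACTAC TACTACGCTT GTCAGTTCCC TCCATTGCTT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```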
 
Protein sequence
MSRNRSRIPR RRYTWPKAAV VMTVMGWGLL LLVCTEAAQS SSSPSGTNTT RSRSTATGTT 
DESSTSVTRV GNLDYLDAAT MAYYLDAPRW TADHPEHDVV VLFYAQWCRN CHAFAPLYDQ
MSKLLHAGTK DSQLVMGLFD CEQDKAHSRV CSDAGVTHYP TIMFLSSSGQ VLQRGRRSPK
VPLPKHITTF RGNWQYGDAV MDWIKTLRGL SHWHRAGWGK TLRNLLFGRR HRDPARERLP
TGIPSGTNRR DGTTTAGNGA THASHEQQEL RDEIRSLSDL VIRSSTIVDA LLFPVTAAAN
KTLSTNVIRD ENAKNYTDVF AFLRDAWHSN RTSHQVIRTC AMEVALDYCG RLNTHVTEDW
LTAFPSIDRI TEADLNLFRN ELPKLVAKQE PYCAVVEDCI VGDFAEEHCR PAACPFVDPA
ACRYLTCCLT EQVYEEYAVA MDLVENVTAG TSANIDAADK DT
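
The protein sequence can be cross-checked against the gene sequence by translating from the first ATG. The record leaves the translation table blank, so the sketch below assumes the standard genetic code; `translate_from_first_atg` is a hypothetical helper, and it expects only the bases A, C, G and T.

```python
# Standard genetic code (assumed, since the record does not name a table),
# laid out in the conventional T, C, A, G ordering of codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the last full codon."""
    seq = "".join(dna.split()).upper()
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for pos in range(start, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragment from the second line of the gene sequence, spanning the start codon.
fragment = "CATAGAATGA GCAGAAATCG TTCGCGAATC"
print(translate_from_first_atg(fragment))  # MSRNRSRI
```

The output matches the first eight residues of the protein sequence listed above.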