Gene PHATRDRAFT_47533 details


Gene Information

Locus tag: PHATRDRAFT_47533
Symbol:
ID: 7202759
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 37171
End bp: 38398
Gene Length: 1228 bp
Protein Length: 384 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002181989
Protein GI: 219123350
COG category:
COG ID:
TIGRFAM ID:
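
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the gene span runs from the start codon through a single stop codon with no untranslated regions.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def noncoding_bases(gene_len: int, protein_len: int) -> int:
    """Bases in the gene span not used by codons.

    Assumes the span covers exactly the coding sequence plus one stop
    codon, so any remainder would be intronic sequence.
    """
    return gene_len - 3 * (protein_len + 1)


length = gene_length(37171, 38398)
print(length)                        # 1228
print(noncoding_bases(length, 384))  # 73
```

Under these assumptions, the 1228 bp span matches the listed gene length, and the 73 bp remainder is consistent with the record flagging the gene as spliced.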


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACCA ATTCGCAGAG GAGCAAACAA TCTCCAATCA GTTTTCGATT TCTAATACTC 
ATCGTTGTGT TTGATATCAC ATGCTTCATC CTTGCCGTTC GTGGCCTTCA CAAAATGGAA
GGCTCCTACA ACGACTCTCG TAAAGCACAC GTCGTGCCAG TTTTCCGTCA AAGGATAAAC
GATACTGGCG ATTGCACAGA CAAAGAGCTA CTACTTCAAA TTCTTGCCGA CGCTCTGAAA
AACGCGTCTG CTACAGAAGA CCAAACATAC GGCAACTGCT CCGCTTTGCC CGCTTGGCAA
GAAGTGATCA AGTTGTACGG GTCCAAACCT GTGATCTTGG GTCTGGAGCA CTGCGCAGCT
TTTCGCAACA ATGTCACCCT ACAGGACCCG CTGGGTGGTC TGAGAGTCGC CGGGTTTTAC
AATTCGGGAA CCAACGCTTT GGAACAAACA CTTTTGAGAA ATTTGAACAA CGCGGATACT
GATGGCCGAC AAGAGCTTCC GACCGTAGTA CCTTGGTCTA AGCACAGACC GCTGTGGACA
GCCAAAGAAT CATATTTTCT TGAACATCGC CATGTTCTAC CTGTCGTCGT GGTTCGCGAT
CCGTATCGAT GGATGCAGTC CATGGTAAGT GAAAGTTGTG TTTTGGTTTT CCCAAAAGAA
ACGGTGTCGC TGCTTTTTCT CACTTTCCTT TGGTCAGTGC AAAACCCGGT ACGATTTGTT
TTGGCAAAGG AACCTCAGAC TTAACGGAAT TGAACACTGT CCCAATCTTG TCCCTTCTAG
CCAAGACGAA CAGAATTTCA ACCAAAATCG TTCCACTTTT GCGGTGTGGT TGCGCAATCC
TATTCAAAAT AGCACGCACG ATTCGTTAGC GGCCTTGTGG TCTAGATGGA ACGGTGCGTA
TTTGAATACG AGTATTCCAC GACTAATTGT ACGGATGGAA GACTTGATTT TTCACGGGCC
GGAAATGGTG CAAAAATTAA GTGAGTGTGT CGGCGTTGAC CGGACCGATC CTTATGTCTT
CCTTACCGAA GCTGCCAAGT CCCACGGACG GTCAGCGGAT TTGGCGACCG CCATGATCAA
GTACGGTCGG CGGGATGGCC GCTATGCTGG AATGACGACG CTAGACTTGG CGTACGCAAG
GCATGCTTTG TCAGGCGATC TCATGCAAGC ACTACGTTAC GAATACGATG ATTTTTCGCT
GGACGCAAGT CCAAGAATTC TGTGGTAA
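
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC percentage; the helper names are hypothetical, and the database may compute or round its reported GC content differently.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = "ATGAACACCA ATTCGCAGAG GAGCAAACAA TCTCCAATCA GTTTTCGATT TCTAATACTC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full sequence text through `clean_sequence` before calling `gc_percent` gives a figure that can be compared with the GC content field.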
 
Protein sequence
MNTNSQRSKQ SPISFRFLIL IVVFDITCFI LAVRGLHKME GSYNDSRKAH VVPVFRQRIN 
DTGDCTDKEL LLQILADALK NASATEDQTY GNCSALPAWQ EVIKLYGSKP VILGLEHCAA
FRNNVTLQDP LGGLRVAGFY NSGTNALEQT LLRNLNNADT DGRQELPTVV PWSKHRPLWT
AKESYFLEHR HVLPVVVVRD PYRWMQSMCK TRYDLFWQRN LRLNGIEHCP NLVPSSQDEQ
NFNQNRSTFA VWLRNPIQNS THDSLAALWS RWNGAYLNTS IPRLIVRMED LIFHGPEMVQ
KLSECVGVDR TDPYVFLTEA AKSHGRSADL ATAMIKYGRR DGRYAGMTTL DLAYARHALS
GDLMQALRYE YDDFSLDASP RILW