Gene PHATRDRAFT_46564 details


Gene Information

Locus tag: PHATRDRAFT_46564
Symbol:
ID: 7201847
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011678
Strand:
Start bp: 731591
End bp: 733049
Gene Length: 1459 bp
Protein Length: 448 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002181063
Protein GI: 219120658
COG category:
COG ID:
TIGRFAM ID:
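
The gene length listed above matches the start and end coordinates if both are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the check, using a hypothetical helper named `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
print(span_length(731591, 733049))  # 1459
```

Under this convention the result agrees with the 1459 bp reported in the record.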


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0400699
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCTGTCGGA GCTTTCCTTT ACAACCTTTA AGACCCACAA GGTCATACAT GTATTGCAAT 
ATGGTACCAC AGAGGAGAGG AAGTAGAAGA CGACGACGAC AATCGCATTT GACGCAGCAA
TGTACTCGCG TCGTTTGTCT TCTGATATTG GGGAAAACGG CCTCCCAACC CGCTGAAGAA
ACGGTCGCCG CCTCGCTACC ATTGTCGCAA CCACACCTTC GCAGACGTCA CGACAACGGT
AATACCGTGG AACTTGTTCC GAATGCGACC GTCCGTCTGC CTCTGCACGC CGTCGCGGGT
ACGCATCACG TGACGGCTTG GATGGGGGAA CCGCCGCAGG CGCAAACGCT GATTGTCGAC
ACCGGGTCGC GGTTGACGGC GACCGCGTGC GAGCCCTGTT CGCAATGCGG GACGACGCAC
GCACACCCGT TCCCCCATTT GGACCCCCAG CGGTCCAGCA CGCTGCGATA CACGCAGTGT
GGATCCTGTC TGCTCAGCGG CATCCAGGAA TGCGCAGCGG AACAAAAGTG TGGTATTAAT
CAAAGGTATA CCGAAGGCTC CAGCTGGACA GCAGTGGAAG TCAGCGATAC GTTTGTCCTG
GGAGGACCGG AAATATCCAG TTTGGAACAG TACGTGAGCT TTACGATTAT CTTTGCGTTC
GGATGCCAGC AAAAAGTCAG GGGATTGTTC CGAACACAGT ACGCCAACGG TATATTGGGT
TTGGAACGGT CCGACCTCTC GCTCATTAAG CGATTGTGGA AGGAAAATGT CATTCCTCGC
GAGTCGTTCT CCCTATGCAT GACACCTTTT GAAGGCTACA TTGGACTGGG AGGACCACTA
CGAGACAAGC ATACGGAATC GATGAAATAC ACGCCGTTCA CTTCCACTCA GAGTTGGTAT
GCTGTCCACG TAGTCCGAGT GTTTGTAGGG GACGAATGCT TGACAAGCAA TGACCAGCAC
GACACTGTTG TCGAGCATGC ATTGGTCGAA GCCTTTGCAG AGGGCAAGGG TACTATACTG
GACTCGGGAA CGACGGACAC GTATCTCCCC AAGGCAGTTG CGGGTCGTAT GCGAGAAATA
TGGGCGCGCC TTTCCAACAC GCCCTTTCAA CCGTCGAGCA CGTACGCCTA CACATACGAT
GAGTTTAGAT CGCTGCCCAT CGTGACCTTT GAGCTCGCCA ACAACGTAAC CTTACAGGCC
CTGCCTAAAA ATTTCATGGA AGACCTTCCC GAGCCTTTGC GGCCCTGGAC GGGACGGAGG
AAACTAATGA ACCGCCTGTA CGCGGACGAA GTACAAGGTG CCGTGGTGGG ATTGAATACA
ATGGTGGGCT ATGACTTGCT CTTTGACGTC CAAGGCAATC GTTTTGGTGT CGCCCCGGCC
CTATGTGGAA TTGCGAACAG TACACCAGCA GCGACTCATT AAAACGGAAG CGTTTGTAAA
GGTTTTTTTT GACAATTAA
 
Protein sequence
MYCNMVPQRR GSRRRRRQSH LTQQCTRVVC LLILGKTASQ PAEETVAASL PLSQPHLRRR 
HDNGNTVELV PNATVRLPLH AVAGTHHVTA WMGEPPQAQT LIVDTGSRLT ATACEPCSQC
GTTHAHPFPH LDPQRSSTLR YTQCGSCLLS GIQECAAEQK CGINQRYTEG SSWTAVEVSD
TFVLGGPEIS SLEQYVSFTI IFAFGCQQKV RGLFRTQYAN GILGLERSDL SLIKRLWKEN
VIPRESFSLC MTPFEGYIGL GGPLRDKHTE SMKYTPFTST QSWYAVHVVR VFVGDECLTS
NDQHDTVVEH ALVEAFAEGK GTILDSGTTD TYLPKAVAGR MREIWARLSN TPFQPSSTYA
YTYDEFRSLP IVTFELANNV TLQALPKNFM EDLPEPLRPW TGRRKLMNRL YADEAIVLVS
PRPYVELRTV HQQRLIKTEA FVKVFFDN
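
The sequences above are displayed in space-separated groups of ten characters, so the whitespace has to be removed before computing lengths or base composition. The following is a minimal sketch of that cleanup, not the database's own method. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the GC calculation assumes the sequence contains only A, C, G and T.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Paste the full "Gene sequence" block here to check it against the record;
# only the first two groups are used in this illustration.
block = "CTCTGTCGGA GCTTTCCTTT"
seq = clean_sequence(block)
print(len(seq), gc_fraction(seq))
```

Applying `clean_sequence` to the complete gene and protein blocks and taking `len` of each result gives values that can be compared with the gene length and protein length fields in the record.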