Gene PHATRDRAFT_48969 details

Gene Information

Locus tag: PHATRDRAFT_48969
Symbol:
ID: 7195247
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 190622
End bp: 192112
Gene Length: 1491 bp
Protein Length: 470 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002183567
Protein GI: 219126655
COG category:
COG ID:
TIGRFAM ID:
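
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed gene length; the function names are illustrative and not part of the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode a protein, including one stop codon."""
    return 3 * (protein_length_aa + 1)


gene_length = span_length(190622, 192112)
coding_length = coding_nucleotides(470)

print(gene_length)                  # 1491, matches "Gene Length"
print(coding_length)                # 1413
print(gene_length - coding_length)  # 78 bp of the span is not coding sequence
```

A listed span longer than the coding length is what one would expect for a record marked "Is gene spliced: Yes".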


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.731324
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GGCAAGCATG AGAAAGCTAA AGAGGCGCAA TGCTCCGTCC AATGATGAAG ATGATCGATT 
GCCCTTGTAC GATTCGTCCA CTGCCACAAG CTCCAACGGA CCGTCGTCTC GAGCCAAGCA
GCGCAAACGA AGATCGACCG GACCTTGTTG GCCATGGGCC AGACTAGTAA TGGTATCCGG
TATTGTTCTG ACAGTCTACT GTGGTTGGAT CTGGTGGAAA GCACCCGATC ATAAACCTCC
CATTCCCCCC ATTCTACATC GTGCGTTTCC CTACAAACGT GATTGGTACT TGCCACGGAT
ACGAGACGAT GTCAAACTGG AAGAGTGGGA TGGCCCACAG CTAATACACG TGGTTCACAC
ACGGTTCATG CAAGAACAGC CCAGCTTGAC GTCTTTGGGA CGTGCACGAC TGGGGCTCTT
TCGCGTCTTT TGTCTACCTA CGATGATTGA GCAAACCACC AACCATTTTT TGTGGATTAT
TAAGACGGAT CCCGACCTCG ATGCCGAAAT TATGCAAGTG CTGGTGGATT TGGTCTCTCC
TTATCCCAAC TTTTTCCTCG TAGCATCCAA CGTCAATTTT CGTATCAACG AAGATTTTCC
TGGCGCTTGG CGGGATGGTG CCCAGGCCAG GGACTTGGCC CTGTCTCGAA CTTATACGGG
CAATCAAACG CTTCTCGAAG TCGCCATGGC GTTGGAAGCC CAGCTACCCA TACTCGAAAC
CCGGCTCGAT GCCGATGACG GATTACACGT TGAATTTCTG GAACAAATGC AGTACCAGGC
GACCAAGGCC TTTCGACAAA GTGCACTTAA ATGGATGTAC TGGTGTACAC GACGGCATAT
GGAATGGCAT TGGATAGACG AAGTACCGTC CTCCTTCGAA CACGACTCGC CACTTGGCCA
AAAGATTGTG GAATACGGGG CTTTGCAAGG TGTGCAACAT TCCAACCTCT GTATTACAGC
CGGATATACA GTGGGCTTTC CAGTCGGGGT GTCTGAACCA GACGTACCCG TGTATCCTCA
TCAAGATTTG GTGTCCATGA TTCGAAAACT ACCATCGGAA AAGGCTTGTG GATTGAAACC
GAGTGAAAAA TGTTTGCAGT TTGTTGAAGA ACACATTTTT GAAGCAGTTC GATCCCGAAC
GCACACCTCG GCCGGGATGC TGAAGGTGCG ATTAGAGCAA GACGGCCTGG TGAATACTCC
TTGGTTGTCC TACGCGTACT GGGATCTGTT ATGCAAGAGT TTTGAAATTC AGCGAATGCA
AGTGCGGTGG ATGAACGAAT ATCTGACATC CCACATTATC GACATTGCCC GAGACAATCT
TCTGGGACAG TGCACTCTGG GTCACAGTTG CAAGGTATGT GAGAGAATTA ATCCTTAAAC
CTTGTATTGG TAAATTTGAC AAACCTGACT TCTTTCTCTT CTCAGGACTC GGCCAAGGAA
GAGCTGGCAA AGGTGATTGA AAAGTACAGA AACCGAACAA CTTCGGGCTA G
 
Protein sequence
MRKLKRRNAP SNDEDDRLPL YDSSTATSSN GPSSRAKQRK RRSTGPCWPW ARLVMVSGIV 
LTVYCGWIWW KAPDHKPPIP PILHRAFPYK RDWYLPRIRD DVKLEEWDGP QLIHVVHTRF
MQEQPSLTSL GRARLGLFRV FCLPTMIEQT TNHFLWIIKT DPDLDAEIMQ VLVDLVSPYP
NFFLVASNVN FRINEDFPGA WRDGAQARDL ALSRTYTGNQ TLLEVAMALE AQLPILETRL
DADDGLHVEF LEQMQYQATK AFRQSALKWM YWCTRRHMEW HWIDEVPSSF EHDSPLGQKI
VEYGALQGVQ HSNLCITAGY TVGFPVGVSE PDVPVYPHQD LVSMIRKLPS EKACGLKPSE
KCLQFVEEHI FEAVRSRTHT SAGMLKVRLE QDGLVNTPWL SYAYWDLLCK SFEIQRMQVR
WMNEYLTSHI IDIARDNLLG QCTLGHSCKD SAKEELAKVI EKYRNRTTSG
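
The sequences above are printed in blocks of 10 residues, 60 per line. A minimal sketch of how such a listing could be joined into a single string and checked against the Gene Length, Protein Length and GC content fields is given below. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input; the full listing would be pasted in the same way.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a blocked sequence listing and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "GGCAAGCATG AGAAAGCTAA AGAGGCGCAA TGCTCCGTCC AATGATGAAG ATGATCGATT"

seq = join_blocks(first_line)
print(len(seq))          # 60 bases on a full line
print(gc_fraction(seq))  # GC fraction of this line only, not of the whole gene
```

Applied to the complete gene sequence, `len(seq)` should agree with the Gene Length field, and `gc_fraction(seq)` can be compared with the reported GC content.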