Gene PHATRDRAFT_47185 details


Gene Information

Locus tag: PHATRDRAFT_47185
Symbol:
ID: 7201960
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 740817
End bp: 742619
Gene Length: 1803 bp
Protein Length: 600 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002181433
Protein GI: 219122188
COG category:
COG ID:
TIGRFAM ID:
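
The start, end, gene length and protein length reported above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length; both are assumptions that happen to match the numbers in this record. The function names are hypothetical helpers, not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(740817, 742619)
print(length, protein_length_aa(length))  # 1803 600
```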


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGACT CGGATGAAGA TTTCTCCTTG GGAGAGGAAG ACGAATACAC AGAATCTGAC 
GGATGCACTA AAAAGCATGC ACTGGAAGAC ATTGGGCCAG AACGCGCGGC AAAATCGGTC
CGGTCGAAGT CTCCAAAGGC GTCCAATCAG ACAAGGGCCC TGATGCCAAG CACACTTGAG
GTACTTTTGC CTATGGAAAG AGAAGCTTCC TTGCTACTCC AGGGCATAGA CTACGATCGA
AATACCTCGG CGCGTATCAT TGTGGCCAGT CTTAGGTTCC AGAAGGAATA TCTCCTGAAA
AAATCCAAAA ATAAGTGTAA GAACACACCC GCTCCTGAAG TTCGAAATAC CGTGTGCCGT
ATGCTTGGCG TTTCACCAAA GGTTTACACT AAAATCATGT CAGGGTACAT CCTACACCGC
TCTGTTTATG CTAGTGGATT GGATGGTCAA GGGAGAGGCG GAAATAGTGG CGCAAAATCA
AGTCGGATCC CCCGAACAAA GAGCGTTTCG ATTGCCGTCC GTGAGTTTGT TCGAGGGGAA
CGGAAACTTC AGAAACTAGT AACGGGACGT GAAGTGCTGG ATTTCTTTAT TAAAGAAGGC
ATATTATGCA TTCCAATGGA TCCGAAGATG GGAGTGTTTG TAAAGAAAGA TTTCCAGACA
GCCTCTCGCA ATGTGCGTCG ATATTTGCAA GACTATGGTT ACAGGCGGGG CCGACGTAAC
AACATTGCCC CAAATCAATC AATGATTGCC AAACGCCATG AGTATGTTCA AGCTCTCTTT
ACCAATGAAG CACTACCAAA AGGGGAAAGG CTGCGCAATG TGTACATGGA TGAAAGTTGT
ATTCATGAGC ACTATAACAA GAACGACAGG AGTGTTTGGG ACCCTAAGGA TGTCTTGGAC
ATCCAGCATG GAAAGTCAAA GCACAAAGGT CGGCGATACT GCTTTGCTGC CGCAATTCAG
GGACCAGATC CTTTTGTTGA TGTTCCTGAA CTTGCATCCG AAAAGGCTGG GTTGGTGCCA
GGAACCATTT GGGCATTCTG TCCACAGAAG AAGGGTAGTG ACCAAGGTGA CTACCATAAA
GTATTCAATG GTGAAAACTT TGTCACTTGG TGGAAAAATC AGCTCCTACC CAACCTGCAT
CAGCCTTGTC TTATTCACAT GGACAATGCA GCATATCATA AGGTATATGG GAGTCATGTT
CCAAAGTGTG GGAAGATGCA AAAGCAGGAG TGTATTGATT ATCTCCATTC CAAAGGTATT
GAGACGGAGG CAGAATGTTC TGCTGTGGTG CTTAAAGTCC GGACAAAGGC CTGGATTGTT
GCAAATGAAA AATTTGAGTG TGTGAGGTTG GCTGAGGAGC AGGGACATAG AGTTCTGTTC
ACCCCACCAT ATCATAGCGA TCTGCAACCA ATTGAGCTTG TATGGGCTTT GATCAAGGGA
AATGTTGGCA GACAGTACAG TTTGGATTCA ACTTTGGACC TTGTGTACCA GCGATTGATG
AAAGAATTTG ACATGTTGCA GGAGTCAGGG CATGATTCCA TTCATTGTAT GATTGTCAAG
TGCACCAATT TAGCGCGACA GTTCAAGGAA GACATTCCAA TGGAGGAGGC AGCTGATGAG
GCATTGGAAA TGGAGGAAGC TGATGATTAT GATGCTTACA AAGCGGGATT AGACGAAGGT
ATACCTCCCG AAAATTATCC GGATGAAAGC GGAGAGGAAA GCGGTGTAGA GGATGTGTTT
GGTGCGGCTA GCGATGCTGA TCTCAAGGAG AACGGAGACA TTAAAGACAC AGTGCAAGTT
TAA
 
Protein sequence
MVDSDEDFSL GEEDEYTESD GCTKKHALED IGPERAAKSV RSKSPKASNQ TRALMPSTLE 
VLLPMEREAS LLLQGIDYDR NTSARIIVAS LRFQKEYLLK KSKNKCKNTP APEVRNTVCR
MLGVSPKVYT KIMSGYILHR SVYASGLDGQ GRGGNSGAKS SRIPRTKSVS IAVREFVRGE
RKLQKLVTGR EVLDFFIKEG ILCIPMDPKM GVFVKKDFQT ASRNVRRYLQ DYGYRRGRRN
NIAPNQSMIA KRHEYVQALF TNEALPKGER LRNVYMDESC IHEHYNKNDR SVWDPKDVLD
IQHGKSKHKG RRYCFAAAIQ GPDPFVDVPE LASEKAGLVP GTIWAFCPQK KGSDQGDYHK
VFNGENFVTW WKNQLLPNLH QPCLIHMDNA AYHKVYGSHV PKCGKMQKQE CIDYLHSKGI
ETEAECSAVV LKVRTKAWIV ANEKFECVRL AEEQGHRVLF TPPYHSDLQP IELVWALIKG
NVGRQYSLDS TLDLVYQRLM KEFDMLQESG HDSIHCMIVK CTNLARQFKE DIPMEEAADE
ALEMEEADDY DAYKAGLDEG IPPENYPDES GEESGVEDVF GAASDADLKE NGDIKDTVQV
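
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The record leaves the translation table field empty, so the sketch below assumes the standard genetic code (NCBI translation table 1); `translate` and `CODON_TABLE` are hypothetical names introduced for illustration. Whitespace is stripped first, because the sequence is printed in space-separated blocks of ten.

```python
from itertools import product

# Assumption: standard genetic code (NCBI translation table 1),
# with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGTGGACT CGGATGAAGA TTTCTCCTTG"))  # MVDSDEDFSL
```

Passing the full 1803 bp gene sequence to the same function is expected to yield the 600-residue protein, provided the standard-code assumption holds for this organism.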