Gene PHATRDRAFT_49735 details


Gene Information

Locus tag: PHATRDRAFT_49735
Symbol:
ID: 7198429
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 105886
End bp: 107023
Gene Length: 1138 bp
Protein Length: 312 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002184486
Protein GI: 219128578
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.15177
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCACAAAAAA TTGCGTAGAA TGTTAATGAT TCACATGGAT CGTCTTACCC ATCAACAGCA 
CCACCACCAT CCTGTGGATG CTCGACATTG GGAAACGGAT TCCAATACAG ATGACGCCAG
GTCCATTAAA TCGGCGAGTA TCAGCGGGAT GCTTCCATTG GCCTCCGACT TTAAACCAGG
TCCTTACGAT GTAATTTGTG CTCGAGGAAA GGCGGCAAAA AACCACGTGG GTAACATTCA
ATATCGGCTT AACGTGGAAC GGACGCTCGA GCAATACAGT GCCGCCAGTA CCAAGCTAGA
AAAGTCCCAA ATCGTTTCCG GCATTGTGGA TTCCATTCGC GAATCCAGTC GCTATGGAGG
TTTTGTCAAA GAAGAAGATG GTCGGTGGTT TGAAGTGGGA GACCACATTG CACGGGAAAA
GGTGGGACAA AGGTAAGCGA TTCAGAGCTA TGAAGCATGC CCGATTGCCG ACGGGTCTTG
TCAAATTTTG CTTATCCGAG TCTTCATCCG GTTTCGCAGT TTTCGCGACA TGCTACACAC
GAAGTATCGA TCGAGTACGA GGGCAAAGAA AAAGCGACGT AAAGAGGAAC AGAGCAAAAT
GGGTGACGAT GTTGACACGT TCATGCTATC CCATGCAAAC GTGGCTTCCA AAATGAAGGA
ACTGTCCCAG ACGGCGCAAC AGCGAGGTAC GTTTGAGCGC CGGGTCTGCT TTGGTGTACG
TGAGAAAGCC TTGAGTATCA CAAACCTTTT GCGCTCTCAC AATTTGTTAA AACCGTAGAA
ACGGACCAAT CTATGGAAGA AATGTTCAAC AAAGCCAATG ACCAACTGTT GCAGGTTCTC
AAGAGGGAAT CGCAACACCA GCAGTTGAAT GAAGCCGAAC ACACACCAGA TGGCCAAACT
TCCCCAAACC TCGATCCGAT CCCCTTCGCG GCGGTCGCTC GTCGCACACA GATCCATCGT
CCTCCGGAAC AAGCACCATA TTCTTACGGT CGCTTTGGGG AGTTGGCGTT TCTGGACGAT
TCTTTGTCCG AGTCACGACC GCAAGTAGCA GAGTCGTCTT TGTCCCACCC GCACATCGAT
TCTGCTTTTT CTGAATTCCA GGCCCCGCAG GATTTCAACC GCAAGCGCAC ACGCTGAA
 
Protein sequence
MLMIHMDRLT HQQHHHHPVD ARHWETDSNT DDARSIKSAS ISGMLPLASD FKPGPYDVIC 
ARGKAAKNHV GNIQYRLNVE RTLEQYSAAS TKLEKSQIVS GIVDSIRESS RYGGFVKEED
GRWFEVGDHI AREKVGQSFR DMLHTKYRSS TRAKKKRRKE EQSKMGDDVD TFMLSHANVA
SKMKELSQTA QQRETDQSME EMFNKANDQL LQVLKRESQH QQLNEAEHTP DGQTSPNLDP
IPFAAVARRT QIHRPPEQAP YSYGRFGELA FLDDSLSESR PQVAESSLSH PHIDSAFSEF
QAPQDFNRKR TR
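
The length and GC content fields in this record can be cross-checked against the displayed sequences. The following is a minimal sketch, not part of the original record: the helper names (clean_sequence, span_length, gc_fraction) are hypothetical, and it assumes the Start bp and End bp coordinates are 1-based and inclusive, which is consistent with the reported gene length of 1138 bp.

```python
def clean_sequence(blocked: str) -> str:
    """Strip the spaces and line breaks that group residues into blocks of 10."""
    return "".join(blocked.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for an empty string."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First displayed line of the gene sequence from this record.
first_line = "GCACAAAAAA TTGCGTAGAA TGTTAATGAT TCACATGGAT CGTCTTACCC ATCAACAGCA"
dna = clean_sequence(first_line)

print(len(dna))                     # 60 bases per displayed line
print(span_length(105886, 107023))  # 1138, matching the Gene Length field
print(gc_fraction(dna))             # GC fraction of this first line only
```

Passing the full gene sequence to clean_sequence and gc_fraction would give the value to compare with the GC content field, and clean_sequence applied to the protein sequence gives a string whose length can be compared with the Protein Length field.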