Gene PHATRDRAFT_49589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49589 
Symbol 
ID7198203 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp133558 
End bp134719 
Gene Length1162 bp 
Protein Length332 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184310 
Protein GI219128208 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.966771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGTTG AGATTATACC GGATATCGTC GATGACGGGA GCGAGTCCTC ACAAAGCTAT 
AACGTCGACT TTTCTCGTCA ATACGCCTGC GAGCCTTCCA CGTCTGCCTT TCCGAGTGGG
AAGTACGATC GGGAAGGTGA AGGCCCCTGC ACTGAATGCG GGATGCAGAC TCATGATATC
CAACACGATT TCCTTAGCCA ACATCAAAAA GTTCGTTTGA ACGTTGACCA AGAATTACGC
AGGGGGCAGT GCCTCCTTTG CTTCCCCATC GCCTCGGAAG CTAGTCTTAA CCGGGTTGAC
CCAAGCCGAC AATCAAAAAC TTGCGCCGAT GACAGTACGG ACACACATTG GAGTAGATGC
AGCAAACGCA TGAAGCAGAC ACACCACATC GTTGCACGTC GTGTCTCCCA ATCCAACATG
TTCAAATTCA ATCATTCTCT TGATGTGAAT GCTGAAATTG AGGAAATAAA GATCGGAGGA
AGCTACGACA TCGCGGACAT TCTTTGTGCA ATGAAAACCG CCCCTCACGA CCACCTCATT
CAAGAGCTCG GCTGCGAAAG TCTATGGATA CTCTCCTGGG AGGATGAAAA TGCAAGTGCC
ATTGGTTGCG TCGGAGGGAT TCCAATGGTG CTCAACGCCA TGATTCGCTT TCCTATGAAT
TCGCACTTGC AACAGTGCGC CTGCGAAACT ATTCAGAACT TAGCTTTGGA CGAACAGAAT
CGTCGAGAAA TTGTCGAGCT TGGCGGGATC TCTGTTATTG TTAAAGCTAT GATGCGTCAT
ATGGAGTGCG CCGGTATTCA GCAGTGTGTA TGTACAGCTT TGGCCAGTAT CGCCACCGAT
CCGGCCAATC GTCCACTAGT GGCTGACGCT GGAGGCTACG ATGCCATTGC AGTGGCAGTC
CGCAACTTTG CGGACAACGA ACCCGTTGCA CGAGCAGCCT ATGACGCACT TGCCATACTC
GGTTTTCCAC AATGCACTTC ATTAGGAACC TGGCGGTAAC AACACAAAAT GATTTCATAG
CGAAATCGCT TCTCGCAACA AAAACGAATG CAGGTGGATT TCATCAGAAG GTTGGGCCGA
TTGAGAGTGA GCCCCCGTTT CCCACAATTC GTAATGTAAG TGAATTACGG TGACGAAAGG
AAAGTAAGAA AAAAAAGGGT TG
 
Protein sequence
MPVEIIPDIV DDGSESSQSY NVDFSRQYAC EPSTSAFPSG KYDREGEGPC TECGMQTHDI 
QHDFLSQHQK VRLNVDQELR RGQCLLCFPI ASEASLNRVD PSRQSKTCAD DSTDTHWSRC
SKRMKQTHHI VARRVSQSNM FKFNHSLDVN AEIEEIKIGG SYDIADILCA MKTAPHDHLI
QELGCESLWI LSWEDENASA IGCVGGIPMV LNAMIRFPMN SHLQQCACET IQNLALDEQN
RREIVELGGI SVIVKAMMRH MECAGIQQCV CTALASIATD PANRPLVADA GGYDAIAVAV
RNFADNEPVA RAAYDALAIL GFPQCTSLGT WR