Gene PHATRDRAFT_47857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47857 
Symbol 
ID7202987 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp257480 
End bp258688 
Gene Length1209 bp 
Protein Length396 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182357 
Protein GI219124116 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGTTGA TCATGAACGC GCAGAATCCC AGCAAGAACA AACCGACGAG GAAAGCTGTG 
CAGCCATCGC GAGGTAGTTA CAGTAAGCTG GCTTATTGTT TTTTGGGCGC GTGCATGGTC
GGGGCTATGG TCCTAAATAT GTCCCACCTG CGGAACGTGC AGGAATCCTC GCATCCTCAA
TCCGCACTAC CACGACGAAA CGCTAAAGTG GTTCTGAGGA GAGAGTCCGT AAAGAAAACT
GTCGCTTCCT TGAAAAAGGT TGTGGCTCCT CAGAGTCCGG CCCTGTCCGA TAGGAAACCG
TCGCAATCGT TTATTCCCAT TGCACCTCGA CCAACGGATC CTGCCAAAAA TCTCCTCCAG
GCTGGGGACT ACATTTACTA CCAGGACCCC GCTATACCTC GTTGGGATGC GGCACCGATT
GTGGTTGAAA GCCATAAGTT GCTTTTTTTC ACCACGCCCA AAGTCGGATG CACGGTTTGG
AAACAGCTTT TCCGTCGTAT GATGGGCGCA AAAGACTGGA AAAGTCAGGA CGCACAATCG
CTCCTTCCAC ATAATCCGGA AGTCAACGGT CTCAAATACC TTTACGACTA CCCCTTGGAA
GAAGCCGACC GCATGATGAC CTCGCCTAAA TGGACTCGGG CAGTCATGGT TCGTGATCCC
AAGCAGCGAT TTCTGTCGGC CTTCTTGGAC AAGGCCGTCA GCAACCGCCA CCAACACATT
CAGCACCGCT GTTGCCCGGA CCAGGCGTGC ATAGCAGACG CCCAGACATT AGCAGGGTTT
CTCCGGCTGT GTGAGCGTTG CGACGATGAG CATTGGCGAG CGCAGAATGC CCGCCTTGAT
TCCAAATTTT GGCCATATAT GGACTTTGTT GGCCACGTAG AAAACTCGGC GGCCGACGCG
CAGGCATTGC TGACTCGCGT TGGCGCTTGG GACGAATTTG GCGCCTCGGG GTGGGGAACC
GACGGCACCA GTGCAATTTT TCAGTCCAAA GGATCCGGCG GTGCGGGTAC ACACGCAACC
TGGTCTCAAT GGAAGGTGTG GCAATGGTAT ACCCCGGAAA TCGAGCAACA AGTGGAGGAT
TTCTTCCGTG CCGATTTCGA AAATCCTTTG TTCAATTTTA CTCGAGGTGA ATGTTTGACC
TGTCTCTCCG ATGAAGATAA AGCCAAACTT GCGGCTGAAC AAAAGAAGTA GAAAACGGTT
TAGGGCGTA
 
Protein sequence
MVLIMNAQNP SKNKPTRKAV QPSRGSYSKL AYCFLGACMV GAMVLNMSHL RNVQESSHPQ 
SALPRRNAKV VLRRESVKKT VASLKKVVAP QSPALSDRKP SQSFIPIAPR PTDPAKNLLQ
AGDYIYYQDP AIPRWDAAPI VVESHKLLFF TTPKVGCTVW KQLFRRMMGA KDWKSQDAQS
LLPHNPEVNG LKYLYDYPLE EADRMMTSPK WTRAVMVRDP KQRFLSAFLD KAVSNRHQHI
QHRCCPDQAC IADAQTLAGF LRLCERCDDE HWRAQNARLD SKFWPYMDFV GHVENSAADA
QALLTRVGAW DEFGASGWGT DGTSAIFQSK GSGGAGTHAT WSQWKVWQWY TPEIEQQVED
FFRADFENPL FNFTRGECLT CLSDEDKAKL AAEQKK