Gene PHATRDRAFT_49667 details


Gene Information

Locus tag: PHATRDRAFT_49667
Symbol:
ID: 7198152
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 353701
End bp: 354975
Gene Length: 1275 bp
Protein Length: 353 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002184448
Protein GI: 219128496
COG category:
COG ID:
TIGRFAM ID:
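
The Start bp, End bp and Gene Length values above agree with each other if the coordinates are read as 1-based and inclusive. That reading is an assumption, not something the record states. A minimal Python sketch of the check, using a hypothetical helper named span_length:

```python
# Values copied from the record above.
start_bp = 353701
end_bp = 354975


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


gene_length = span_length(start_bp, end_bp)
print(gene_length)  # 1275, the Gene Length listed in the record
```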


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.753595
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
TGCGTACATC GGTTTCAATG AGAGTATAGT TGGTTGCTCT CATAGCGCAA AGAGAAACTT 
TACATTCTGT TCATGGCTTT TCTTGAGGAA AAGAACGCGT TGCGATTGTG TATGAACTTG
GCAGTTGTCG TGCTTGTGGC TCTGGCGTTT ACGGCAACAA GCATGCAACG CATGGGCGAA
ATCAATAGAA AGGACTTCAA GATCCAACAA ACTTGGAGCG ATCAGCAGAA AGCCACCATT
ATTTCTAAAG GAACAGAGGG CATGTGGCTA AATAGAACAG GATGGCCCAG AGGAGACTGG
AAAACCGTCA ATAACTTTTA CAACGGAGTC GATTTCCAAA ATCAATCCAC GGAGCCAGTC
TACTCGTCGT CCCAAAAAAG TTTTAAACAA CGCCTGACAC AATCTTGGTT GATCCAAGAC
GAATCCGGAC GAACGGTAAT GACGAAATAT GGGGTGCAGA GCCGCCCAGG TGCATTTGTG
CACATAGGTA AAACGGGCGG CTCATCGTTG AGTCAACATC TCCGATACGG TTGCCACTCC
TTCGCTCTCA ATCGGTGTAA AGAGAGGTTG CCGGGCAATG AGACAGAGTC GTATCTCTCT
CGTCTTACGT CATACTACCA TGTGCCAGAT TTTGGTAGGC TTCACAAGGC CAGGCACCAC
GACTTCTTTG CAATCACTAT TCGTGATCCT TTCTCTCGTT TTTTGTCGGT GTTTACCTTT
ATGCACCCAC AAAACGTTCG TGCCCGCAAG GGATGCAACC GAGAATTTTG CAAGACCGGG
AGCGCGTGGG AGTGCTTCCC TTCCGTGAAT GATTTTGCGT GGCACTTAGC GTCAAACGGG
ACCTCCAAAA GCATCGATGG TAATTTGACC AGTCAAATAA GAATCCAAGA CTCCCCTGCA
AACGAAAGTG TGCACACTCT GAATTGTAGC GAAGTAGCAC AGCAATGGAT TGGTGGTGGT
CAAGTCGATG CTCCTGCTCA TTTTCGATTC GGCACTAGAA CAGTCGTGGA CGAGTATTTG
CCTCCCGGAA GCGTATTCAA TTCCACAATT TTGGTGATAA GAAACGAGTA TTTGTGGGAA
GATTGGATTG CCACAAACGA ATGGCTAGGT CAGGAAAAAG GCACGGTTGC TACATTTCCA
CTCGACGCAG TTCGGGATTT TTCCCAACTC AAGCTACCGG TAACGAAGGA ATTATCGGAT
GACTCACGTC AAATACTGTG TACGTTTTTA CAGGATGAAT ACCGTGCATA TCTCCAAGTG
CTGCGAATGC CGTGA
 
Protein sequence
MAFLEEKNAL RLCMNLAVVV LVALAFTATS MQRMGEINRK DFKIQQTWSD QQKATIISKG 
TEGMWLNRTG WPRGDWKTVN NFYNGVDFQN QSTEPVYSSS QKSFKQRLTQ SWLIQDESGR
TVMTKYGVQS RPGAFVHIDF GRLHKARHHD FFAITIRDPF SRFLSVFTFM HPQNVRARKG
CNREFCKTGS AWECFPSVND FAWHLASNGT SKSIDGNLTS QIRIQDSPAN ESVHTLNCSE
VAQQWIGGGQ VDAPAHFRFG TRTVVDEYLP PGSVFNSTIL VIRNEYLWED WIATNEWLGQ
EKGTVATFPL DAVRDFSQLK LPVTKELSDD SRQILCTFLQ DEYRAYLQVL RMP
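
Both sequences above are printed in space-separated groups of 10 residues, so the whitespace has to be removed before the text can be measured or passed to other tools. The sketch below shows one way to do that in Python and to compute a GC fraction. The function names clean_sequence and gc_fraction are hypothetical, and the demo input is only the first two groups of the gene sequence, not the full record.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(seq.count(base) for base in "GC") / len(seq)


# First two 10-base groups of the gene sequence above, used as a small demo.
fragment = clean_sequence("TGCGTACATC GGTTTCAATG")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.45
```

Applied to the full blocks, clean_sequence should give 1275 bases for the gene sequence and 353 residues for the protein sequence, matching the Gene Length and Protein Length fields. The GC fraction of the full gene sequence can then be compared with the listed GC content of 47%.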