Gene PHATRDRAFT_50586 details


Gene Information

Locus tag: PHATRDRAFT_50586
Symbol:
ID: 7199406
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 210608
End bp: 211789
Gene Length: 1182 bp
Protein Length: 362 aa
Translation table:
GC content: 51%
IMG OID:
Product: predicted protein
Protein accession: XP_002185543
Protein GI: 219130797
COG category:
COG ID:
TIGRFAM ID:
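
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state this) and that the protein is followed by a single stop codon. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the record above.
start_bp = 210608
end_bp = 211789
protein_length_aa = 362

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Coding bases implied by the protein: one codon per residue plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that the coding sequence does not account for.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # prints: 1182 1089 93
```

Under these assumptions the computed span matches the listed gene length of 1182 bp, and the 93 bp not accounted for by coding sequence would be consistent with the gene being marked as spliced.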


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.616621
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCAAC ATTTGCTTGA ACGAACAACA GCGTGGTTGA CGACTTGGAG TCTCATTCTG 
CTCGTGGCCG TTGCTCAAGA AGATGTTCAC CCTGACGTAG TTGCCGTTGT TGATGTTGCT
GGAGGTGTAC GACCGACACC ACTTTGGGCA AGCAGTTATT CGGACGGCGA GAATTGCTAC
TGCCTTCCTT CGTTGGATAG CGCCATTGGG AACTTTGTCG TAGAAACGCC ATTAGGGTGG
CTGACGACGC AGGAAGTGTG CGATCTGCTA GGAACAGGAC CAGGAAGACT AGGACAACCC
CTTTACAATG ATATCCAATG CGGTAACGGC CCCCCAAACG CTGATGAAAA CGAATTCCTT
TGTCCGGGAC GAACCGATGT AAGTGATTGC GAGGAAAAAG CCGAGCGCTG TCCACTAGTA
GAAACAAATA TATCTCAACG GCAAGCTTTG TACTGATTCG CTCTTCCTCA GATTGGGGAA
ACAGGTTGTG GTCAAATAGG ACCCAAATGG AATTTTGATA ATGCAAACCT TGCGGACGGC
CCCCCACGAC TTCCATCGTT GCCTGAGGAC ATTCATCCCG ATATCGTTGC GGTGATCGAC
GTTGTGGGTG GTGTGACGCC GAATGGAAGA TCGTGGGCCG ACAGCTATTC CTTTGGCAAC
AAGTGCTATT GTGCGACAAC GTTTGATCAC GACATTGCGG ACGTGCTAGT CGAAACACCG
CAGGGATGGA TGACGATCCG TCAAGCTTGC GAGTTACTTG GACCGGGTCC CGGTATTGAA
GGACGACCGG TGTACAATGA CATACAGTGC GGGAATGGAC CACCTAATAA TGCAGGCGAT
GAGCACGTGT GTCCTGGACG AACCGATGTA CGTCGACTGG AACGCGGTAG ACTGGCTAGT
ATTTTCAGAC CGTTGTTTTC TCGGACTGTT TATAATGAAA TGCCTCACCA TGATGCTTCT
CCGTTGTACA ATTCACAGCT TGGACCAGAA GGTTGTGGTC AGATTGGTCC CCGTTGGAAT
TTTGATGCCA TCAAATCATT ACCGCCAGGC AGCGCCCCCA CAGCTCTGCC CTCTTCTTTA
GCAGCCGGAG CAGTGCCTGT CCCAATGCTA CGGGGCTTAG GGGTAATCAC CGGATATCTG
TTCTGTGTTT TGAACTGGCA ACTTTTTGAT CTCGTTCCGT GA
 
Protein sequence
MKQHLLERTT AWLTTWSLIL LVAVAQEDVH PDVVAVVDVA GGVRPTPLWA SSYSDGENCY 
CLPSLDSAIG NFVVETPLGW LTTQEVCDLL GTGPGRLGQP LYNDIQCGNG PPNADENEFL
CPGRTDIGET GCGQIGPKWN FDNANLADGP PRLPSLPEDI HPDIVAVIDV VGGVTPNGRS
WADSYSFGNK CYCATTFDHD IADVLVETPQ GWMTIRQACE LLGPGPGIEG RPVYNDIQCG
NGPPNNAGDE HVCPGRTDVR RLERGRLASI FRPLFSRTVY NEMPHHDASP LYNSQLGPEG
CGQIGPRWNF DAIKSLPPGS APTALPSSLA AGAVPVPMLR GLGVITGYLF CVLNWQLFDL
VP
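
The sequences above are printed in space-separated blocks of ten characters, so they need light cleanup before any downstream use. The sketch below shows one way to strip the block formatting and compute length and GC percentage. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first line of the gene sequence rather than the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence above, used as a small example input.
first_line = "ATGAAGCAAC ATTTGCTTGA ACGAACAACA GCGTGGTTGA CGACTTGGAG TCTCATTCTG"

dna = clean_sequence(first_line)
print(len(dna))                   # number of bases after cleanup
print(round(gc_percent(dna), 1))  # GC percentage of this fragment only
```

Passing the complete gene sequence through the same helpers would allow the listed gene length and GC content to be compared against the sequence itself.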