Gene PHATRDRAFT_30909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_30909 
Symbol 
ID7198828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011695 
Strand
Start bp83775 
End bp85017 
Gene Length1243 bp 
Protein Length347 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185044 
Protein GI219129750 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.130627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGACGATGCG ACTTCTTGAT CTAATGTAGC ACATGTGGAA GCTGTTGTAG CACACATCCG 
GAAGGGGCGA CTGCGTCCAG AGTTTCGTCA TGGCTTATTC TAGGTTTGAG TGCAAAGTCA
CAGCATCGCT GTTCATTATT TGGGGAATTC TTTTGTTATC GACTCCGGTC GATTGTTTCG
CGGTCTCCCG AAAGCAGAGG GGTCTATTCT CGAGAGCTCC ATTCCCTCGG GCGTTACGGA
AAACGAGATT TCCTACCTTG TTCACCTTGG AATCGACTGA TACCGGTACG CGCGTCTACG
CTGCTCTTTC CGATTCCAGC AACGGCGACA AACCGCTCTT TACTCAAGTT ATTAGTGACG
TCGACGATAC ACTCAAATCT TCGGGTGGAG TCAATGTTGC GGGCGTCGCC TTGGGTGGAA
TCGATGTTCA ATATCCCCGT GGTGAAATGT ATCCTGGTGT GGCCGAATTC ATGCTACAAA
TGAGTTTGGG AGTAGCCTCG TCACCAACAG TATCCAATCC TGCCCTTGCC GTCGCCCCTC
CCAAAATTGC CATTCTAACC GCTCGCGCCG AAGAATTCAA AATTGCTTTG CAGCTGAAGG
ATGATTCGCC CTTGGTAATG GCTCTGCGCC AGGCAGGCGA AACGGCGGGG GTTAGCAATT
GGGGCGTGGG ACCGGTGTTG TACGGTAGCG TGGCGGAATG GATCGCGCAG GACCGTAAGG
GGTTTCGGAA GTTTACTAAT TTTGAGCGAC TCCTGCAACA GGATCCAACC GGAATCATAT
TCCAGTACGT GTACGTTGGC GATACTGGCG AACTGGACCA AGAAGCCGGT GAAACCATGT
TACGCGAATA TCCGGAAGTC GTCAAAGCCG TCTTTCTACA CGTAGTGGCC GACAAAATTG
GGTCCGTAAC GGTGCCGCCA CCCAAAATTA TTAACGGACG ACCGGTCATT TTCTTTCGAA
CGTACGTTGG AGCCGCCGTG GCGGCAGTGC AACTAGGATT TATGAGCATG GATGGACTGG
ATAGTGTAAT TGCGGCCTCC TGTCAGCGAT TGGCAGACGT CCCGCGAGAA AGTGACAAGT
GGGCTGACCT GGAACGAGAT ATTGCTCGCG CTCAAGCGAC TTTTGTATTT TAGAAGACCC
GCAGAGACTC GGAGATGCCG CTACAGTCTA TACAATCCAT CGGGTGCGCA CAAAGGCAAC
TAAAATACAG CGTCATACGA TTTTATTACA AATATATATA TAT
 
Protein sequence
MAYSRFECKV TASLFIIWGI LLLSTPVDCF AVSRKQRGLF SRAPFPRALR KTRFPTLFTL 
ESTDTGTRVY AALSDSSNGD KPLFTQVISD VDDTLKSSGG VNVAGVALGG IDVQYPRGEM
YPGVAEFMLQ MSLGVASSPT VSNPALAVAP PKIAILTARA EEFKIALQLK DDSPLVMALR
QAGETAGVSN WGVGPVLYGS VAEWIAQDRK GFRKFTNFER LLQQDPTGII FQYVYVGDTG
ELDQEAGETM LREYPEVVKA VFLHVVADKI GSVTVPPPKI INGRPVIFFR TYVGAAVAAV
QLGFMSMDGL DSVIAASCQR LADVPRESDK WADLERDIAR AQATFVF