Gene PHATRDRAFT_49842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49842 
Symbol 
ID7198666 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011693 
Strand
Start bp62135 
End bp64049 
Gene Length1915 bp 
Protein Length571 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184725 
Protein GI219129079 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAACTCCAA GATTTCACTT CTTTGATCCT TTTTTAGCAT GAATGTCATC CACGAGACCG 
AATTGTCACC TAATGAAATC TCAGCGACTG CTGAGAAAAA GAGAGCACGA CGGAAAGGGC
GGGGTATAGT CCGTTTCGGT GATGAAGATT TCACCATTCA AGAAACCGAA GATCTTCTGG
ACACGTACGA GAAGGCGTCA TTTGGAGAAG TGTGCACCGC ATGCTGTTTT CGATCGGAAA
TTGAATGGCT ATGGATGCTG TTGGCTGTTA TAATCTTGGT CGGTTCTCTG TATTGGCTAG
TTTTCGGTCT GACGCTTCTC GGAGATGCGG CCAAAGTCTT GGCAGGTTGT GGAGCAGGGA
AACTCTTTGA TTCGGACTCC AACCCCTTGA GGTAGGAAAA TTTCGAATGT TGTCGATGAG
TTGAATGCTG CTGCTATTCC ATCTGTTTCT AACACATTGT ACTGTTTTAG TGCGTTGATG
GTCGGCATTA TTGCCACAGT GTTAATGCAA AGCAGCAGCA CAACCACCAC CGTGATTGTT
TCGCTCACTG AGGCACGCGC CATAAGCGTG GCCCAGGGTA TCTACATGTG AGTCAAGAGG
CGTGCGTATT CCAACTTTGG GCTTGCGATT GATTTTGTAT ACACTCAAAA GCTACTGCGT
TTCTTGAAGG GTTATGGGTG CCAACGTGGG AACCACAGTC ACTTCGACGA TCGTTTCTTT
GGCCCAAATG GGGAAAAGCG CCGAGCTGGA CCGGTCGTTC GCAGCTGCCA CATTACATGA
CGTCTTCAAC ATCTTTACTG TGGCAATCCT GTTTCCAGTG GAATGCGCCA CGGGATATTT
ACAGCATCTT ACCGGAGCCC TTGCCGAGGG GGCTGAGACT GGTAGAAGAG ACCATTTCGA
AGGACCGATA AAGAAATTTG TTAGTCCATT GTCTGCTCGT CTCTTGACCA GCAACAAAAA
ACTCATTGTT GGCGTTGCGA ACGGAAAAAC ATGCGACGAC TACTACCCGA TTCATTGCGA
TGAAGGAGCA GAGCCTAGCT TCGCAACGTG TAAAGTTGGA CTGATTGGCT GCTATGAGAG
TACCGGGCGC TGCCCTGTAC TCTTTCGCGA GTCTGCTTCT CGCACTCAGG ATCAAGTAGC
CGGAGCCGTC TCTTTTGTTA TTGCTTTGAT CATACTGTTT GTTTCCATGC TGTCCATGGT
TTTTGCTGTT CAGAAGCTTT TGTTCGGGTT GTCCACCAAG GTAGTGCATA CGGTTACCAC
CTGCAACGGA TACATCGGAT TCCTTGTCGG TATCGGAATT ACGATGATCA CACAGAGCTC
AAGCATTACG ACTTCGGTTC TTGTTCCGTT CGCTGGTGTC GGGGCTCTTC GATTAGAGCA
GGTCTATCCT CTCGTCTTGG GTGCGAATAT GGGAACTGCC GTTCAGGCAG TTGTCAGTAG
TTTGGATGCT GTCGGCACCG ATCCTCTTCA GGTAGCCCTT GCACATTTGT TTTTTAACTT
GACGGGTTTT CTGATTTGGT ATCCGCTTCC GCCACTTCGA AACATTCCAT TTTTTGCGGC
ACGTCGACTC GGGAAGAACG CAGGGATCTG GCGGATGTTT CCTTTATGCT ACATTGTCCT
GGTATTTTGC CTGTTGCCGC TTTTCTTCTG GGGTTTGTCG TCCCTATACG AGAACGGAAG
TGCAGGATTG CTTGCATTTG CAGTGCTACT AACTCTTGTT GCGGGTTTTG CCTTGGCCTT
GTTGATGTTT TGGTGCAACT TCCGTAATGG GCAATCAAAA TTTCACTCGA TTTTGGACCG
TTTGAGCCAT AGAAACGACA AGCCGTCGCA AACCTACGAT GCGGAGGGCA ATTTTGATAT
TGATGCTGAC GGAGATAACG TCTCTGGCGA GAAAGATACC GTTGAAAATC CGTGA
 
Protein sequence
MNVIHETELS PNEISATAEK KRARRKGRGI VRFGDEDFTI QETEDLLDTY EKASFGEVCT 
ACCFRSEIEW LWMLLAVIIL VGSLYWLVFG LTLLGDAAKV LAGCGAGKLF DSDSNPLSAL
MVGIIATVLM QSSSTTTTVI VSLTEARAIS VAQGIYMVMG ANVGTTVTST IVSLAQMGKS
AELDRSFAAA TLHDVFNIFT VAILFPVECA TGYLQHLTGA LAEGAETGRR DHFEGPIKKF
VSPLSARLLT SNKKLIVGVA NGKTCDDYYP IHCDEGAEPS FATCKVGLIG CYESTGRCPV
LFRESASRTQ DQVAGAVSFV IALIILFVSM LSMVFAVQKL LFGLSTKVVH TVTTCNGYIG
FLVGIGITMI TQSSSITTSV LVPFAGVGAL RLEQVYPLVL GANMGTAVQA VVSSLDAVGT
DPLQVALAHL FFNLTGFLIW YPLPPLRNIP FFAARRLGKN AGIWRMFPLC YIVLVFCLLP
LFFWGLSSLY ENGSAGLLAF AVLLTLVAGF ALALLMFWCN FRNGQSKFHS ILDRLSHRND
KPSQTYDAEG NFDIDADGDN VSGEKDTVEN P