Gene PHATR_44112 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_44112 
Symbol 
ID7203872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp1014924 
End bp1017235 
Gene Length2312 bp 
Protein Length757 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186449 
Protein GI219113731 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAATTC TGTGTATCAC GGTTCTTTCT TTGGTCGCGG TTACTGTTGC GTTTCGTCCT 
TCCCAACGAT CGCTGGTGCG GAGTAGCGCC GCTTCTTTGA CTTTACGCCA TGCCCCCTTT
AGCCGCTTAA CGAAGATCTA CGAGCAGACG GATCAAACCC AAACGGAGGT TCTCAAGCTT
GAGCCTATCC TACTAGTCTC AGATACGGAT AAAGTGGTTG ACCCTGGCGA GGAATTGTTT
TTGGCTACGG TAGAAGAGGC GGTAGACGAG GCGTTGCAAG CGGAGGTGGA AACACTTGGT
TCCGAAAATG AAGTCGAGAC TTTCGGCGTC GTATTAGAGA GCTTCACAGA AGAGAGTGCC
AACGTAATAT CCTCATCAGA GATGTCTGCA GAATTTGCAT CCGTATCAGA GACATCTGCC
GAAGTAGCAT CTGATGTTGT GAGTGCCATT CTCGCTGCTT CGCAAGAAGC CGCTGATGCT
GCGGAAGCAA CCCTCTCGGA TGAAGACATT TTTAATTATT CCACACCCGG CTTTAAAAAT
GCTACGGTGG AAGTTAACAG AATCCCTGAG ATCCTTCCGG CGTCGCAGAT CGTTGGGGAT
CCCTCAGTGG CACCAAAAAT AGCCGCACCG TCGGTTGGAA AAATTCTTAA ATTCGCGTTA
CCTGCCACCG GGGTGTGGCT CTGTGGGCCT CTCTTGTCGT TGATTGACAC GAGCTCTGTA
GGCATTCTGT CCGGAACGGT CCAGCAGGCT GCTCTGAATC CCGCGGTTGC TGTCACTGAC
TACGCTGCCT TGCTTATTGC ATTTTTGTTT ACGGGGACGA CGAATCTCAT GGCGTCAGCC
TTGGAGTCTG ATCGTGGAGT AGAAGGATCA CCCCGGAGCA CAAGTACCCT GAAAGGAGCC
ATACAACTTT CGACTTATGT CGGCGCTGGC TTGGGCGCCG TTTTATTTGT CTTCGCCCGA
CCCTTGCTGC AAGCTTTAAT TGGAAATGAC GCCATGAGTC CTGCCGTATT TGCCGCCGCA
ATGAAGTACG TTCGCATCCG GGCGCTTGGA ATGCCGGCAG CTGCCGTAAT TGGGAGTACT
CAAGCTGCTT GCCTTGGCAT GCAAGATATC CGCAGTCCTC TCTATGTTCT ATTGGCGGCG
GCTGTTGTCA ATTTTATCGG AGACATGCTT TTCGTCGGGA GTACCAACCC TTGGCTTGGT
GGAGCGGCCG GAGCCGCTTG GGCTACCGTA TTCAGTCAAT TTGCGGCCGT TGGTTTATTT
GTGCACTGGC TTTGTCACAA ACCGCAAACG AAAGAGCGTA AACAGGTGGT CAACGTGTCT
CGAGCTATTT TGGAACTGAC TGGAAAGTCG GATAGCGCCG GTGAAAACCG AAGGCGGCGC
TTCATAGACA CTTTGCAGTC GTTCCGAGCG AATTTATCAG AAGAGAAGTC GATAGCAGTT
CCAAGCAGAA CAGGACACGC TACGACAAAG ACACGTCGAT CCAAATGGAC CAAAAAAAAC
AAGCCATCAT CGAAGGAGAA ATCTTTTTCA GTGCGCGGAT TCCTCGAGAG CAAAATCCAG
AGGCGGGAGC TTGTCAGGCT TCCGTCCAAA AGCATTATCA AAGAATTTTA TCCATATATG
TTGCCGGTTA CCAGCACACA GGTTGGTCGG GTTTCAGGTT ATGTTGCTAT GGCGCACGTT
GTTGCCAGTT CACTCGGTAC CGTCAGCATG GCGGCTCAGC AAGTAATTGT CAGCCTTTTC
TACTGCCTCT GCCCCATTGC GGATTCACTT AGTTTAACAG CGCAGTCCTT TGTGCCAGCG
ATTGCCGAAA AGAAGGTTTC GAAAGAACGA ACCAATGCAT TACGAAAGAC GACGAGAAAC
TTTTTTAAGG CCGGCTCAAT TTTCGGCTCT GTGATGGTCA GCGCTGTTCT CTGCATTCCA
TTCTTGTCGC AATTTTTTAC CGCTGATCCT GTTGTCAGTT CCATGGTAGC GTCCATTGCC
CCGTTGCTTG TGGGCGTGTT TGCCGTGCAT GGTATTGTTT GCGCATCTGA AGGTCTCTTG
TTGGGGCAAA AGGATCTGGG GTTCTTGGGC AAAATGTACG CCGGCTTTTT TGCAGTTGTT
CCTTTTTTTA TGCTGCGGGT GAAACGTGCG GCTGCGCGCG GCGTACCAGG AACTAATTTG
AGTTCTGTCT GGAAGGTGTT CTTAGGCTAC CAACTTTTCC GATGGATGAT GTGGATGTCT
CGAGTGGTCA CAATTCAGCG AAGAACTGAG CGAGAATCAG CCGGCTTTAT GTAGCTGAAA
GCATTTCCTA AACTTTGTAC ATACCACTTC TA
 
Protein sequence
MQILCITVLS LVAVTVAFRP SQRSLVRSSA ASLTLRHAPF SRLTKIYEQT DQTQTEVLKL 
EPILLVSDTD KVVDPGEELF LATVEEAVDE ALQAEVETLG SENEVETFGV VLESFTEESA
NVISSSEMSA EFASVSETSA EVASDVVSAI LAASQEAADA AEATLSDEDI FNYSTPGFKN
ATVEVNRIPE ILPASQIVGD PSVAPKIAAP SVGKILKFAL PATGVWLCGP LLSLIDTSSV
GILSGTVQQA ALNPAVAVTD YAALLIAFLF TGTTNLMASA LESDRGVEGS PRSTSTLKGA
IQLSTYVGAG LGAVLFVFAR PLLQALIGND AMSPAVFAAA MKYVRIRALG MPAAAVIGST
QAACLGMQDI RSPLYVLLAA AVVNFIGDML FVGSTNPWLG GAAGAAWATV FSQFAAVGLF
VHWLCHKPQT KERKQVVNVS RAILELTGKS DSAGENRRRR FIDTLQSFRA NLSEEKSIAV
PSRTGHATTK TRRSKWTKKN KPSSKEKSFS VRGFLESKIQ RRELVRLPSK SIIKEFYPYM
LPVTSTQVGR VSGYVAMAHV VASSLGTVSM AAQQVIVSLF YCLCPIADSL SLTAQSFVPA
IAEKKVSKER TNALRKTTRN FFKAGSIFGS VMVSAVLCIP FLSQFFTADP VVSSMVASIA
PLLVGVFAVH GIVCASEGLL LGQKDLGFLG KMYAGFFAVV PFFMLRVKRA AARGVPGTNL
SSVWKVFLGY QLFRWMMWMS RVVTIQRRTE RESAGFM