Gene PHATR_44133 details

Gene Information

Locus tag: PHATR_44133
Symbol:
ID: 7203886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011671
Strand:
Start bp: 1075030
End bp: 1078470
Gene Length: 3441 bp
Protein Length: 1012 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002186179
Protein GI: 219113191
COG category:
COG ID:
TIGRFAM ID:
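
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding region ends in a single stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end_bp - start_bp + 1


def cds_length(protein_length_aa: int) -> int:
    """Coding length in bp: three bases per residue plus one stop codon (assumed)."""
    return protein_length_aa * 3 + 3


gene_bp = span_length(1075030, 1078470)   # values from the record
coding_bp = cds_length(1012)              # Protein Length from the record
print(gene_bp, coding_bp, gene_bp - coding_bp)
```

Under these assumptions the span matches the listed Gene Length of 3441 bp, and the 1012 aa protein accounts for 3039 bp of it, leaving 402 bp that would lie outside the coding region.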


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGATGACTTC CTAGACACAG CTTGGTGAGG GCACTACAAC AGTAGTACGC TGTTTGGATG 
CGACCTAACA CTGACTGTAA GTAGTAGCCT AGATCTTACA CAAGAGTTTC GCCCGTTGTG
CCAGTCCTGT CTGTTAGGCA CGACGGTGGG TCCTTAAATC CTCCCGTTTC CAAAGACTTC
CAAGAGAGCG TATCGGCCTT CGTCATGACA ACAATCCACA TTATTGATAG CTGGGCCGAT
CTTCTTGGAG GAATCACGCT ATCGGATTGT CGCAACGTCA CTGACCCATC CTGTGTCGGC
GGGGTTGCGA ATGTTGCCGC CTTTGCGCGG CACTTGACCC GGAGCGGGGC GAGCGTTACC
GACAGTCCGA GCGGTGGGAC TGCGGAAGAT ATTGAGGAAC CGTACATAGT GACGCCGCGG
CTCGATCGGT ACTCGCCATT CGTGCAAATG CATCCACTTG GATGGAGTGT GAACCGGTTG
ATCTTTGTTG AAGTATTGCA ATGGAAATTC TTTATCTCGA GTCCCGCCTT GTTGTATTTA
AGCAGCACTG CTCGTGGGAG CACTAACGGT AGCAGCACGA TTTCGACGAA CTCCTCGTTG
ATGTCGGATT TGCAAACGGA CGAGTTGCCC TTCTTACTGA CTAACGTGGC TGTGCCGCCA
GAGAACGACT GGTACCCCTA CACAAAGGCG ATATACTTTG ACGAAACCAC GCACCTGGCT
TTTTTGTCGG TCGCGGATTC TGACGAGCCC TTAAATGCGC CGCAGATAGA ATCTGCAACG
GGAGCCTTGA ATTATATTGC ACGTTTGAAC AAAGAAAGTG ACTGCGATGC ATTTGTTGAC
GATGGGGGTG GAACTGCGGT AGCCCTGGGG AGTAGCTGGG ACAATCGAAC CTGTTGGATT
CCAGTGGTTG CCTTTGGTGA TTCGGAAAAT CGATGGAACA ATTTCCTATT GGCGATGACT
GCATTAGAAA ACCCTCCGTC TCTAATAATG GATATTGAAG GACACGATGC GCGATTTTTT
ACACCGCAAA AGTACAACCA AACTTGGGTT TCTAGTTACA AAATGAATTC TACGACGTAT
CGTCAACAAA GCATTGTTCT AGCAGAGGAC AGCCGAACAA TTGTGAGCGT GACGGCGACA
GCCATACCCC TAAATGAGTT GCCCGACGAG TTCAAAGACA ATATCTACAC ATCCCACGTA
ACCCGCATGT ACGAGTTGGG GCAGGAAGCG GCAAACAAAG ACCCAATTGT GGGAACAAGT
ACTTTCATGC CAATTGCTAG AATTGACCAG TATAGACGGT GTATGGGTGG TGAATGCGAG
ATCGGAAATC TCTTTACAGA TGCTCTGCGA TGGTATTCTA GCGCGGATGT GGCATTTGTT
TCGAGTGGAG GATTGCGAGG CCAAGGATGG CCGGAAGGAG TTGTTCAAAT GTCCAATTTA
TGGGAATCAT TGCCGTTTCC GAATACCCTA TGCTCCGGAA CCATGACTGG AGTATCGTTA
TTCAAGCTGT TCAACTATTC TACTAGTGTT GCAAACTTTG AAGTCAAAGA AACAGTTTCG
GGGGGACAGC TGCTGCAGGT GTCTGGGGCG CGACTTCGAT ATAATACGAA ACTCCCCCAA
GGAGCGTCGC GAATGGTTAG GCTGGAGATT TGGGACAAAA ACGCCAATCA GTATGCACCA
GTGCAACGCC TCCAATTGTA CAAGTTCGCA ACCGATAATT TTTTATGCGA AACCAACATA
CCGTATCCTG AATTGTTGGG GCAAAATTTT TACATCGATG GGGAAGTTCC GGGTGTTGTG
CGTGACGATT TACATCAAAA CATTGTTGCG GACTATCTTA CACAATTGAA CACAACTTAC
CAAGCGACTA TTGAGGGACG ACTCATCAAT GACACTACTG TTCTGGATGC AATGAATTTG
GTACAGATCG AAGGAGGCTG TGGACAAGGA ACTTATTGGG TTTTTACGCA ACAGAGTTGC
AAGGTTTGTC CGAATACTGA TCAAGTCTAT TTTGGAAAGA AAGAGCTTGA ATTTGCGAGT
GAAAGCGGAT CGTCGAAACC TGTTGAGGGG CGTTTCGAGA TACTGAATAA TGCGGGTTTC
CCTGTATCGG TGGGTCCCAG ATCGTTTCCA TTCTGGGTGA CTTTGACAGT TTTCTTGTGC
AATGGTACGA TCCCAATTGA TCCAATTCCA GCAGGTGTAA CGCGCGTGTT GCAATCCGGC
GAGAAATTGA CGGTAGGTTT ATCTATCTCG TCTGAGCAAC TCGAAGCAGG AACAGCGGTT
GCGACTGGAT CTTTTTCTGT GGTGGATGGC GGGAGCTTTC CTGGATGCAT TGGCAATGAA
ATTTCTTTTG ACATTCTCGT TCGTGTGGAT CCCAGTCGGG AACTCAATCA GATAGGAGGA
ATTCGATGGG TTGGCTGGTC GCTATTCATG GTCCTTGTTT TTTCTGCGAT GTTCTTTTAC
ACTTGGGTAT GCCAGCACGA GCGAATTGCA GTTGTCCGTG CTATGCACCG CTTGTTTCTC
AACACGGTTT GCCTAGGCAT TGTTGTTTTG GGCTCTGTGT TGATTCCAGT GGGTTTTGAC
GACGGTGCAT TCTCGGAAAA CATATGCAAC AGCGCGTGTG CTTCAATCCC ATGGATCAGC
GCAGCAGGGT TGAGTACAAT ATTTGCAGCT ATGTATCGCA AGCTGGGATC GATTGTCGGA
AAAAATGAAG ATGCTCGTGA ATTTCGTGGT CGCCGTGTTG TTTTAACATT TGCCGTTTTC
TTTGGCCTAA ATGCGTCGAT TCTAGTGCCT TGGAGTATTT TAGCCCCATT GCACTGGGAT
CGAACGCCAC TCGTTCAAGA AGAATGGAAG AGCTACGGTC GTTGTTCAAC GAGCGATACG
TCAAGTTTGG CTTTTGTGGT TATGGCCGGT GTTCTGAATG TGTCGGGATT TGTCTTGATA
TGCCGGTTGG CTTATAAAGC CCAGCAAATA CAAGATAGAA GGGACCAATT CGACCAGGCC
AAAAGCATTT CATTGGCCCT GTACAGCTGG ATTCAATTAG CAGTCGTGGG AATTCCTGTT
CTCTCTCTGA TTAGCGCGGA AACCACTCGT GCTCGATATT TTATGATCGT TGCGCTCATC
TTCGCCTTGT GTATCTCCAT GCTGCTTAGC TTGTTTATTC CGATGCAAAT GCAGAAAACG
ATGCAAAGCG TGCGAACATT GAGTTTAGGC AGTCGCTTCC TGAGCTCATT TCGATCTAGA
TAATCTGCGA CGACACAAGG AGGCGGGAAA CTCAACACAC AGCATTAAGA ATCAAACCGG
TTTTACATAC CTTCCTTCCG TGCATTGGGG CAACAACCTC GGAGCCGATT CGAATTAGAT
GACATAAGCG GCACCGTGGA CGACGCAGTC ATTCTGCAGT GTCCGGACCA AGTGCGATCA
ATTTTCGGAC ATGAGGTTCG C
 
Protein sequence
MTTIHIIDSW ADLLGGITLS DCRNVTDPSC VGGVANVAAF ARHLTRSGAS VTDSPSGGTA 
EDIEEPYIVT PRLDRYSPFV QMHPLGWSVN RLIFVEVLQW KFFISSPALL YLSSTARGST
NGSSTISTNS SLMSDLQTDE LPFLLTNVAV PPENDWYPYT KAIYFDETTH LAFLSVADSD
EPLNAPQIES ATGALNYIAR LNKESDCDAF VDDGGGTAVA LGSSWDNRTC WIPVVAFGDS
ENRWNNFLLA MTALENPPSL IMDIEGHDAR FFTPQKYNQT WVSSYKMNST TYRQQSIVLA
EDSRTIVSVT ATAIPLNELP DEFKDNIYTS HVTRMYELGQ EAANKDPIVG TSTFMPIARI
DQYRRCMGGE CEIGNLFTDA LRWYSSADVA FVSSGGLRGQ GWPEGVVQMS NLWESLPFPN
TLCSGTMTGV SLFKLFNYST SVANFEVKET VSGGQLLQVS GARLRYNTKL PQGASRMVRL
EIWDKNANQY APVQRLQLYK FATDNFLCET NIPYPELLGQ NFYIDGEVPG VVRDDLHQNI
VADYLTQLNT TYQATIEGRL INDTTVLDAM NLVQIEGGCG QGTYWVFTQQ SCKVCPNTDQ
VYFGKKELEF ASESGSSKPV EGRFEILNNA GFPVSVGPRS FPFWVTLTVF LCNGTIPIDP
IPAGVTRVLQ SGEKLTVGLS ISSEQLEAGT AVATGSFSVV DGGSFPGCIG NEISFDILVR
VDPSRELNQI GGIRWVGWSL FMVLVFSAMF FYTWVCQHER IAVVRAMHRL FLNTVCLGIV
VLGSVLIPVG FDDGAFSENI CNSACASIPW ISAAGLSTIF AAMYRKLGSI VGKNEDAREF
RGRRVVLTFA VFFGLNASIL VPWSILAPLH WDRTPLVQEE WKSYGRCSTS DTSSLAFVVM
AGVLNVSGFV LICRLAYKAQ QIQDRRDQFD QAKSISLALY SWIQLAVVGI PVLSLISAET
TRARYFMIVA LIFALCISML LSLFIPMQMQ KTMQSVRTLS LGSRFLSSFR SR
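
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any comparison. The following is a minimal sketch of how the two could be related, assuming the standard nuclear genetic code (the record leaves Translation table blank). The helper names `clean` and `translate` are hypothetical, and the example uses only the fourth line of the gene sequence, where the start of the protein appears 24 bases into the line.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used to print sequences in blocks."""
    return "".join(seq_block.split()).upper()


def translate(dna: str, offset: int = 0) -> str:
    """Translate complete codons from `offset`, stopping at the first stop codon."""
    residues = []
    for i in range(offset, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# Fourth line of the gene sequence as printed in the record.
line4 = "CAAGAGAGCG TATCGGCCTT CGTCATGACA ACAATCCACA TTATTGATAG CTGGGCCGAT"
peptide = translate(clean(line4), offset=24)
print(peptide)  # MTTIHIIDSWAD
```

The translated fragment matches the first twelve residues of the protein sequence. Applied to the full cleaned gene sequence, the same start codon sits at offset 204 (three full lines of 60 bases plus 24).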