Gene PHATRDRAFT_44773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44773 
Symbol 
ID7199889 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp202282 
End bp206306 
Gene Length4025 bp 
Protein Length1153 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178950 
Protein GI219116310 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.749806 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTTCCT ATACGGATCG ACGTGACTAC ACGTCGGGTG GTCATCGTGG AGGTGGCGGG 
GGATACCGCG GTTCTGGTGG ATACGGGGGC GGTGGTGCAC GGGGTGGAGG ACGCGGTGGT
GGGTATCGCG GAGGACGGGG TGGATATCAC CGTGGAGGAC GTCAGGCCCA TACCGGAACA
TCTCCCTACT CGCGTCACAG TGGCGGGACT ACGAACGGTG GACCCCGACG CTCCGGAAAT
CGTTTCGCCG TGGAATCCAC GCGGCGAGAT CCTCAGCAAG AACTCTTACG CAATCTACTC
GTCCTTTTGC ATCAAGTAGG AGATCTGCAG TCCACGAATC GCAACAACAA CAACAACAAC
AACAGTAACA GTAACGACAA TACCTCCCCG GTGCGTACAA TAGTCACAAC GCAGAAGCAA
AACATTCAAG GATTGACGCA AGTCTTGTGT GGCAACAATC CCGAACTCTT TCTCCAACAC
GAGGCCGAAG ACAGTCCCGC AAACATTCAG TACGCGGAGC AATTGGCCGG ACCGTTGGCG
GCGGGATTGG TCCACGCTAT TATGGCGGCG CCACTGCAAA CACCCTGCTA CGTGGGGTTG
ACTCTGGCGG TTCACGTCAC GGCGCACGCC CGGGACGCAA CCTTGTTTGG TGGATTCGCA
TCCCGCTGTG TCCGCTACGC CACCAGGGCC ATGGCCCGTG ATCTCGACGC TCTGCTGCTA
GATTCGGACC CGAATAGCAG TACTAGTAGC ACTACTCATA GCGGTATTGG AGACAGTGGC
AGCGGTAGTA CTACTACTCG ACAGTCCCGC GTCCCATCCC ATCGATTGGT GGTTCGGGTG
CGTATGTTGC TCCGCTACCT GGTTCTACTG GCACGCGCGG GTATTCTGCA ACTCGACCAC
GATACCGCAT ACGCCGTCAC GGGCCCCGCA CACTCCAACC CGGTCAGTGT CTTGGGGCTG
TTACAAACAA TGGTACAGGC GGCACGGCAA GCCAAGGAGC GTGACGGCAA CCAGTCCGTG
GCCGTAGTCT TGGCGTCGCT CGTGTTGAGT ACGGTACCCT ACCTGGCGAC CCTTCTTCCT
TCCGAGACGG TCCGCCAGAC TTTGGTCCAA CCATTGGAAG CAAGCGTCCA ATCATACCGG
TCCACGTTTG CACCAGGCTT TGGTTGTACC GCAATTCTTC TTCGAGAGGA ACAAATTGAG
GATATTGGGG CTTCCATTGA AGAAGAAGAC GACGAAGAGG AGGATGACGA CGAAGAAGAA
GAGGGGTCGG GACAAGTGTG CGACAACTTT CAAGACCTGA TGCGGACCGT ACAGTACTGG
GTAAAGCAAG AGGACACTAC CGTTGCGTCA CGCCTGGCTC TCTTTCGCGA CGCTCCGTGG
GAAGGCTTGG AGGCAAAAGT CTCCTCGCTT ACTTCCACAA CCTTGGACGC CAGCGAGATT
GAGGAATCCG CTCACACTCC GTTAGTCTAT ACGGAGACGC CCTTGACCAT TCCAATATTC
ACGGATTGCC AGTCCCTGTC GGCTTTGGTG GGTGGTCTTC ACTCGGCGCC CGAGGACGAC
AGTCTGTTCT GGGCCGAAAT TGATCTCGAC GGAATTTTTG TGGGTCGCTT GCCCATTTTC
GGACCTCCTC CCGAAGTTGC GGACAATGAC GACGACGAAG ACGACGACCA AGACATGGAG
GCAGCAGCAC CGGTTAACGA ACGTCTACAA GCGTATCGCT CCGGATACGG CATGGTGGAT
CGCTACTTTA TACACGAAGC CATCCGCGAT TGCTTGACTA GCCACGAAAG TTACGTGACG
GATACCGGTG TCGAAATTGG CAACGCCAAG ACGGCTGCGG AGCAGGTGTG GTCTATCATT
CAAATGGTGA CGGGTGACAG CACCAACGGG TTGGAATATG CAGTTCTGGA AGCGATCTTT
TCCTTAATTG TACAATCAAA TGCGGTCTCA TCGTTCCGTT TTGTATACCT TTCGCGTGTG
TTATTGGAAC TAACGAGGCT GGAACCCGCC ATCATGTCGC CCGCCATTGC CATTGCGGTC
TCGACTTTAT TTCAGGACTA CATGCCAACT CTGGTCCCCA TGGCACGATA CAATCTGAGT
CGATGGTTCG CCTTTCACTT GGTTAACACC GATTACCAGT GGCCCGCAGC GTACTGGAAA
CATTGGGAAC CCTTTGTGCA GTACGGTTGG AAGAATAGTC GCGGAGCTTT TGTGAAGGGT
GCACTGGCAA TTCTGCTGGA AAACGAAAGC GATGCGGGTA TGTTGGTGAA GGAATGCCTG
CCCAAGAACA GCCTTTTGGT CGACCATTTA CTTCCCGGGC TCACGACGTC GGCTTTACCA
GACGACAGCG CTTTGGCGTC CTTTGCAAAG GATGTCTCTT CACGTATATG GGATAATCGC
GAGGACGAAC ATTCGCTGTT GCAATATATT GTAGGGGACG AACTCTCCGA AAGTGTGACG
ACCGACTTGG CTGGTCTGCC CGTAGGAGAG AGGACGTGGT GGCGGACACA TGCCGTAGCT
CGAGCTTTGC TGTCGGTGAC CAAGCAAGAG CACACGTATT TGGCCCTATC TATCGCACAA
GAGCGTACCG CCAACGAGGA CGCAATGGAT GCGACCATTG AAGCACCTGC AGACATTTTG
TCCTTGCTGC TAGATGCATT GGTGCAGTAC ACGCCACTCC TGCTGGGAGT TCTCGCCAAA
GATCTCGACG GTCAAGCGAG CGCGGATGCT GCTCCGGTTC AGGGCGAGTT ACATGTCCTC
CAAGAAATAT CGAATCATGT ATTGTATTCT CGTACCACAC TGGATGCAGT TGTGAGTTCC
CTGCTTCACC ACAAGGTTGT TTCACCCGAT GCCGTAGTTC GCTGGTCCTT GGGCGATATG
GGACAGGAGA CTCCCGGGGT CTTGGCAATT CACTGGTGGG ATATGTCAAC TATGGCAATA
CACTACGGGC TCACCAATCT GTTTGCGGTG ACACCTGCTT CAAACCAGGG CGAGATGCAA
GTCGAAGACA ATCAAGACGA AAGCCCGACC ATGAAAAATG CACGTTTGTT TCTGGAGCCG
ATCATCGAGT ACACTGTGGG TCGCATTTGC CACCTTTTAT CGTCGGCGAG TCATACTGTA
GAAAGCAGCA AACTGACAAA TACACAGGTT GATCTGGTTG AAGGCTTCAA GTGTTTGGTT
CGGCAAACAA AAAGGTGTTT GCTGCATGTA CTGCTCAGCT CTTCCGTCAT TGGGCAACAG
CTGCGGCCCG CTACGGTGCG AAAGTACCTC ACGAATTCTT GTCTTTCGGG TTCCAACCTA
CTGTCTATGT GTCAGGAAAC CGACGGATCG TCGGCCATGA ATACGTTCCG GACAAGTTTG
AAATTTATGG CATAATACTT CATTCCAGGA GTTTTGTAAT GTCAAGGACG CTGTCTTACT
GTTAATGATC GATCGCTGGT TTGCTGTAAA TTGCTGATCG TTTTGTTTCG TAGCTAGAGT
GCTTCTAGTT CCAGTAAGAC CCCGTCTTCG ACGTCTCTAC ATCCGTATTA GTCGGCTGTA
ATATTCTTGC CTTTACATAA ATTACCTACT GCGCTGCGTA CGTACCTGAC TTCAACCCAA
TCTGCCAGTA TGCTTTCTGA GACCGACGTT AATCCTCGCG TCGCCCACCG ACTCCGTCAG
CTTGTTCACC GAGCGCAGCC GCCGACGCCG CCATCGCCTT TGCCGTTGCG TCTCTCCCCG
AACTCCTCCG CTCATCGTCT AAGCACAGCG CTACAAAAGT TTTCGAACTC TGCAAGCGCC
TCTCCGAACG CAAAGCCACC GCTTCAAAAT TCAAAGATGA CTCCTCCCTT CCGAAATCGG
TGCGGATCCG ACCGTCTCCG CACGGTTCGC GGGCAGCCAT GAAGACCGCG GAATTTACTG
AAGCTGCTCA AAGTATTCAA GAACTTCACC GGACTTACCG CCTTGGGATG AAAGCTCAAT
TCTTGAAACT TGTCAATCTT GAAGTCAAGG TTCTCCGTGA CGAACTGTCA ACCCTTTTTG
CCTAG
 
Protein sequence
MSSYTDRRDY TSGGHRGGGG GYRGSGGYGG GGARGGGRGG GYRGGRGGYH RGGRQAHTGT 
SPYSRHSGGT TNGGPRRSGN RFAVESTRRD PQQELLRNLL VLLHQVGDLQ STNRNNNNNN
NSNSNDNTSP VRTIVTTQKQ NIQGLTQVLC GNNPELFLQH EAEDSPANIQ YAEQLAGPLA
AGLVHAIMAA PLQTPCYVGL TLAVHVTAHA RDATLFGGFA SRCVRYATRA MARDLDALLL
DSDPNSSTSS TTHSGIGDSG SGSTTTRQSR VPSHRLVVRV RMLLRYLVLL ARAGILQLDH
DTAYAVTGPA HSNPVSVLGL LQTMVQAARQ AKERDGNQSV AVVLASLVLS TVPYLATLLP
SETVRQTLVQ PLEASVQSYR STFAPGFGCT AILLREEQIE DIGASIEEED DEEEDDDEEE
EGSGQVCDNF QDLMRTVQYW VKQEDTTVAS RLALFRDAPW EGLEAKVSSL TSTTLDASEI
EESAHTPLVY TETPLTIPIF TDCQSLSALV GGLHSAPEDD SLFWAEIDLD GIFVGRLPIF
GPPPEVADND DDEDDDQDME AAAPVNERLQ AYRSGYGMVD RYFIHEAIRD CLTSHESYVT
DTGVEIGNAK TAAEQVWSII QMVTGDSTNG LEYAVLEAIF SLIVQSNAVS SFRFVYLSRV
LLELTRLEPA IMSPAIAIAV STLFQDYMPT LVPMARYNLS RWFAFHLVNT DYQWPAAYWK
HWEPFVQYGW KNSRGAFVKG ALAILLENES DAGMLVKECL PKNSLLVDHL LPGLTTSALP
DDSALASFAK DVSSRIWDNR EDEHSLLQYI VGDELSESVT TDLAGLPVGE RTWWRTHAVA
RALLSVTKQE HTYLALSIAQ ERTANEDAMD ATIEAPADIL SLLLDALVQY TPLLLGVLAK
DLDGQASADA APVQGELHVL QEISNHVLYS RTTLDAVVSS LLHHKVVSPD AVVRWSLGDM
GQETPGVLAI HWWDMSTMAI HYGLTNLFAV TPASNQGEMQ VEDNQDESPT MKNARLFLEP
IIELIWLKAS SVWFGKQKAA DAAIAFAVAS LPELLRSSSK HSATKVFELC KRLSERKATA
SKFKDDSSLP KSVRIRPSPH GSRAAMKTAE FTEAAQSIQE LHRTYRLGMK AQFLKLVNLE
VKVLRDELST LFA