Gene PHATRDRAFT_49678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49678 
Symbol 
ID7198161 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp386790 
End bp389231 
Gene Length2442 bp 
Protein Length729 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184454 
Protein GI219128509 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.452781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTCT CTACTGCCGT TGTATCACTC ATAACCGTCG CACCACTGGT CGTGGGCGCC 
GCTGAGGAAT CTCGATACCT TAAGACCGGA ATGGTGCGTC TCTCAATCGG CTCGCTGCTT
GTTGGATCTC CCGGGTGCTT GTTACCTCAA CGATTATCTC TTTTTACTGT AGAAATCCAA
GTCTGGCAAA GGCAAAGGCG TCCGCCAGCT ATCTGGACAT CGAAAGAGTG AACATCCTGC
GTCCGATAAT GGCAAGGGAA AGGAAAAAGG AAGTAATCGC ATGAATAGCT CTCCTAAGAA
GAATAGAGCG AAGAGCAGTG GAGGTGAGGC GGACAAGTGT GCGGGAAAGA ATGGGTTTTC
TCGGTTCCCC TTTAAAGGGC TGGACAACTC GCAGCGTCTT CTGCTTCGCG AAGGTGTCGT
TGACTCCATC AAGCCTTCTT TGTGTGGAAA CGAAGGCAAC AAGAATGTCA TCTTGGTGGT
TGGAGATGGT ATGGGATGGG AGATGGTCAG ATCCGGTGCT ATTGCTAAGC AGGTGGTGGA
TGAGCTGGAA GGTCTCGGTT GCGATACCAC CACAGGTTGC CCAGACAACA GTGCTGCGAT
GAATGCTTTC CGTGGACGAA CACTAGACGA CTATTATACT GAGGGTAAGC TTTCTTGCTG
CAAGTTATTG CTGTGTAAGA GTAAACAACC CAAAGCCTCA CGACCAACTC CTGCTTTCTC
TTAAGGTAAG GGAAGTGGTA TGTCTTTCCA AGAGTTGGAT GGCTATGCTT TGATGACAAC
CACCACCACT GTCACTCAAG AACCCAACCC TGGAAACCAC TATGCTCCTT CTCGAAGTCT
TCTTGAAGGT GATGTCTCTG AGCACGAGAG TGGTCAGGCT GCCCTTGCTC TTGACGAGTG
TGGTTTCCCG ATCGATTTCT CTCCGCTTGA CTTTGAAGCC GACGGCGGCA ACATGGTCCT
TTGGGACAAT AAAATGGGAG GAGAATTTCC TTGGGACAAA CGCTACTATC AGGAGCGTCC
CGATACTTCA ACCGGATTCG ACCCGGAGTA CATTATGCGT CATGCGACGG ATTCGGCTTC
TACAGCTGGA ACAATGGCCA CTGGTCACAA GGCTGCCGTG AACATGATGT CACAAACACT
TTACGAAGAA GACGTTAGCA CGCTTGTGGA AGACGCCATG TATTGCGGTA TGGCTGGAGG
TGTCGTTACT TCCGTTCCCA TGCTCCATGC TACTCCTGGA GCCTTTGTGA CGCATACGAA
CTCCCGCTCC GATCGCGACT CCTTACGTCG TAGCTTCATG CAGGTTCGTC CCACAATGGC
CAGTGGTGTT TGTGGAGGCC GCTACTATCC CTTCGAAGAA GACCTGGAGA GCATGATGAA
CGGCGCCCTT TCCAGTGAGT GGACCTTTTT GTACCAGAAT AACATGACAA CGGCCGACGC
TTTCTACGAC CCGATTGCGG ACCTTGATCC TGACAATGGC GATCATCTCC TTGTTTGCTT
GGGTGGCGAC TACACCACTA GCGGCCAACA AAATCTTCCG TACCGTGGTG TTGACGGTAC
ATACTCGAAT CGTTGGTGTA GTTCTGGTGA AGGACAAACA GACCCCGATA CTGGTGCTGT
GATCGGAATC ACTGCCACAA CTCCAGATGA ACTCTGCAAC CATTACGAAC AGGAAGAAAT
TGAACAGATT CCCCATATCT CCGAAAATGT CAAAGCTGCT TTGGACTTTC TCGGAAAGGA
CGATGATGGT TTCTTTCTAA TGTATGAGCA GGGAGATGTA CGTTGTGTTC CTGTCTACCC
ACCCCTCGCT TTCTCCCGGA ATCTCACGTG TTTCCACTTT GTATCTCTTA GATTGATTGG
TCCGCTCACG CCAACCACAT GGACGACATG ATTGGAACCA TGTTTGACGT TTCGGAGTCG
GTGCAGGTCA TCATTGACTG GATCATGGAT AACGGTGGCT GGGATAAGAA CGCCCTCTAC
GTCACTGCCG ACCACGACCA CTACCTTACT CTGAAGGACA ATTTTCCCGA GGCCTTGGCC
CACTTGCTCA TCCGCGGTGA ATCCCACAAC ATTACGCCTC AGAGTAATTC CGGCGTAAAC
CCGTGGGATG CCGGTATCGG AGTCGGTCGT CACGAAGATG ACTCCCAGAG TGTCACCGAG
CATATTAACG ACTTTTCTAC CTGGTCGGAA GACGACGTTG ACGCGGTGGG CCACTTCTGG
GGCGCCAACG GTTCCGGCGG CAACGGCTGG GGTAGCCACT CGACGCGCCC CGTCCCGGTC
AGTTACATGG GAGACGATGG CTGCATCGAA GCGTTGACTG GTACCGGCTT TCAGGTTCTT
GGCCGCGACG TGAAGGGGCA TCACGGTAAA ATCGACCAGA TGCATTTGCA CGCTTGCATG
CTCAAGAACC TGTTCGGTCT CTAATCGGAT TCTTGGTGAT TG
 
Protein sequence
MKFSTAVVSL ITVAPLVVGA AEESRYLKTG MKSKSGKGKG VRQLSGHRKS EHPASDNGKG 
KEKGSNRMNS SPKKNRAKSS GGEADKCAGK NGFSRFPFKG LDNSQRLLLR EGVVDSIKPS
LCGNEGNKNV ILVVGDGMGW EMVRSGAIAK QVVDELEGLG CDTTTGCPDN SAAMNAFRGR
TLDDYYTEGK GSGMSFQELD GYALMTTTTT VTQEPNPGNH YAPSRSLLEG DVSEHESGQA
ALALDECGFP IDFSPLDFEA DGGNMVLWDN KMGGEFPWDK RYYQERPDTS TGFDPEYIMR
HATDSASTAG TMATGHKAAV NMMSQTLYEE DVSTLVEDAM YCGMAGGVVT SVPMLHATPG
AFVTHTNSRS DRDSLRRSFM QVRPTMASGV CGGRYYPFEE DLESMMNGAL SSEWTFLYQN
NMTTADAFYD PIADLDPDNG DHLLVCLGGD YTTSGQQNLP YRGVDGTYSN RWCSSGEGQT
DPDTGAVIGI TATTPDELCN HYEQEEIEQI PHISENVKAA LDFLGKDDDG FFLMYEQGDI
DWSAHANHMD DMIGTMFDVS ESVQVIIDWI MDNGGWDKNA LYVTADHDHY LTLKDNFPEA
LAHLLIRGES HNITPQSNSG VNPWDAGIGV GRHEDDSQSV TEHINDFSTW SEDDVDAVGH
FWGANGSGGN GWGSHSTRPV PVSYMGDDGC IEALTGTGFQ VLGRDVKGHH GKIDQMHLHA
CMLKNLFGL