Gene PHATR_36844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_36844 
Symbol 
ID7204669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011679 
Strand
Start bp492940 
End bp494088 
Gene Length1149 bp 
Protein Length382 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185898 
Protein GI219121345 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTGGA ATACGCTCAT CGGGCAAGCC GATCGCGCCT TCCGTCTCGG TATACAACTC 
GAAAAGAACG GGCAACCCCG TAAGGCGAGT GCCTCTTTTC ACGAAGCCGC CACTCTGTAC
CAATGCTATT TGGACTCGGA GAGTGAATTT GGACACGTCA CGTCGCTTTC TCAGGAAGAC
AGTCAAGCAA TTTTAGCCTA CGCCTGTATG CGTCTGGCCT TCCTCAATCT CGACGCCCTT
GGCGACCCCA AAGCGGCGGC TCGATTGTAC AAGGAAGCCT CCGCAATCGA TCCCTTTCCG
TCCGCCGAAT CTTTCGACGG GATAGGCCAG GCACTGGAGG CTGCATTTGG GGGCCAGCAC
TTGGAAGACG CCATTGAACA GTACCGCAAA GCGCTCGAAC TCGCGCCCGA GCGACAAGAA
TCGCAATTTC ACGTCGCGGT TGCCTCGGAC CGCCTACAGC AATCCGACCA ATCCGAGGAG
ATTTTTGAAC GATTGCGCCG GGACGAGTCC AAGTGGAGCT GTCTCGTCGA CTCGTGGGGA
TATGTACGGT GGCATACGCG TAAAATCCCG AACGACAGCT TGTACTTGTA TCGCGGAACA
CGGGATATTA TGGAAGTCGC CTTGAATGCG GCTCTGCCTT TGATCGAACA AGGTGGGCTT
GTTTGCGAAT TCGGCGTAGG TAGTGGGCGA AGCTTGCGAA TGGCACAAGA TATTTTGCCT
TTGGACGCTC GAATTCATGG CTTTGATACG TTCACCGGCC TCCCTCAAGC ATGGGGGACG
GAACCGATCG GGACGTACTC GACCGGGGGA GTCGCACCGA ATATGGAAGG GAAGGTGACC
TTCCACCGCG GTCTCTTTCG TGATACAATC GGTCCTTTTC TCAAAGAACA GGAGGAAAGC
ACCTTTTTGG CGTACGCCAA CGTAGATTGT CAGCTTTATT CCTCCACGTT GGATATTTTG
GAAGGCTTTC ACGGTTACAT TGTACCGGGC ACCATTCTAA TTTTCGATGA ATATATTTGC
CATCCAAGTT GGCGGTATGA TGAGTTCCGG GCTTGGCGAG AATGCTGCAA ACGGTTTGGA
TGGAAGTATG AATATCTTGC GTTTAGTCTC AGCACGAAGC AGGCCGTGGT TCGGCTGACG
ACCGCGTGA
 
Protein sequence
MAWNTLIGQA DRAFRLGIQL EKNGQPRKAS ASFHEAATLY QCYLDSESEF GHVTSLSQED 
SQAILAYACM RLAFLNLDAL GDPKAAARLY KEASAIDPFP SAESFDGIGQ ALEAAFGGQH
LEDAIEQYRK ALELAPERQE SQFHVAVASD RLQQSDQSEE IFERLRRDES KWSCLVDSWG
YVRWHTRKIP NDSLYLYRGT RDIMEVALNA ALPLIEQGGL VCEFGVGSGR SLRMAQDILP
LDARIHGFDT FTGLPQAWGT EPIGTYSTGG VAPNMEGKVT FHRGLFRDTI GPFLKEQEES
TFLAYANVDC QLYSSTLDIL EGFHGYIVPG TILIFDEYIC HPSWRYDEFR AWRECCKRFG
WKYEYLAFSL STKQAVVRLT TA