Gene PHATRDRAFT_48834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48834 
Symbol 
ID7195133 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011687 
Strand
Start bp399322 
End bp401198 
Gene Length1877 bp 
Protein Length597 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183481 
Protein GI219126473 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.935241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTTGCC TTGCTCATCA CCACTGGGGC GGAAGGCTAG TTTCAAGGAT TGCTATGATC 
GGTTTGGTGC TCATTCTTGT CGGTTTGAAT TTGTGGAATG ACTCCGCTCC AATCAACCCA
ACAATTCGAT CTTCAGATAA CTCTGGCACG CAGTCCTCGA GTCTTTCCCG AAGCTTTCCT
CTCCCCTACC GCAATGATAC TCACGAAGGA AACAAAGGTG TAAGTCAGGA TCAGTGGGTG
CAATGCAAAA GAATTGAAAA GTTTTCGGTC TTATTCGTCA AAATTCTTTT GTGGTGGTGC
AGAATGGCAA CGAAAACTTC ACTTCTCCCA GCAATCTAAG TACCACTCCT AATAACGCAA
ATTCCTCGTC AAAAAGGACA ATTTCCCCTG CACTACAACC GCGATCAGAT GCATCAAAAG
ACTCCATATT CACCGTTCAG GAGGTAAAAT CGGCTGAGGC TTTATCTAAA ATGGAAAGCG
AGAGCCTGTC GAAAAATCTC TCTTCTGCGC TGCTCTCGAA GCATTTTCCA ACAGCGGATC
AACGAGTCCG ATTTTACATG TCATCCTGGT ACGAGCCACC ATGCGATCAG GACGAATTGC
TCGAAATTGT CAAGTATGTT GGTAGCAGGG GCAATGAAGG CAGCACAGCG GCCAACGAAA
CCGGAGAGGA GCAAAAAGAT GACAGTATGA CACAATTCAT TCCTTCCTTC TCCCTCCACC
GGCACGTACA ATCCCCACAA ATATCCAGCT CCATTATATT CAAGGGAGCA GCTAGGGCCG
ATACCGACGT CGTATTTGCT TTGGACAAAT CTGCTCTCGA TTCGTGCGAA TTTAATCCCC
GACCAGACCA AAAGAAAATC TTGGAGAGCA TCTATTGTCC AGAACTTCGA GACACATTAT
TACTGCCTTA TCAAAACCAA ATTGGCCTAA CCAACACTTC AAAAAACAAC GAGCTCTCGG
TTGTGTTACT TGCGCAAGTC GGCGACGCCC TGTCATCCAG AGCTATGGAC GATTTTGGAA
AGCCACATGG ATACTCTGCG CAACCTACTG TACCTCATTT TACTAAAGTA CGGCTTGCTT
GGGATAACGT GACTGCAAGA GTTTCCCTAA TGAAAGCATC ACCTAGGTCC TGCTCAACCT
TACAAACACG ACGAATTAAC CGCGGGAAAC TGGAACCAAT TATTTGGAAA ATGGGAATCA
AGCGGCACTA CAAAGGAGTG GAGGACGTCC CCAGTGAGGA CGTACCTTGG GAAGAGAAGC
GGGATGTTGC TGTGTTTCGG GGTACCTCTA CTGGGGATTT TGACCAAAAA ATGCCGGCTC
GCGAGCGTTG TCGTCAAAAT CAGCGCTGTC GGCTTGTGTT GGACTACCAC AATTCGTCTC
TCGTGGATGC AAAGTTTACC AACATACTCA GCAGGAGCAA CCTGCCATTA GCGATCGACG
GTATCCCTAT AAATGGCAGT CATCTCCAGA GATATGAGCA GCTAAGGTAC AAGGCGTTGA
TATTCATGGA AGGCAACGAT GTCTCTACCG GATTGAAGTG GGGATTGTAT TCCAACTCGG
TTGTTATGAT CACAAAGCCA TCAATTTCGT CATGGGCCAT GGAAGAGCTC TTGGAACCGT
ACGTACACTA TGTGCCTTTG AGGGACGATC TATCGGACGT GGAAACGCAG ATGAAATGGA
TCGTGGAGCA CGACAGGGAG GCGAAGGAGA TTGCGTTGCG GGGGCAGCTT TGGATGCATG
ACCTGCTGTA CGCCGAGGAG TCCGAGAGGG ACAATGCGGC AATCAATGAA GAGATTTTGC
GGCGATATCA GACTCATTTC CGACCCGGCA TTGCGGTCAA GGAAGAGCTT CTATTCTATC
CGAAGCCGTT GAAGTAG
 
Protein sequence
MTCLAHHHWG GRLVSRIAMI GLVLILVGLN LWNDSAPINP TIRSSDNSGT QSSSLSRSFP 
LPYRNDTHEG NKGNGNENFT SPSNLSTTPN NANSSSKRTI SPALQPRSDA SKDSIFTVQE
VKSAEALSKM ESESLSKNLS SALLSKHFPT ADQRVRFYMS SWYEPPCDQD ELLEIVKYVG
SRGNEGSTAA NETGEEQKDD SMTQFIPSFS LHRHVQSPQI SSSIIFKGAA RADTDVVFAL
DKSALDSCEF NPRPDQKKIL ESIYCPELRD TLLLPYQNQI GLTNTSKNNE LSVVLLAQVG
DALSSRAMDD FGKPHGYSAQ PTVPHFTKVR LAWDNVTARV SLMKASPRSC STLQTRRINR
GKLEPIIWKM GIKRHYKGVE DVPSEDVPWE EKRDVAVFRG TSTGDFDQKM PARERCRQNQ
RCRLVLDYHN SSLVDAKFTN ILSRSNLPLA IDGIPINGSH LQRYEQLRYK ALIFMEGNDV
STGLKWGLYS NSVVMITKPS ISSWAMEELL EPYVHYVPLR DDLSDVETQM KWIVEHDREA
KEIALRGQLW MHDLLYAEES ERDNAAINEE ILRRYQTHFR PGIAVKEELL FYPKPLK