Gene PHATRDRAFT_49722 details


Gene Information

Locus tag: PHATRDRAFT_49722
Symbol:
ID: 7198417
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011692
Strand:
Start bp: 63994
End bp: 65869
Gene Length: 1876 bp
Protein Length: 431 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002184562
Protein GI: 219128736
COG category:
COG ID:
TIGRFAM ID:
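
The gene length reported above is consistent with the start and end coordinates if both are treated as inclusive, 1-based positions on the replicon (65869 - 63994 + 1 = 1876). A minimal sketch of that arithmetic follows; the function name `gene_length` is hypothetical and the inclusive-coordinate convention is an assumption inferred from the values, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(63994, 65869)
print(length)  # 1876
```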


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.703059
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACGAAGCGGC CTGGAGCGCG AACACAAAAT CACACAAGTC TTCCGTGTCA GCACGGTTCG 
TCCTTGGTTT TTCTAGAGGC TTCCAATTGG GACCATGATC GCTGTATCTC CAGAGCCTAC
GGGATCTGGA GAACCTGCAG TCTCTGCGGA ACATGCCTTA AGCGTAGTAG GACAGCTTTT
GGGTGCTTCC TTCCTTCGCG AAGCCGAAAA GAACATCTGC TCGTGCTCGG GTTCCGTTGA
CAGGCTCCGA GAGTGCGGCA TTCTGCCCGA CACGCCCCAA ATGTCTCACT TTAGCCTCCA
ATGTCCTTCC ATATCGCTCG TTCACGAAAA GCATACGAGC GTCGATCACC GCACTCGTCA
AACTGCGCAT GACTTGGCCA TTGCCCTCCT CTCGCGTCCC GTTGTCTTGC GGCGGGCCAA
CTACCATTGC TCTTCCAGCA TGAAACCGCA AACCAGCCAG GGTGCTTCCA ATCCCGTCTT
CCAAACGGAC TCATTGCCGC AGCTGTCGCA GCAGATTCTT GAAAACGCGT ATCAATCCTT
CACCGTGCTC ATCGACAGCC GCCTGCGTGC CTACGCGAGC TTTCTGGCAC GACACGCAAT
GGCCGTCGCC GATGAAAAGA CCAACGAGAT GGGCATGTTC AGCGTGGAGC AAAAGCTGGA
GACACTCTTG GATGTTGGCG GTAAGATCAC CGTTTCGCGA GTCTCCACGC GCTTTGATGT
GGCGGAAGTC GAGGGCGTTC AGGAAGGCGA TCACTACTCG TTTCCTCTTT CGTTTTACGT
CGAAATGTCG CTGATGATCC CGCGTCCTCT GGCCACCGAT GAGATGGTCT CCGTTGCCTT
TTCCGCCCCG GGAACGATTG CGGGTAAGTC GCTTTTGCCT TTTGCCCTCG AGACTTTGCG
ATGGCTACCT CAACACGCAG CGTACGTACT CACCAATAGA TTTGTGCCTC GATTGTGTCA
ACAGCAATGG TTGGCGAAAA GCAGATTCTT TCTCAAGTCT CGGTCTCGTT GAATGTGGAT
GCACTTCTAT CGGAGATGAT GGACCGCGCG TCTTGTATTG TGGCCGCGGT AGTGGAGATC
GCCAACAACG CCTTTTGCAT TCCGGAAGAG CCCAAGAGTA TCCAACGCGG TGACAGCTAT
CTCGCTATGC CGCCACCACC ACCCCCTCCG CAACCCATCC CCAGACAAAA TTCGACGAAT
CTGAGGGCCA AGCTCGTTAA CCCGCTTGAA CTTCTCAGCA ATGCCGCTGC CGAACTCCCG
GTTGTTTCGC CAGATCTGTC CGGCCTGATG TCCCCAAGAC ACGGTGTGCC ACCTTTGACC
TTGGAGATCC CTACAGTGGA TCCCTACCTG GAGGACACTG ACGATTCCGA GAAGGGTGTT
TCCGAATTCT CTGCCGATCA GTGCGCAGAT ATTGTCGACG GCGTCTTTGG CGCTCTCGAC
GATGCCTTTC TCAAGGAACC GCGCTACAAA AAGGCCAAGG GGCAACCCTG ACCAACGCCC
TTTTACCCAC TATATACAAA GTGACACGAT GTCGCAACTT CTACAATCTC CAACCAAAAC
CTCGAAAATT CCTCTGTTGC ACAGAACCGA TCATTCCACC GCTTCAAGTC TTCGCATTCT
GCTGTTTCCG AGCTATACAT TTACCCGCGC ATCAATTCTC AAGTTTTGTA TCTCTGCTCG
GCGCAACCTC TGTAGACATC CCAGACAGCC CTCTCATCTG ACGTACCCAT CCACTTATCT
CAATATTGAA TCAATTATCA GCTATAGGGC AAAAGGCTGC CCACGTTTCC TTCTCCATGT
ACTGTTTTGA CGCCTTTCGT CTGATTCCGT AGCTAAATAT CACGAAATAG AACACTTTAA
CACGGACGAT GGACCT
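
The sequence above is printed in groups of ten bases, so it has to be joined before its length or GC content can be checked against the Gene Length and GC content fields. The sketch below shows one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first two lines of the sequence are used as a short demonstration.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two lines of the gene sequence above (demonstration only).
example_block = """
ACGAAGCGGC CTGGAGCGCG AACACAAAAT CACACAAGTC TTCCGTGTCA GCACGGTTCG
TCCTTGGTTT TTCTAGAGGC TTCCAATTGG GACCATGATC GCTGTATCTC CAGAGCCTAC
"""
seq = clean_sequence(example_block)
print(len(seq))                   # 120 bases in these two lines
print(round(gc_percent(seq), 1))  # GC percentage of the excerpt
```

Applying the same two functions to the full sequence block should give a length that can be compared with the Gene Length field and a GC percentage that can be compared with the GC content field.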
 
Protein sequence
MIAVSPEPTG SGEPAVSAEH ALSVVGQLLG ASFLREAEKN ICSCSGSVDR LRECGILPDT 
PQMSHFSLQC PSISLVHEKH TSVDHRTRQT AHDLAIALLS RPVVLRRANY HCSSSMKPQT
SQGASNPVFQ TDSLPQLSQQ ILENAYQSFT VLIDSRLRAY ASFLARHAMA VADEKTNEMG
MFSVEQKLET LLDVGGKITV SRVSTRFDVA EVEGVQEGDH YSFPLSFYVE MSLMIPRPLA
TDEMVSVAFS APGTIAAMVG EKQILSQVSV SLNVDALLSE MMDRASCIVA AVVEIANNAF
CIPEEPKSIQ RGDSYLAMPP PPPPPQPIPR QNSTNLRAKL VNPLELLSNA AAELPVVSPD
LSGLMSPRHG VPPLTLEIPT VDPYLEDTDD SEKGVSEFSA DQCADIVDGV FGALDDAFLK
EPRYKKAKGQ P