Gene PHATRDRAFT_49236 details

Gene Information

Locus tag: PHATRDRAFT_49236
Symbol:
ID: 7195534
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 321454
End bp: 322832
Gene Length: 1379 bp
Protein Length: 396 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002183854
Protein GI: 219127254
COG category:
COG ID:
TIGRFAM ID:
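
The reported gene length follows from the start and end coordinates listed above if they are read as 1-based and inclusive, which is an assumption here, though it matches the 1379 bp figure. A minimal sketch of that arithmetic (the function name `span_length` is hypothetical, chosen only for illustration):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp taken from the record above.
print(span_length(321454, 322832))  # 1379
```

Because the gene is marked as spliced, this span is longer than the 396 codons of the protein would require on their own.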


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.011629
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCTACAGGG AACGCTACAA ATCTACCATC ATGCATTTAT CTCTTCAGGC CATTAATGCG 
GACGACTTTG CTGTACTTGC CAACAAGCTA AAAGCTGAGC TTGGCCAAGA GAAAGGGAAC
CATATTCACG GGAGCATTAT TCCGCACGAA AATGATATTC TGCTTGGCAG AGGTACGATT
TCTAAGAGGT TCGAGCATGT GTTGAGCGCT CTCACTGATA AAGAAAACGC TTGTGTATTC
GTTGCGCATC TCTCTCACCT GGCTTTTTTC GACCAAGTGT AGGAGGCAAG AATAACCAGC
ACTTCGGAAA TATTCAACTG CGAAACATGA CTCGGCAATT CTGCTCCGCC TATTACGGCG
CCACCAAAAA GGAAAAGCCC GCCGTTGCTC GAGGTTTGGT TCAATGTATC CAGAACTTAA
ATCCACCGGG ACGTTTCCTG AAGCATTCAT TGGGAGGGTG GGAAGAGGCA ACAGCATCTG
TCGCTCAAGA GAAAGCTAGC CAGAGCCTCC GTGATACGGT GGCGTCAGTC TTAAAGGAAG
CAAACATCCA GGATGGCACA CACCCGAAAA ACGAAGTTTT CACTCAAAAA TGTCTTGACG
AACTACATCC CGGGTCGAGT CGCCATCCTA AATCTGCAAC CAAAGATTCA GTCTTATTTC
AAGATTCTAC TGAGCAATTG CCTAACAAAA TCGCCCGTTC ATATCAGGAG AGTGAGTCTT
CATGTCAGGT TCGCATGCCC GATAGCTTAG CTGTGAACTT TGGCGACCAA ATGTTACTGT
CGATGGACGC GACGTCGTCT ATCTCCCTCC AAGAAGAGTT CAATGAAAAT GGCAAGAGAA
GTAGAATCTC TTTCTTAGAA GCCGATTTGA ATAGCAAGAT CCAACGTTTT TCTACGGATC
GCGCCTTCTG CACCATGGGG CTTATAGACA ATTCCGATTT TAGTTCGATG GCTAACTCGA
AGCCCCAGTA CCAGATATCG TCCCAGGTAA CAACCAGTCG TGTGCCATGC ACAACATCGA
CAAAACTCTC GTTCGACACC TGGACTGGAA ATGGTGGCTT TAGCCTCAAT GATGCAGCTC
GGTACGCTCA CCTCGCAGCG CAAAGGAAGA GGCAGTTCAG CACCAGCATA AACAGCGAAA
GCAAGGAAGA TCAAGAGATT GAACTTTTCA GTGCTGATGC GCTCGATTCT ATTGTTTGGG
ACGACGATGA TATCAAGCTT GATAGTACCT CTTCTTTTCA ACTTGCTCCG GGTCAACACC
AGGCCGCAAG CGCTTGGGAC GACGAAGACG CTTTTCGTCG GCGCATTAGG AATCTCTTAC
AGGAATTGTA GCCGCTGATA CATCATCATT TCTAAAATAA TATATTTGCT TGTGTGCGT
 
Protein sequence
MHLSLQAINA DDFAVLANKL KAELGQEKGN HIHGSIIPHE NDILLGRGGK NNQHFGNIQL 
RNMTRQFCSA YYGATKKEKP AVARGLVQCI QNLNPPGRFL KHSLGGWEEA TASVAQEKAS
QSLRDTVASV LKEANIQDGT HPKNEVFTQK CLDELHPGSS RHPKSATKDS VLFQDSTEQL
PNKIARSYQE SESSCQVRMP DSLAVNFGDQ MLLSMDATSS ISLQEEFNEN GKRSRISFLE
ADLNSKIQRF STDRAFCTMG LIDNSDFSSM ANSKPQYQIS SQVTTSRVPC TTSTKLSFDT
WTGNGGFSLN DAARYAHLAA QRKRQFSTSI NSESKEDQEI ELFSADALDS IVWDDDDIKL
DSTSSFQLAP GQHQAASAWD DEDAFRRRIR NLLQEL
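
The sequences above are printed in groups of ten characters, so the spaces and line breaks need to be stripped before the text is used elsewhere. The following is a minimal sketch of one way to do that and to check the length and GC fraction against the values in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the database's own GC calculation may differ (for example in rounding).

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three groups of the gene sequence, as formatted in the record.
fragment = "CTCTACAGGG AACGCTACAA\nATCTACCATC"
cleaned = clean_sequence(fragment)
print(len(cleaned))
print(round(gc_fraction(cleaned), 2))
```

Running the same helpers on the full gene sequence should give a length that can be compared with the Gene Length field, and a GC fraction that can be compared with the GC content field.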