Gene PHATRDRAFT_48737 details

Gene Information

Locus tag: PHATRDRAFT_48737
Symbol:
ID: 7195000
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011687
Strand:
Start bp: 111170
End bp: 112645
Gene Length: 1476 bp
Protein Length: 331 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002183292
Protein GI: 219126078
COG category:
COG ID:
TIGRFAM ID:
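
The coordinate and length fields above are consistent with inclusive, 1-based coordinates: End bp minus Start bp plus 1 gives the listed gene length. The sketch below checks that arithmetic. The inclusive convention is an assumption inferred from the values, not something the record states, and the function name `span_length` is hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the record above.
gene_length = span_length(111170, 112645)
print(gene_length)  # 1476

# A 331-residue protein needs 3 * (331 + 1) bases including a stop codon.
coding_bases = 3 * (331 + 1)
print(coding_bases)  # 996
```

The 996 coding bases are fewer than the 1476 bp span, so the remainder is non-coding sequence, which fits the "Is gene spliced: Yes" field.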


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0824211
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AAAAATCTTC TTACGTCAAT CGTAGACAAT GTTACTGTTA GTCTGTCTCC AGTGGTCTAT 
ATTACTCCTT GCGTTCGAGT CAGGGCCCTT TGCTTCTCAG CTAACTCAAA GACATAACTC
ATGCATCTAG TTTCAATTGC GGGTCGAAAT TTGCTTTACA GCCATTCGAA AAATTATCGG
TCTCCTTGCA GTCTATTATG AACTTTTCTC AACCTCGTTT TGCGAGGCAT CGTAGCTGGC
GAAAGTTGCT TCCCGTACTT ACGGTTCTCA GTCTACTCTC TTTTGGATTC TGGAATTGCT
TTGCTCGGAC CGGGGATTCA CTAGAAAGCG AGATTCGCAA AGATGCCCTC GGCGCGCCGA
ATGGTATGCC GGCTGAGCTT TCAACAGCGC TAGCTTACAA GCAAAGTTTT GGCTTTTTTG
ATGATATTCT TGACGGGGCA TGGAGGAAAA TGCAGGAACG TGCCAGAATT TTTATTCAAT
ATTCGAATCC CCACAATCCA AATCAAGGTC AAACCGATTC GGCCAGGTGG TATGTTGAGA
ATCTTGAACC TGATTTTACT TGTCCACAAG TTCAGCGGGT CGGAGGACAC GGTGATGGAC
CGAAATGGAC CTGTGATCCT AATCGGCTTT TGAAAGAAGA ACCATGTCTG ATATACTCGG
TTGGCTCCGC AGGTAAATAC CAATGGGAAG ACGGCCTGAT CCACCTCTTG GGAGGTACGC
ATTGTGAGAT TCATGTGTTT GATCCGGGAG CTTTTGCACG ATCCAGGGAC GTGGAGGACA
AAAATATTCA CTACCACCAG TGGGGATTCT CAAGCAGCTA TGTTAAATCA TTTGTACCCG
ATATTTATTC CATGGGAGAA GCTTCCGGCA AACCAGTTAT GAAGACATTT CAGGATACCT
TACGAGAGCT CGGTCACGTA CATCGTACAA TCCATGTTCT GAAGCTAGAT TGTGAGGGCT
GTGAATGGTG AGTAAGACTG TGTTCACCCC ACTCTATTGC CGATTGCAAA CATGTCTGGT
TTATTTTTCC GTCGCGGATT GCTCACTTGT TGATGTTTGC GAAGGGTGAA CTACAGGGAT
TGGATTGAAC TGGACATTAG GCAGGTATTG ATTGAGACGC ATCAGCTTCC TGATCGACGA
GCCGGGCCTG GTGCGCTGAC GCCTTCCACT TTTTTTGATG AATTTCGGAA AAACAACTTT
GCGATGTTTA GTAAAGAAGC AAATGTCATC GCCCAAGGAA CATGTGTTGA GTTTGGATAC
GTCAAGCTAC ATCCTGACTT CTGGCACTGA TAGTGATAAC GCATGTCTTT CGCTTACTCA
CCTTTGAGCG ACCACTGACA GTGAGTGCAG GACGCAGTTC GCTACTTGCG GATTTGAAAG
TTCTCTATAG TTTAATTTGC GAGGCTTTGG AGGATAAAAT GCAATGGTAA AAACCGATTT
CACAGGCAGA CTTATATAGA CAGACAGTAA ATTGTC
 
Protein sequence
MNFSQPRFAR HRSWRKLLPV LTVLSLLSFG FWNCFARTGD SLESEIRKDA LGAPNGMPAE 
LSTALAYKQS FGFFDDILDG AWRKMQERAR IFIQYSNPHN PNQGQTDSAR WYVENLEPDF
TCPQVQRVGG HGDGPKWTCD PNRLLKEEPC LIYSVGSAGK YQWEDGLIHL LGGTHCEIHV
FDPGAFARSR DVEDKNIHYH QWGFSSSYVK SFVPDIYSMG EASGKPVMKT FQDTLRELGH
VHRTIHVLKL DCEGCEWVNY RDWIELDIRQ VLIETHQLPD RRAGPGALTP STFFDEFRKN
NFAMFSKEAN VIAQGTCVEF GYVKLHPDFW H
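
The sequences above are printed in groups of 10 residues, 60 per line, so the spacing has to be stripped before lengths or base composition can be computed. The following is a minimal sketch of that clean-up. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the record does not say how its own GC content figure was calculated.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final line of the protein sequence above: three groups of 10 plus one residue.
tail = "NFAMFSKEAN VIAQGTCVEF GYVKLHPDFW H"
print(len(clean_sequence(tail)))  # 31
```

Applied to the full blocks, `clean_sequence` should return 1476 characters for the gene sequence and 331 for the protein sequence, matching the Gene Length and Protein Length fields.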