Gene PHATRDRAFT_47187 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47187 
Symbol 
ID7201963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp748196 
End bp749442 
Gene Length1247 bp 
Protein Length362 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181436 
Protein GI219122194 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAGGTC GTAGCAAAAG ACCCTCGGGT GGACGTATGC TACGCATCAT TGCTGCGGTG 
CTAGTACTGT GCTTATGGGC CTTTAAATTT CCCATATTGT TCGATGAGTC GATTGACCGA
GCAAAGGATT CTCCGGCAAC GCACAGTTCC TCAGGGAATA ACTGGGACTT TCTGACGTTG
CCGCAGACAC CTTTGGCAGC AAACCAAAAC CTGATACAGA TCGAATTTGA TTGGATCAAT
GCACGTTTGC TCGGACCCGA CTCTTACGGT GGAACCGATA AATGCAGTCT TTGTGGGTGT
CCTGGCGTTC CAACCCCTCG AGACTGCCCG CAGTGGTACA CAGTGGAGGA CATCCAAGAC
TCCGTACACT TTATGGATGA TCACAGCGAT GTGCTTATTC CGCACTCACG TGCTTTGTTG
AACAAAAGAC GATTAGAGGC GGAAGGCGAC TGTCAGGTGA AGGAGGTTAC TCAAGCCCGA
GGTGGATGGT GTCTTACACC AAATCCTAAA GGGGAAAAAA TGACTACGGC TAATGGCAGC
TTTCTTGTAC CATTCCATCA TGTTCCTCCA GCAAAGCGTC TCGTCGATGA AATTGACAAG
CTGATTCGAG ACGAAGACGT AAGATCAATA GTCGACTTTG GTGCAGGTGT AGGACAGTAC
AAGTTGGCCT TGACCGAGCG GCATCCTGAT CTACAGTACT ATGCGTATGA TGGGGCTGGT
AATGCTGTTA ATTATACCAA CGACTACCTG GAATACTTTG ATCTGACCAT TCCTTTGGGA
CTACCCAAGG CGGATTGGGT TTTGTCACTT GAGGTGGGAG AGCATGTGCC TAGTAAATAC
GAAGGTATGG TCCTACGGAA TCTGCATCGA CACAATTGCA AAGGAATAAT ATTGAGTTGG
GGTGTTCTTG GCCAAGGAGG ATATGGCCAT GTCAACAACC ACTCCAATGA TTATATCATC
AAAGTGATTG AACAGCTTGG ATATGTTCTA GACCAAAAAT TGACTGCACG ATTCCAGCAA
GCCAAGGATA ATTACTGGTG GTTCAAAAAA TCAACCATGG CCTTTCGACG TACGACTGAA
GTCTGTTAGT GTACCCGCTG GCTGCGTTGA AAAAGAGTGC TGCTTCGGTT TTTCAAAATA
CGATTTGATG ACATTTGGTG ACTGGATCCC GGCTTGGTTG GATGCGATGG CACGGTGATA
GAAAAATTGA AGAGAACGGT TTTCTATAAA CTCAATTGCA CTTATTT
 
Protein sequence
MGGRSKRPSG GRMLRIIAAV LVLCLWAFKF PILFDESIDR AKDSPATHSS SGNNWDFLTL 
PQTPLAANQN LIQIEFDWIN ARLLGPDSYG GTDKCSLCGC PGVPTPRDCP QWYTVEDIQD
SVHFMDDHSD VLIPHSRALL NKRRLEAEGD CQVKEVTQAR GGWCLTPNPK GEKMTTANGS
FLVPFHHVPP AKRLVDEIDK LIRDEDVRSI VDFGAGVGQY KLALTERHPD LQYYAYDGAG
NAVNYTNDYL EYFDLTIPLG LPKADWVLSL EVGEHVPSKY EGMVLRNLHR HNCKGIILSW
GVLGQGGYGH VNNHSNDYII KVIEQLGYVL DQKLTARFQQ AKDNYWWFKK STMAFRRTTE
VC