Gene PHATRDRAFT_48337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_48337 
Symbol 
ID7203556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp151553 
End bp153713 
Gene Length2161 bp 
Protein Length678 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182783 
Protein GI219125011 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAACGTAGGA CTGCGGCCAG TGCTCTTTGG AGAGGAAACG ACTAGAGCAT CCACAGTGAA 
CAAGGGTTTC ACTATCGATA ACCGCACGCA TTGCGACGGT AAGAGTCTGC CGTTTAGTGG
CACCATGAAT TCGCTGTTTC ATCGATTGAA TGATCGCCGC AAATCCCGCA GAAAGTCCAA
GGAAGACGAG CCGCAGTCAT CGTCTACTGC GGCACTGTCC TCGAAAGACT TGCAGAGTAC
GGAATTGGAA CCCACAGCGA GACAAACCCG TCGGGAAACA GCGCGACGGC GGTCTACCAA
AGTGTCGGAA TCGTCGAGAC GCAGTATGAA CGGCAATGCG GGCTCTCCGG TTGGTACATC
AACCGTAAGT GTCAATGATC GCAGTCCCCG AACGAGGAAT ACAGACCCCC CAAGCCCCCG
TCGATCCAAA CGGACTCTGC AAGGGGTACA AGAGGAGCAG CAAAAATCCA TCAATGCGTT
GCTGGGAAAC GGAACTAGAA GACACCGTAC CTCGTTACCT TCAACGCCCG AAACACAACG
CTCGACTCCT GGTTCCGTAG CCGACCGAAC ACGATTGACT CCAGCGGAAG CCACCGTGAA
ACGACTACGA GCGCGACAAA GCAGACCACA TACCGGCAGT GCCAGTACTA GTGCCGTTCC
CGAAAAGAAA TCTGTCAAAG ACAAGGAGCC GCCGCACGAT CAGCGTTCGG AAAACCTGGA
AACCGAATCC GTCGTTGCGG ATCAGTGCAA TGAAGTGTCT CGACAAGCGG CAGCTTTGGA
CCAGGAAGGA AACTCTTTTT TTGAAAAGGG CGAATATGAT CAAGCATTTT TGCGGTACGA
AAAGGCACTG ACTTTGAAGC GTTCGATCAT GGAAGATTTC AAACCCCGCC TTGCCGCCAC
AAAGGCTGCA ACTTCCGCAC AACACGAAGC CTCCCTCGTC GCCTCTGTCG CTACGTCTAT
CAACAACATG ACATACTTGA AACAACGCGC CGGTCAAGCG TCCGCGGAGG AATCGCTGGC
ATCTTATCTA AGAGCGCTTC AGATGAAGCG CGAAATTCTT GGACCCGATC ATCTGTCGGT
GGGGAAGACG CTCAATAATA TTGGTTCCGT CTTTTATCTG AAACGCGAAT TTGAGCCTGC
GCTCAAAGCT TACCAGGATG CCCATGTTAT TTTAGCGAAG CAACTCGGAG CCTCTCATTT
GGATGTGGGG ACAATCATTT CTAATATTGG CGACGTGTAT GCGGCTATGG GGGAGCGTTC
GCTTGCACTG GAAAACTACC ATAAGGCTCT GGACATCCGA TGGACGACTC TGGGGAAACA
AGACCCCAAA GTCGTTCGCC TAATGCAACA AGTAGCCTCT CTGGAAACAG GCTCACAACC
GCAGAAACCG GTGGGGGATT TATCGGATAG TGAAGACGAA GAGTTTGCCA GAGAAGATAG
AGCCCGACAC GAGGTCATTC AGAAAGAGGT CAAAACGTTG AAGAAAGAAC TAGCAGAGGA
TATGAAATTT TTTGATTTGA TGGAGCGACA AATGGCAATT GATATGGTGA AGGATAAAAC
GAGGATCTTT CGAGAAATGC ATGATCTTGA CAAGCAAGGA AAGGAAGTTG GTAAGAATAG
TGAAACAAAA TCCTCTACGG ACTTGGAAGA CAGCTTCTCG AGCGTTCCAG GCGCTGCATC
GCCTAATCCA ATGCCAACCA TACCACACTC TCCGGTCATT GCTGCAGTGA ATGCGCAAAT
GGAACGTAGG ACGCCGATGA AGTCGTCACC GCGGCTAGAA ACTCCGAAAT CCACGAAATC
GTTGAGTGCG CAAGAACGGA ATGAGGCGCT TAGCAGCGTA CGCACGAGAC TAGCCAAGCT
ACGAAACGAT CGTGCCGCAG CCGGGAAAGA CCAAAAAGAA TACGAACGAA AGTCGTACCT
GGCCTCTCTG CAACAAAAGA AGAATGAGGC GACGCCACGA CGGCATTACA TGGACGCTAC
CGCATCTTCT GCTGCAAAGT CCTCGTACAG TTTGTCTCCG ATACCCATCG CAGCCTCGCC
AATTGCGCAT TCAGTGCCGG GTCAAACAAA GATCGATTGG AAAGAGAGCA TAAGCGCTCG
ACGAAAACTC TCTTTGACGC CAGAAGTGGC TCCTATCGAT GTGGAGGTCA ATCGAGCCTG
A
 
Protein sequence
MNSLFHRLND RRKSRRKSKE DEPQSSSTAA LSSKDLQSTE LEPTARQTRR ETARRRSTKV 
SESSRRSMNG NAGSPVGTST VSVNDRSPRT RNTDPPSPRR SKRTLQGVQE EQQKSINALL
GNGTRRHRTS LPSTPETQRS TPGSVADRTR LTPAEATVKR LRARQSRPHT GSASTSAVPE
KKSVKDKEPP HDQRSENLET ESVVADQCNE VSRQAAALDQ EGNSFFEKGE YDQAFLRYEK
ALTLKRSIME DFKPRLAATK AATSAQHEAS LVASVATSIN NMTYLKQRAG QASAEESLAS
YLRALQMKRE ILGPDHLSVG KTLNNIGSVF YLKREFEPAL KAYQDAHVIL AKQLGASHLD
VGTIISNIGD VYAAMGERSL ALENYHKALD IRWTTLGKQD PKVVRLMQQV ASLETGSQPQ
KPVGDLSDSE DEEFAREDRA RHEVIQKEVK TLKKELAEDM KFFDLMERQM AIDMVKDKTR
IFREMHDLDK QGKEVGKNSE TKSSTDLEDS FSSVPGAASP NPMPTIPHSP VIAAVNAQME
RRTPMKSSPR LETPKSTKSL SAQERNEALS SVRTRLAKLR NDRAAAGKDQ KEYERKSYLA
SLQQKKNEAT PRRHYMDATA SSAAKSSYSL SPIPIAASPI AHSVPGQTKI DWKESISARR
KLSLTPEVAP IDVEVNRA