Gene PHATRDRAFT_49982 details


Gene Information

Locus tag: PHATRDRAFT_49982
Symbol:
ID: 7198770
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 26465
End bp: 28525
Gene Length: 2061 bp
Protein Length: 614 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002184877
Protein GI: 219129398
COG category:
COG ID:
TIGRFAM ID:
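
The start and end coordinates above match the stated gene length if they are read as 1-based and inclusive (28525 - 26465 + 1 = 2061). The sketch below checks that arithmetic and compares it with the coding length implied by the 614 aa protein. The function names are illustrative, and counting the stop codon as part of the coding sequence is an assumption, not something the record states.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein of protein_aa residues.

    Assumption: one codon per residue, plus an optional stop codon.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


gene_len = span_length(26465, 28525)
cds_len = coding_length(614)
print(gene_len)            # 2061
print(cds_len)             # 1845
print(gene_len - cds_len)  # 216
```

Under these assumptions, 216 bp of the gene span is not accounted for by codons, which is in line with the record marking the gene as spliced.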


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.806954
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATGGGGCCA TTCATGCTGT GCAATAGTTG ACTGTGATCG TGAGACTTTC GCTTTTTCAG 
ATTGACTGCC TGGATTGATA CGGTTCAGCA GTCGACCATG TCGGAAAGGG GTTTGTCGTC
ACCCCGCGGA AAGTCTACTT TCCCAGTGAT ACGTCGCAAC AGTTTCAACG AAAGCGAGTA
CAGTCGATCA TCCTACGAAT CGTTAGGTCT CGTCGATCAT GTGCATCGCC TGGAGGCGCT
TTTGTCCAAT TCGCTAGAAA CCAGCAGTAC ACTGCAGACA TGTGTCGATG AAGACGATGA
CTTGGCAGCG GAATTGAAAC GGCTAGAACA CGCAGAGCAG TTGCTGCGGC AAGAGTTGGA
AGACCTCGAA GTAAATGGCG TCCCGAAGCC GGATTCATCG ACGCAAGGCG TTTTTTCGAG
CGACAATCGA GCTCGCGAAA GGACGCTGAG CGCTTCCAGA GAGGAACTGA CAGTGAGCAA
TGTAGAAGAC GACGACGACG ACGCAGACGA TATGTTTAAC TATATGCTGA ACTTTACGAC
TGGAGCGTCT CCCCTTTCCA CTTCGAAAGG ACGCCTGAAA ACAGACAGCG ACAGGTTCGA
TGCCATTTCA ACCCCCCCGC ACACACCGCC TCGAACCGTC CTGTCACCAC TGCGGCAACG
TCGCTTGGAT GTCCTGGGTT TGGATGACAA CGCTTCTTTC CACGAACAAA GCTGCGCCCA
AGAAGATGGC AACACAAGAC GGTCGTCGAG TTGGAGTGTG GAGGATTGGT CCTCCTACAA
TCAACCGTCG GAGGGGAGCC TCGAGTCGGA GAGCATCGAG TCTGGATCGG TATTCTCCAA
TCTAGGCGTC GCCAGCGTAG ATCGCTCCTC CTCCTTTCCA GTGGACGTGC TAGCCCGTGG
CGATGTTGAC GATGAGGATA AACCTGTTAA AGCGACAAAG GTTGATGTAT TGGCTGATGG
CGATGTGGTT GAAGTGGAAG AGATCGATGA AGTGCCTCTC GAGGCTGCCG ATACACTGGC
ATCCTCAACA CCGTGGGCCA GGAGAAGGAT TCACAATCAA ACAGAGATGC AGCAGAGTAA
ATCCACAACA CCAATCATTT CAAGTCGAAG TATTACGACT TCCGCAGACA CAGTTCAAAG
CAGTTCTATC GAAGTTAGTC GGAGTCCATC ATCAGCAGCC ATACCAAAGA GAATCCCCGC
AATCACCCCC CGCCACACAA AGACACCAGA ATATGATCAC TTTCAAGGCC CAGATACCTC
CTCTCAACAT ACCATTCCCA CTCGAAAATC ATCGCGAAAG AAAAATGAAT GGTCGGCCAT
GCTGTCCCAT TCGGGGAAAG CAAAGGTCAT ACTGCGTCCC TTGGCCAATG ATCGTCCTGC
TACAACAAGG GATAATTCAA TGTCTCCTCG TCTCTTTCCA AAGACTCGAA ATTCGCCTTC
GGCGTTTGGC AATGGACATG CCGTCGAGAC CCGCGGTATA CTCCAAACGG AAACTTATCC
GTCGTCGATC TCCAAGTGCG ACGATCCCCA ACGATTCGAA GATTTACAAT CTTTCGGAGC
GGGGAGTCAA AATCCGGCCC AGCAACAAGC CAATGACGAA AGCACATCGC GGCGGGTTGG
AAGTCTCGTC TCGGCAGCTC CGAGGCAGGA TACACGCCGG CCGAAGGTTC CTCACCGTAT
CGCGATGGGC TATGCTCATC ACCAACTGCG AGACGATGAC GATCAGGATG CGCAGGTGTC
TACATGCGGA TGGAAGTTCG GTGGCGAAAG GACTTCGAAC GGAATATTCT TTTGCTTAGC
CTTGATTGCA TTGGTTCTGT TAGTTGCGGT GCCCACGGCG TTCATCTTGG GAAGTCGATA
CAACAAACCT TAAACCCCTC GCTGCTTTCT GTAAATTTTA ACCGAGGCGC CGGTGCACTT
TCCCGAATTG ATGACAGTAC CCGCTCTCCT TGCTCCGCTG TTGCCGCCCG CAAGGCCACG
TTACGCTATT CCATCGACTA TTATTCTACA GGCGGTGCAC CGCACCTTGC CGGTATCCTG
CACCTTTTTC AATATCACTA A
 
Protein sequence
MSERGLSSPR GKSTFPVIRR NSFNESEYSR SSYESLGLVD HVHRLEALLS NSLETSSTLQ 
TCVDEDDDLA AELKRLEHAE QLLRQELEDL EVNGVPKPDS STQGVFSSDN RARERTLSAS
REELTVSNVE DDDDDADDMF NYMLNFTTGA SPLSTSKGRL KTDSDRFDAI STPPHTPPRT
VLSPLRQRRL DVLGLDDNAS FHEQSCAQED GNTRRSSSWS VEDWSSYNQP SEGSLESESI
ESGSVFSNLG VASVDRSSSF PVDVLARGDV DDEDKPVKAT KVDVLADGDV VEVEEIDEVP
LEAADTLASS TPWARRRIHN QTEMQQSKST TPIISSRSIT TSADTVQSSS IEVSRSPSSA
AIPKRIPAIT PRHTKTPEYD HFQGPDTSSQ HTIPTRKSSR KKNEWSAMLS HSGKAKVILR
PLANDRPATT RDNSMSPRLF PKTRNSPSAF GNGHAVETRG ILQTETYPSS ISKCDDPQRF
EDLQSFGAGS QNPAQQQAND ESTSRRVGSL VSAAPRQDTR RPKVPHRIAM GYAHHQLRDD
DDQDAQVSTC GWKFGGERTS NGIFFCLALI ALVLTRSPCS AVAARKATLR YSIDYYSTGG
APHLAGILHL FQYH
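
Both sequences above are displayed in space-separated blocks of ten characters, six blocks per line. A minimal sketch of how such a display could be turned back into a contiguous string and summarised follows. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch does not attempt to reproduce how the database itself computes the GC content it reports.

```python
def clean_sequence(display_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(display_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First display line of the gene sequence, copied as shown on the page.
first_line = (
    "CATGGGGCCA TTCATGCTGT GCAATAGTTG ACTGTGATCG TGAGACTTTC GCTTTTTCAG"
)
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Running `clean_sequence` on the full gene sequence should give a string whose length can be compared with the gene length in the record, and the same helper works for the protein sequence.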