Gene PHATRDRAFT_47250 details


Gene Information

Locus tag: PHATRDRAFT_47250
Symbol:
ID: 7202303
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 81550
End bp: 83585
Gene Length: 2036 bp
Protein Length: 574 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002181477
Protein GI: 219122283
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.110596
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATATT GGAGGGACGA TCGACGAACG ACACCCAACA AACGCAAATT ACGGTACGGT 
ACGGAAAAAC CAATCTCTAG CTAGCTAGCT AGCTCTAGTA GTAGTAGGGG AATATGCGAG
GTGAGTGTCA AAGTGTCAAG CCTGGTTTTT GGCAAATAGG TACATGTGCG AGTGCCACCC
TCGGCCCCCA CTGAATAGTA CGTTCGTCCA GGGTTCCGCT TTCGCAGTAA ATGGCCACCT
GCTGCTCACC TTCCAATGGA CAACTTACTT ACATACTTAC AGATACTCTT ATTATCCGGA
GCAAAGAATA GCAGCGACGC ACTCCGCACA CGACCTTTCC ATCAACAACG TGGCGATGAC
TCGTACCCTC CGGAGAGGAA ATCAGCTTTT CTCCGTAGCC GCGGTGGTCT GGGCCTACGC
GTACGTCAAC TGGAATAGTA ATCCCTACAA GTTGTTGAGG ACGGCCGAGG CGTTGGCGCC
TCCCATCAGC TACAGACGGA CAGTGTCACC GCTGGGCTAC ACGTATCGGA CGGATGCGTC
TCGACGAATA CAGTTCTTTC CGTTGACTGC GACTAGAAGC ACCGGTTCCG CCGTCGACAC
CGTCAAAAAT GGCGACCAAG ACTCTTCGAT TAGCGTACCG ACCGGCAAAC GTTTCGTGGC
GCGTCAACTA GCCTCGCTCC CCCAACGCGC CGTGCGCATC TATTCCGAAT ACGTCAGTCG
ACTATGGAGA GAAACCAATC CCGAAGCTCG TCAAATTATA TCCCAAGACA AGGCCGCTAC
CGCTATTCGT CGGGTGCAGC ATCTGATCCG AGGCGAGCAG CTGGCGGGGG TCGTCTCGTG
GGAAGAAAAG AACGGATTGG CACTTGCGTG TGACCACGTT TTGGAAGCCA TTGAGCAGGC
CCATCGAGAG ACGGCGCGCA ACATTTCCGC CCCCGTCACG AATGGAATGG ACATGTCCAT
CGAGTCAAAG CCCACAAGCA AAACAGCACC ACCTCCTACG TTGCAGAAAA AAAGCCGCTC
GGTTTTGTTC GGTGCCATCA TGGGTGCCGT AGTAGCGTGC TGGGTCTTTT CGGGAAACTA
CATTTTCACC GGCTTATTCA CGCTCATGAC CTTGCTTGGT CAATTGGAGT ATTACCGCAT
GGTCATGGGG ACCGGAGTCA ATCCGGCACG GCGCATATCG GTCTTGGGGT CCTGCTCCAT
GTTTTTGACG GCCCTATTTA CACCGAGTCT CCACGAAATT TGTTTGCCCA TATTCGGTCT
CTACGCCATG ATTTGGTTCT TGACCATGAA ACGGACCGTC ACTACAATTC CGGAAATTGC
CACGACTTTT ACCGGTATGT TTTATTTGGG TTACGTTCCT TCTTTCTGGG TCCGTATACG
GATACTAGGT ACCCAAGAAC CAACGCGATT GGCTTCGGTC GCCGAGCCCT TCTTGCGCTT
TCTCGCGGAC AAGTCGCAGG CTAAACTCGT GCCCAGCTTT ATCCCACAAG CTGTGGTGCT
TCCCATCACC ACCGGGTCTA TCTTTATCTT TTGGACTTGG CTGTGCCTCG CCTTTAGTGA
CGTCGGAGCT TATTTTGTCG GGCGACGGTA TGGGAATACC AAACTGGGTG CAGTCGCGCC
CGCCGCGGGA GCAACCAGCC CCAACAAGAC TGTCGAAGGG GTACTGGGAG GGTGTGCTGT
CAGTGGTCTA TTGGGTGTAT TCGGTATGTC GATGCTTTAG GAACTTTCGG CTCGGACAGA
GAATACGCTG CTAACCCATC GTTTTCGTTT CTTTCTGTTA CGTAGGAGCT TGGGCACAAA
AGTGGCCGTA TTGGGGCGTC ACCGGGGCCG TACACGGAAT ATTATTGGGT CTCATTGGTC
TCATTGGAGA TCTGACGGCT TCGATGATCA AACGCGATGC CGGCGTCAAA GATTTCGGCG
ACTTGATTCC GGATCACGGT GGCATTCTGG ACCGCGTGGA TAGTTTCATT TGGTCGGCAC
CCTACTCGTG GCTCGTCATC AACTCTGTCA TACCGTTTTT GAAAAGCGTC GCTTGA
 
Protein sequence
MKYWRDDRRT TPNKRKLRYS YYPEQRIAAT HSAHDLSINN VAMTRTLRRG NQLFSVAAVV 
WAYAYVNWNS NPYKLLRTAE ALAPPISYRR TVSPLGYTYR TDASRRIQFF PLTATRSTGS
AVDTVKNGDQ DSSISVPTGK RFVARQLASL PQRAVRIYSE YVSRLWRETN PEARQIISQD
KAATAIRRVQ HLIRGEQLAG VVSWEEKNGL ALACDHVLEA IEQAHRETAR NISAPVTNGM
DMSIESKPTS KTAPPPTLQK KSRSVLFGAI MGAVVACWVF SGNYIFTGLF TLMTLLGQLE
YYRMVMGTGV NPARRISVLG SCSMFLTALF TPSLHEICLP IFGLYAMIWF LTMKRTVTTI
PEIATTFTGM FYLGYVPSFW VRIRILGTQE PTRLASVAEP FLRFLADKSQ AKLVPSFIPQ
AVVLPITTGS IFIFWTWLCL AFSDVGAYFV GRRYGNTKLG AVAPAAGATS PNKTVEGVLG
GCAVSGLLGV FGAWAQKWPY WGVTGAVHGI LLGLIGLIGD LTASMIKRDA GVKDFGDLIP
DHGGILDRVD SFIWSAPYSW LVINSVIPFL KSVA
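
Working with this record: the sequences are displayed in space-separated blocks of ten characters, and the Gene Length field follows from the Start bp and End bp fields. The Python sketch below is one way to join the blocks and check the record's length arithmetic. It is illustrative only and not part of the database record. The function names are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is consistent with the 2036 bp reported here.

```python
def clean_sequence(blocks: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(blocks.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGAAATATT GGAGGGACGA TCGACGAACG ACACCCAACA AACGCAAATT ACGGTACGGT"
seq = clean_sequence(first_row)

print(len(seq))                   # 60 (six blocks of ten bases)
print(span_length(81550, 83585))  # 2036, matching the Gene Length field
```

Passing the full gene sequence through clean_sequence and gc_fraction gives a value that can be compared with the GC content field. The rounding used by the database is not stated in the record.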