Gene PHATRDRAFT_37886 details

Gene Information

Locus tag: PHATRDRAFT_37886
Symbol:
ID: 7202827
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011682
Strand:
Start bp: 322222
End bp: 323367
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002182046
Protein GI: 219123468
COG category:
COG ID:
TIGRFAM ID:
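
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(322222, 323367)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the script prints 1146 381, matching the Gene Length and Protein Length fields.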


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0323798
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTCGA TAGCGCCAAT CGTATCAGCT TTGGAAGACA CTGAGGAGCC GAACTCGGCT 
TCTGAAAGAC TTACAGCTGA CTGGAAGCAC AGACTCGTGG TACATTTTGA CATCAACGAG
ACGATCTTGG TCGGCGACGA TGCCGGGGGA GATACGCGCG AGGATTGTAT CCATAAGATA
ATCGCAAAAT CTGCGTACGT TAAGATACCT TTCGGGTACG CGGGGGGATC CTACGAGGAC
AACTCGAATT TAGAGCCAAC CGAATGGTGG AATGGGCTTT TAATCAGGGA GGAATACGAC
GAGAAGCTAG CTTTAAATCG GGTTCCTCCG TTGTACACAG GATGGCAGTG GCCACCAGGC
TGCTGCCCTT ACTATCGGAC TGCTTTTAAA AACCGTGCCA GAACTTTTGT GAATCACCAT
GGCTCGTTAT ACAAATCGAC CTATCTCAGA GTGGAAGAGC TACTCCCAAT CCCAGATTCC
AAGCCTGGAA ACGCTTTTTC CGTTTTTGCA CACATGCTAC CTGCTTTTTT TGAAACTGTT
GTAAAGCTTT CCAGCAGACC CCAGCCCTAT ACATTGGTTT TTCGTACCAT GGGTTCTGAT
CTCGAAAAAA TTGCAACAGC GTTCAATGCT TTTGCTTCTG GCAAACATCC CAACTATCCT
AATTTTCAGC GAGAGGACTT GATCATAAGC CGGCATGATC TTGTTGAAGG ACGATGGTCG
AAAGAAGTCG ACTTGGACGG AAATCACGTC TTCCAATTTT GGAGAGCCGG TGAGATGATT
GCTTCTGGAG ACGCGCAGGT GCTCGACTTT CTTGACTCTC GAAGCGTTTG CGGTATTCAG
GATGACTACG AATTTTGGAA GGTCCACAGA CACCAACCTT GGGCCGGCAA GCCCGTTTGG
ATTCCCCGAA GCAAGGAAGT TCAGCATATT TTGCTGGACG ACAACATTCA CAATTTGTCT
CATGATAGCA TAGCTAGTAC CCGAGTGGAG CGAGAAGATG GTAGTTTCCG AACACTGTCC
GATGAAGAAA TCAGAGATCA GCAAGGTATC CACCTTGTGA GGGTACCCAC TGTTGCCCCA
ATTCTTCAGC CGACGTGGTT TCTCGAACAA ATAGATAGTG CCCAAAGACG GTTTGTAAGC
GAATGA
 
Protein sequence
MDSIAPIVSA LEDTEEPNSA SERLTADWKH RLVVHFDINE TILVGDDAGG DTREDCIHKI 
IAKSAYVKIP FGYAGGSYED NSNLEPTEWW NGLLIREEYD EKLALNRVPP LYTGWQWPPG
CCPYYRTAFK NRARTFVNHH GSLYKSTYLR VEELLPIPDS KPGNAFSVFA HMLPAFFETV
VKLSSRPQPY TLVFRTMGSD LEKIATAFNA FASGKHPNYP NFQREDLIIS RHDLVEGRWS
KEVDLDGNHV FQFWRAGEMI ASGDAQVLDF LDSRSVCGIQ DDYEFWKVHR HQPWAGKPVW
IPRSKEVQHI LLDDNIHNLS HDSIASTRVE REDGSFRTLS DEEIRDQQGI HLVRVPTVAP
ILQPTWFLEQ IDSAQRRFVS E
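
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch of how the gene sequence could be joined, its GC fraction computed, and its translation compared with the protein sequence. The Translation table field is blank in this record, so the standard genetic code (NCBI table 1) is assumed here; all function names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Assumption: standard genetic code (NCBI table 1); the record leaves the
# translation table blank. Codons are ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate from the first base in frame, stopping at the first stop codon."""
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
first_blocks = "ATGGATTCGA TAGCGCCAAT CGTATCAGCT"
print(translate(first_blocks))
```

For these first three blocks the script prints MDSIAPIVSA, which agrees with the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields.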