Gene PHATRDRAFT_40163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40163 
Symbol 
ID7195933 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011690 
Strand
Start bp293625 
End bp294816 
Gene Length1192 bp 
Protein Length371 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184106 
Protein GI219127779 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.258243 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGGTG CCTTTGATCT GCTTCACTAC GGACATATGA ATGCGTTTCG TCTTGGTCGT 
TCACTTGGAA CACACCTGGT GGTCGGAGTC AACTCGGACG AGTCTATCAG CCAATGCAAA
GGGCCTCCCC TCATGAACGA CGAGGAACGG ATGACCATGG TTAGTGCTTG CAAGTTTGTC
GACGAAATCT TGCCCAATTG TCCATACATT ATGAATCGCG AATATTTAGA CTACGTTATT
GAAACGTACA AGATCGATTA CGTCATTCAT GGTGACGACC CGTGCATCGT GGACGGTAAA
GATGTATATG CCGCCGCCAA GGAAGCCGGA AAGTACAGGG GAATTCCACG AACGGAAGGA
GTTTCCACTA CCGACATTGT CGGCCGTATG CTCCTCATGA CCAAGGAACA CCACTATCAC
AACGAGACTT CCTCGATCGA CGAACGAGAC GATGAGGTGC CAAAATCTCC TGGAAGTTCG
CGGGAGTGGC TCGGGCGACA ATCCAAATTT TTGACGACTA GTCGTATGCT GCAATTATTC
AGTGCCGACG TACAGGCACC CACACCACAC ATGCGGGTTG TTTACATCGA TGGAGCCTGG
GACTTATTTC ACCCTGGCCA CGTGGCGATC CTGAGAGCTG CTCGTGAAGT AAGAAAGCCT
GAGTGCTGTT TTTGTGTTGC CGAATCATCC CGTTACTAAC CGATGCCTTT TTTTGGAACC
GTAGCGTGGT GATTATCTAA TTGTCGGTAT TCACGGTGAT GCCACCGTCA ATCGCGTTCG
GGGAATGAAC TTGCCACTCA TGAATTTGCA TGAACGCGTA CTCAGTGTTT TGGGTTGCCG
ATTCGCTGAC GACGTTCTGA TTGACGCACC GTATGATGTC TCCATGGAAA TGATTGCCTC
ACTTAATATT TCGGAAGTCG TCGGTACCAA CGATCACGAC ATTGGTGAAT TTGAGATGAA
ATCACAGACG CATCGGTACC GGCATGCGGA ACAAGCTGGG TTATTGCATT TGATGGACAT
TCCGAGCAAA TTTAACATGG GACGAATTGT GGAACGCATC CAACGCAATC AGGAAGCCTA
CCAAGCCAAA TTTGAACGGA AAATGGCAGC AGAGCGAGAA TTCTATGAGC AGAAGCGCGC
CAGTGAATAC GATGCGGCCT TTCATGAAGG AAGAGTAACT TTTGTGAGCT AG
 
Protein sequence
MDGAFDLLHY GHMNAFRLGR SLGTHLVVGV NSDESISQCK GPPLMNDEER MTMVSACKFV 
DEILPNCPYI MNREYLDYVI ETYKIDYVIH GDDPCIVDGK DVYAAAKEAG KYRGIPRTEG
VSTTDIVGRM LLMTKEHHYH NETSSIDERD DEVPKSPGSS REWLGRQSKF LTTSRMLQLF
SADVQAPTPH MRVVYIDGAW DLFHPGHVAI LRAARERGDY LIVGIHGDAT VNRVRGMNLP
LMNLHERVLS VLGCRFADDV LIDAPYDVSM EMIASLNISE VVGTNDHDIG EFEMKSQTHR
YRHAEQAGLL HLMDIPSKFN MGRIVERIQR NQEAYQAKFE RKMAAEREFY EQKRASEYDA
AFHEGRVTFV S