Gene PHATRDRAFT_31408 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31408 
Symbol 
ID7196616 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp26629 
End bp27789 
Gene Length1161 bp 
Protein Length386 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177000 
Protein GI219110497 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACAT CGCGCTATGG TTGTTGGTTT CTAGCGATTG TTGGATTGTC TACAGTCACC 
AACGCTTTAG TAGCGACCAG CCCTCCTCGC AAGCTGTCCG CACTTGCCGA TCCTACGTTT
GTCACTCCTG ATGTTTCTTT ATTAACCGAA CAGGCTTGCA TTGAAACAGC GGAACGAATG
CAACGAGTGG TTGTTCCGGT TTCTCCCTCC ATTCAATCGG ACGGTGCCGT AGGCGTTTCT
TACGTACACT GGATGGCATC CCCGAACGAG AAGTCTCCAC AACGCTTACC CGTTGTCTTG
ATTCACGGTT TTGATTCGTC TTGTTTAGAA TACCGTCGTT TTGGTTCCAA ACTTGCAGCA
CAAGGATTTG ATACGTACGC AGTGGACCTC CTCGGTTGGG GATTTACACA ACTCGAAAAC
GTGAACGATT TTTCTGCCAG TGCCAAGGTG GAAACCTTGA ACAGTTGGAT TTCAACAGTT
ATTGGTGAAA ACAAACCCTT CTGCATTGCT GGGGCCAGTC TAGGTGGTGC TGCCGCTATT
GAAGTCGCAG CCGGCAACGA AAACTGCCAA GCTCTGGTTT TGCTCGATGC CCAAGGCTTC
GTGGATGGCA TTGGTCCAAT GGCAGCCATG CCCAAGGCGA TTGCCAAACT TGGTGTGCAA
GTCTTGAAAA GTGTACCTTT GCGCAGTTCC GCCAATCAAA TGTCCTATTT CGATAAGATA
ACGTACGCAA CAGACGAAGC GGTCGTAGTC GGACGACTGC ACTGCACACG CGAAGGTTGG
TCTGACGCAC TCGTCAATTT CATGCAATCC GGTGGTTTCG CACCTTCCAC CAAAGTACCG
ACCATCACGG CACCAGCTCT CGTCTTATGG GGACGAGAGG ATGGTATCTT GGACGGAAAA
GAGTTTGCCA ACAGATTTTT AGAAACGCTG CCAGACGCCC GGTTGACATG GATAGAAGAA
TGCGGGCACG TGCCACACTT GGAAAAGCCG GAGGAAACTG CTACTGCCAT AAACGAATTC
TTGCGAGGCA TTCCTGTTCG CGGCAGCGCG ATTGTCGAAG AGTCCAGTAA GTCAGCGCTA
TCATCTCAAT GGATTGGCGT TGCAGGAATT GGAGCGACAG CTCTGGCGGC CTTGGCTGCA
GATTTTTTGC CGGTGCATTA G
 
Protein sequence
MRTSRYGCWF LAIVGLSTVT NALVATSPPR KLSALADPTF VTPDVSLLTE QACIETAERM 
QRVVVPVSPS IQSDGAVGVS YVHWMASPNE KSPQRLPVVL IHGFDSSCLE YRRFGSKLAA
QGFDTYAVDL LGWGFTQLEN VNDFSASAKV ETLNSWISTV IGENKPFCIA GASLGGAAAI
EVAAGNENCQ ALVLLDAQGF VDGIGPMAAM PKAIAKLGVQ VLKSVPLRSS ANQMSYFDKI
TYATDEAVVV GRLHCTREGW SDALVNFMQS GGFAPSTKVP TITAPALVLW GREDGILDGK
EFANRFLETL PDARLTWIEE CGHVPHLEKP EETATAINEF LRGIPVRGSA IVEESSKSAL
SSQWIGVAGI GATALAALAA DFLPVH