Gene PHATRDRAFT_31778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31778 
Symbol 
ID7196120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp902951 
End bp904036 
Gene Length1086 bp 
Protein Length361 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176679 
Protein GI219109852 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCCGCA AAAGTAACAA GTCGAGGAGA ACAACGAGAA AACCGGTAAA ATCCAGTTCT 
TCTTGTTGGA ATTTTCTACC CGACGCTGGA CCAGGTTGCC ACATTGCACG CCATGACACA
TTCTGTTCCT CGCCAACGTC TTGGGTCATT GAATGGGAAA CGGTGGACCC GGCCGACTTC
TCCGATGAAG ACAACGAAGT TCCAGATGTA GCTGTTGATC CTGAAGAAAG AACGCTTTCT
TTGTGTAATA TTCGAGACAA ACATGTGGTT GCCTATTTAA CGGTATTCGA CACAGTACTC
CGAGGAGCCG ATGGCAGAGA ACTAGAGAGG GGCAGAACGG TGAACAACCA AGGCGTCTCC
CAGTCGTGCA CTACTCTCAT TGTGCTATGC CCACCTTTTA CATTTGCTCA TCTCTGCTAT
TTAGATATCT CAAATGAGGG CGAAATGCTG GAAAACTTAA AAATTGAAAG CGACGTACAA
GAATGGAAAA AGCACCCCAA TCCATCGGAC ACGCACTTGA CATCGGTATC GTTTCCTTTC
CGTTTGGAAG GAGGCCCATA TTTATGTACG CAAGGAGAAG GTGGACAACT TACGCATTTT
TTTGCGGGCA ACCAACACGC CCTCGACTTT CGATGCCCGG TCGGAACGCC ACTATTGGCG
GTGGGACATG GCACGGTAAT TGATGTGAAG GACACGAATG CGAAAATTAC TGGGGTAGCC
GTGTCGAATT TGTTCGAGTG GAATTCTGTA CTACTCGAGC TTGACGGTTC ATCAAAAGAA
GGTAAAGGGG ATCCACTCTT TGTAGAATAT GTACACATTC AAGGTGCCTC CGTTCAAGTT
GGTGACAAGG TAAAACGCGG TCAGGTGATT GCCACTAGTG GAACCGTTGG ATTCAGCCCC
GAGCCGCACT TGCATTTTTG CGCATACCGG AGCCCCGACA TGAGCGCACC TACCGTCCGT
GTATATTTTC ATTCTACTAG AGATCCGCAG GAGACGTTCC TTCCACGAGC GGGGCAATAC
TACGATACCA ACGGCCTCGT TGAACAAGAC GGTGATACCG CAACGAACGA CGACGTCGGT
TCCTAG
 
Protein sequence
MGRKSNKSRR TTRKPVKSSS SCWNFLPDAG PGCHIARHDT FCSSPTSWVI EWETVDPADF 
SDEDNEVPDV AVDPEERTLS LCNIRDKHVV AYLTVFDTVL RGADGRELER GRTVNNQGVS
QSCTTLIVLC PPFTFAHLCY LDISNEGEML ENLKIESDVQ EWKKHPNPSD THLTSVSFPF
RLEGGPYLCT QGEGGQLTHF FAGNQHALDF RCPVGTPLLA VGHGTVIDVK DTNAKITGVA
VSNLFEWNSV LLELDGSSKE GKGDPLFVEY VHIQGASVQV GDKVKRGQVI ATSGTVGFSP
EPHLHFCAYR SPDMSAPTVR VYFHSTRDPQ ETFLPRAGQY YDTNGLVEQD GDTATNDDVG
S