Gene PHATRDRAFT_31599 details


Gene Information

Locus tag: PHATRDRAFT_31599
Symbol: SSDH
ID: 7195958
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 487840
End bp: 489255
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table:
GC content: 52%
IMG OID:
Product: succinate semialdehyde dehydrogenase
Protein accession: XP_002177097
Protein GI: 219110691
COG category:
COG ID:
TIGRFAM ID:
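
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends with a single stop codon. The function names are hypothetical and are not part of the database.

```python
def span_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp):
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


gene_length = span_length(487840, 489255)
print(gene_length)
print(expected_protein_length(gene_length))
```

With this record's values, 489255 - 487840 + 1 = 1416 bp, and 1416 / 3 - 1 = 471 aa, which agrees with the Gene Length and Protein Length fields.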


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTACAGG GAGTAGCCGT TGTGGACGGA AGCATCATCA ACAAGAACCC GGCTACTGGT 
GAAGTTATCA GTCGGGTTCC GTGCACACCA CTCGATGAGC TGGATGTCAT GATTCGCGAG
GCCAAATTGG CGCAAAAGTC CTGGGCGACG ACTCCCGTGG CGAAACGTAT CCAACTATTG
CGGGATGGGC TGCAGGCAAT CGAAGCCAAG TCGGGACAGC TTGCTCGGTG TATTACGACC
GAAATGGGAA AGCCCATTGC CGAAGCGCGG GAAGAAGTCG AGTTTGCCGT CGGCAAAGCC
GAGTATTTGA CATTGTTGGA AGCTTCTTTA CAACCGCAAC AGCGCGGATC CTGTACGGTG
ATCCGCCAAT CCCTTGGTGT GGTGACCGTC CTGAGCCCTT GGAACTTTCC CGCCGACGAA
ATTTTGCTAC TGCTTTTGCC GGCGCTGGGC TCGGGCAATA CCGCGATTGT CAAACCTTCC
GAAGTCTCCC CGGAAACGGG TGCGATTGTC GTGAACTGCT TAGCACAGTT CTTACCAGAA
AACGTATTGC AATTGGCCCA AGGAGATGCA CAAGTGGGTG CACACTTGGT TTCGCACAAA
GACGTAGATA TGGTGGCCAT GACGGGAAGT AGTGCGACGG GCCAGAAGAT TTTGACTGCG
GCAGCTCCGC ATCTGAAACG ATTCGTTTTG GAAATGGGCG GAAAAGATCC CATGGTCGTG
TTTGATGACG CTGATTTGAA TCAAGCTGCA AAAGATGCCG TCTCTTACTC TTTATCCAAC
ACGGGACAAG TCTGTTGCTC CATTGAACGC ATTTATGTGG CACAGACCAT TTACCGCGAG
TTCCAGGAAC GCGTTACTCG CTGTGCTGCG GAGTACCACG TGGGCAACGG CATGGACGAA
AACGTCAATG TTGGTCCCAT GGTATCCATT CGACAACGCG ATCATGTAAA GCGACACGTC
GAAGATGTCA TTGCCAAAGG TGCCAAAGTC TTACACCAAA GTAAGATTCC GTCAACTGCC
AGTACAGAAT CCTCTTTCTT CCCCGTGACA GTTCTGGCGG ATGTGAAGGA GAACATGCAC
ATGTACCATG AAGAAACTTT TGGCCCCGTG GTCGCATTGA CTCCATTTGA CGGGTCCGAA
GAAGAAGCTA TTCGGCTGGC CAACGATACT GAATATGGAC TGGCGAGCTG CGTCTATACG
CAAGATATGG ACAGAGCACA GAGAGTAGCG AGCGCAATCG AGGCTGGGCA AATCGGTATC
AACTGCTACT CATTGGAAAA CATGGACGTG GCGTGCCCCT GGGTAGGCCA CAAGAAGTCT
GGTTTTGGGT ATCATTCCGG CCAGGAAGGG TTCCATCAGT TTTCTATTCC CAAAACGCTG
GTGTATGTTC CCAGTGCACA TAGTACGAAG GATTGA
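
The gene sequence is printed in space-separated blocks of 10 bases, so it must be joined before its length or base composition can be compared with the Gene Length and GC content fields. The following is a minimal sketch; the helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGTACAGG GAGTAGCCGT TGTGGACGGA AGCATCATCA ACAAGAACCC GGCTACTGGT"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

Passing the full sequence text instead of `first_line` gives values that can be checked against the 1416 bp length and the rounded GC percentage listed in the gene information.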
 
Protein sequence
MVQGVAVVDG SIINKNPATG EVISRVPCTP LDELDVMIRE AKLAQKSWAT TPVAKRIQLL 
RDGLQAIEAK SGQLARCITT EMGKPIAEAR EEVEFAVGKA EYLTLLEASL QPQQRGSCTV
IRQSLGVVTV LSPWNFPADE ILLLLLPALG SGNTAIVKPS EVSPETGAIV VNCLAQFLPE
NVLQLAQGDA QVGAHLVSHK DVDMVAMTGS SATGQKILTA AAPHLKRFVL EMGGKDPMVV
FDDADLNQAA KDAVSYSLSN TGQVCCSIER IYVAQTIYRE FQERVTRCAA EYHVGNGMDE
NVNVGPMVSI RQRDHVKRHV EDVIAKGAKV LHQSKIPSTA STESSFFPVT VLADVKENMH
MYHEETFGPV VALTPFDGSE EEAIRLANDT EYGLASCVYT QDMDRAQRVA SAIEAGQIGI
NCYSLENMDV ACPWVGHKKS GFGYHSGQEG FHQFSIPKTL VYVPSAHSTK D
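
The protein sequence can be compared against a translation of the gene sequence. The following is a minimal sketch that assumes the standard genetic code, because the Translation table field of this record is empty. The `translate` helper is hypothetical; it stops at the first stop codon.

```python
from itertools import product

# Standard genetic code (assumed), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds_text):
    """Translate a block-formatted CDS up to, but not including, the first stop."""
    seq = "".join(cds_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
print(translate("ATGGTACAGG GAGTAGCCGT TGTGGACGGA"))
```

The three blocks translate to MVQGVAVVDG, the first block of the protein sequence listed above. The final codons of the gene, GAT TGA, give D followed by a stop, which matches the last residue shown.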