Gene PHATRDRAFT_14962 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_14962 
Symbol 
ID7203619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp481051 
End bp482253 
Gene Length1203 bp 
Protein Length400 aa 
Translation table 
GC content50% 
IMG OID 
Productcystathionine beta-lyase 
Protein accessionXP_002182846 
Protein GI219125142 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTACTT CTCTGGCACA CGCAGGTGTG GTTACTGGAA AGAATGCGGC CATGTCTCCT 
CCGTTGCACA TGGCAACAAC TTATACCAGG CCAGCCGATG GATCTTATCA AGAAGGAGAC
TTGATTTACA CACGTATGGA CAACCCTACA AGAAATCTAC TTGAGGCGGA GACTGGCAGG
CTTGAATGTC ACGGTCGAGC TGTGAACTCC GACACGCCAA TCATAAGTTG TGCATTTGCG
TCGGGTATGA TGGCCGTGTC TTCGATCGTT CTCGCCCACC GGTTACCATT GAAGGTTTTG
CTGCCAGTCG ATGCCTACCA TGGTGTTCCG ACTGTCCTTC GAGACGTCTT TTCGCGTTTC
GATGTAGAAA TCCGTACTGT CGAAATGAGT GATCCAGCGG CCATCGAAGC TGACCTGGCA
AAAATATCGG TAAAAGACGA TGTCATTGTA TGGATGGAGA GCCCTTCGAA CCCAAGAGTT
GACATTATTG ACATTTCTTT AATAAGCAGC ATCGCAGAAA AATCGGGCCG TCGCGTCACT
ACTGTGGTCG ATTCCACCCT CGCTCCTCCA ACGATTCAGC AGCCTCTCCA GCTTGGTGCG
GATTTGGTTA TGCATTCGGC GACAAAGTAC CTCGGTGGAC ATTCAGATCT ACTCTGTGGT
GTCGTGACAG CGTCTCCATG GACTAATCGT GGTCGTTTCA TTGGGCCACT TATACGGCAG
GTGCAAGTCG CTGTCGGAGG TGTGGCCTCT CCACTGGATT CATGGCTCAC GCTGCGTGGT
CTAAGAACCT TGGCTATCCG TAGCAGTCGC CAATGCGAAA CTGCTCTCCT TCTTGTCAAA
TATCTACAGC ACCATCCATT GGTAGACAAG GTCTATTATC CTGGACTGGA AGAACACTTT
GGCCACAAAA TTGCTAAACG TCAAATGAAG AATGGATTTG GAGGTGTTTT CAGTGTTGAA
ATGATCGGCG AGAGCTATGC GTTTGCGTTT GCGGCGGCCC TGACAGTCGT TCAACGAGCT
ACCAGCCTCG GCGGGACTGA AACTCTAATT GAACATCGGG CGAGTATAGA GCCACCTGGC
CGCGTAGTTA GTCCACGGGG ACTACTGAGG GTCAGCGTAG GCCTGGAACA CGCATCTGAT
ATTTTGTCTG ACTTTGAAAG CGCCATGGAC ATTGTTCAAA CGATTCATGG TATTCGTGGC
TAA
 
Protein sequence
MATSLAHAGV VTGKNAAMSP PLHMATTYTR PADGSYQEGD LIYTRMDNPT RNLLEAETGR 
LECHGRAVNS DTPIISCAFA SGMMAVSSIV LAHRLPLKVL LPVDAYHGVP TVLRDVFSRF
DVEIRTVEMS DPAAIEADLA KISVKDDVIV WMESPSNPRV DIIDISLISS IAEKSGRRVT
TVVDSTLAPP TIQQPLQLGA DLVMHSATKY LGGHSDLLCG VVTASPWTNR GRFIGPLIRQ
VQVAVGGVAS PLDSWLTLRG LRTLAIRSSR QCETALLLVK YLQHHPLVDK VYYPGLEEHF
GHKIAKRQMK NGFGGVFSVE MIGESYAFAF AAALTVVQRA TSLGGTETLI EHRASIEPPG
RVVSPRGLLR VSVGLEHASD ILSDFESAMD IVQTIHGIRG