Gene A9601_02801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_02801 
SymbolglyA 
ID4716965 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp257996 
End bp259267 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content36% 
IMG OID640077980 
Productserine hydroxymethyltransferase 
Protein accessionYP_001008675 
Protein GI123967817 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.632622 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATATCC TTCAAAATCT TAAAGAAAGT GATCCAGTAA TATCAAATTT TATCAACTCT 
GAAAAAAATA GGCAGGAAAC TCATCTTGAG TTAATCGCAA GCGAAAATTT CGCATCAATT
GCTGTTATGC AGGCTCAAGG TTCAGTCCTT ACAAATAAAT ACGCCGAGGG GTTACCTCAA
AAAAGATATT ACGGGGGATG TGAATTTGTT GATGAAATCG AAGAATTAGC TATTCAGAGA
GCGAAAAAAT TATTTAATGC AAATTGGGCT AATGTTCAAC CCCATAGTGG AGCACAGGCA
AATGCTGCTG TTTTCCTAAG TCTACTTAAA CCTGGCGACA CAATCATGGG GATGGATTTA
TCTCATGGTG GACACTTAAC ACATGGGTCT CCAGTAAATA TGAGTGGTAA GTGGTTCAAT
GCAGTTCACT ATGGTGTAAA TAAAGAAACT AGTGAATTAA ATTTTGATGA AATAAGAGAG
ATAGCACTTG AAAAAAAACC AAAATTGATC ATATGCGGAT ATTCTGCTTA TCCAAGAACA
ATCGATTTTG AATCGTTTAG AAATATTGCA GATGAAGTTG GGGCTTTTTT AATGGCTGAT
ATTGCACATA TTGCCGGTCT TGTAGCAAGT AAACTTCATC CAAATCCAAT ACCTCATTGT
GATGTAGTAA CTACAACTAC TCATAAAACA TTAAGAGGGC CTAGAGGGGG ACTTATCTTA
TGTAAAGATG CAGAATTTGG AAAGAAATTT GATAAATCTG TTTTTCCTGG CACTCAGGGC
GGGCCCCTCG AACATATAAT AGCCGCTAAA GCAGTCGCAT TTAGAGAAGC CTTACAGCCA
GATTTCGTTA ATTATTCCCA ACAAGTAATA AAAAATGCAA AAGTTCTAGC TTCAACTTTA
ATAAATAGAG GTATCAATAT CGTTAGTGGA GGCACTGATA ATCATATTGT TTTACTCGAT
TTAAGGAGTA TCAATATGAC TGGTAAAATT GCTGACTTGC TTGTAAGTGA AGTTAATATC
ACTGCAAATA AAAATACTGT TCCATTTGAT CCTGAATCAC CTTTTGTAAC CAGCGGACTA
AGGTTAGGAA CTGCTGCTTT AACTACTAGA GGCTTTAATG AGAATGCTTT TGCTGAAGTT
GGCGAAATTA TTGCTGATAG ATTACTTAAC CCAGACAATT CACTGATTGA AAGTCAATGT
AAAGAAAGAG TATTAACCTT ATGTAATCGT TTTCCTCTTT ATGAAGGCAA ACTTGAAGCA
TCAATTAAAT GA
 
Protein sequence
MNILQNLKES DPVISNFINS EKNRQETHLE LIASENFASI AVMQAQGSVL TNKYAEGLPQ 
KRYYGGCEFV DEIEELAIQR AKKLFNANWA NVQPHSGAQA NAAVFLSLLK PGDTIMGMDL
SHGGHLTHGS PVNMSGKWFN AVHYGVNKET SELNFDEIRE IALEKKPKLI ICGYSAYPRT
IDFESFRNIA DEVGAFLMAD IAHIAGLVAS KLHPNPIPHC DVVTTTTHKT LRGPRGGLIL
CKDAEFGKKF DKSVFPGTQG GPLEHIIAAK AVAFREALQP DFVNYSQQVI KNAKVLASTL
INRGINIVSG GTDNHIVLLD LRSINMTGKI ADLLVSEVNI TANKNTVPFD PESPFVTSGL
RLGTAALTTR GFNENAFAEV GEIIADRLLN PDNSLIESQC KERVLTLCNR FPLYEGKLEA
SIK