Gene NATL1_03361 details

Gene Information

Locus tag: NATL1_03361
Symbol: glyA
ID: 4779415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. NATL1A
Kingdom: Bacteria
Replicon accession: NC_008819
Strand:
Start bp: 310495
End bp: 311730
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 38%
IMG OID: 640083602
Product: serine hydroxymethyltransferase
Protein accession: YP_001014165
Protein GI: 124025049
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0112] Glycine/serine hydroxymethyltransferase
TIGRFAM ID:
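
The start, end, gene length and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; the record does not state either convention. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 310495
END_BP = 311730
REPORTED_GENE_LENGTH = 1236      # bp
REPORTED_PROTEIN_LENGTH = 411    # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Residue count for an unspliced CDS.

    Assumes the CDS length counts the terminal stop codon, which
    does not encode an amino acid.
    """
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print("gene length matches:", computed_gene == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both reported lengths agree with the coordinates: 311730 - 310495 + 1 = 1236 bp, and 1236 / 3 = 412 codons, of which one is the stop codon, leaving 411 residues.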


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.498926
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATGTG ATCCAAGTAT TGCGAAATTA ATAAACAATG AATTATCAAG ACAAGAAACT 
CATTTAGAGC TTATCGCAAG TGAGAATTTT GCCTCTAAGG CCGTAATGGA AGCCCAAGGA
TCAGTCCTAA CAAATAAATA TGCTGAAGGT CTCCCTAACA AACGCTATTA CGGAGGATGT
GAGTATATCG ACGGAATTGA GCAACTAGCA ATAGATAGAG CAAAAAACCT TTTTGGGGCC
AACTGGGCAA ACGTCCAACC TCACAGCGGA GCTCAAGCTA ACTTTGCAGT TTTCCTTAGC
CTTCTAAAGC CGGGGGACAC AATTATGGGA ATGGACTTAT CTCATGGAGG TCACCTCACT
CATGGTTCAC CTGTAAATGT AAGCGGCAAA TGGTTTAAAA CTTGCCATTA CGAAGTTGAT
AAAAAGACTG AAATGCTCGA TATGGATGCA ATAAGAAAAA AAGCAATTGA AAATCAACCT
AAATTGATTA TCTGTGGATT CTCTGCCTAT CCTCGAAAAA TTGACTTCAA AGCTTTCAGA
TCAATAGCTG ATGAGGTAAA TGCTTATTTA TTAGCTGATA TTGCTCATAT TGCTGGTTTA
GTAGCAAGTG GACTTCACCC AAGTCCAATC CCATATTGTG ATGTAGTTAC AACAACCACT
CACAAAACTC TTAGAGGGCC AAGGGGTGGA CTAATCCTCT CAAAAGATGA GGAGATAGGA
AAAAAACTTG ATAAAGCAGT ATTTCCTGGC ACCCAAGGAG GTCCTTTAGA ACATGTAATC
GCAGCCAAGG CTGTTGCATT CCAAGAAGCT TCTGCACCCG AATTCAAGAT TTATAGCCAA
AAAGTAATCT CAAATGCACA AGTTCTTTCT AATCAACTTC AAAAAAGAGG AATTTCAATT
GTAAGCAAAG GAACTGACAA TCATATAGTT CTTCTTGACC TTAGAAGCAT TGGTATGACA
GGTAAAGTTG CTGATCAATT AGTAAGTGAT ATTAAAATAA CCGCGAACAA AAACACTGTA
CCTTTTGACC CCGAGTCCCC ATTTGTTACT AGTGGCCTAA GGCTAGGTTC AGCAGCCCTT
ACGACTAGAG GTTTTAATGA ACAAGCCTTT GAAGATGTTG GTAATATCAT TGCAGATAGA
CTACTTAACC CTAACGATGA AGATATAAAG GAAAATTCAA TCAATAAAGT ATCTGAACTT
TGCAATAAGT TTCCTTTATA TAGTGAAAAC ATCTAA
 
Protein sequence
MKCDPSIAKL INNELSRQET HLELIASENF ASKAVMEAQG SVLTNKYAEG LPNKRYYGGC 
EYIDGIEQLA IDRAKNLFGA NWANVQPHSG AQANFAVFLS LLKPGDTIMG MDLSHGGHLT
HGSPVNVSGK WFKTCHYEVD KKTEMLDMDA IRKKAIENQP KLIICGFSAY PRKIDFKAFR
SIADEVNAYL LADIAHIAGL VASGLHPSPI PYCDVVTTTT HKTLRGPRGG LILSKDEEIG
KKLDKAVFPG TQGGPLEHVI AAKAVAFQEA SAPEFKIYSQ KVISNAQVLS NQLQKRGISI
VSKGTDNHIV LLDLRSIGMT GKVADQLVSD IKITANKNTV PFDPESPFVT SGLRLGSAAL
TTRGFNEQAF EDVGNIIADR LLNPNDEDIK ENSINKVSEL CNKFPLYSEN I