Gene Nmul_A0004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0004 
SymbolglyA 
ID3786442 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp5730 
End bp6980 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content57% 
IMG OID637810072 
Productserine hydroxymethyltransferase 
Protein accessionYP_410705 
Protein GI82701139 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0112] Glycine/serine hydroxymethyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGTCCT CATACAATAC TCTCGAAACC GTCGACCCCG ATCTCTGGCA AGCCATCAAA 
GGCGAGATGC AGCGCCAGGA AGAATATATC GAGCTGATTG CTTCGGAGAA TTATGCAAGT
CCTGCAGTGA TGCAAGCGCA GGGCTCGGTG CTGACTAACA AGTATGCCGA GGGTTATCCC
GGCAAGCGCT ACTACGGCGG CTGCGAGTAT GTGGATGTCG TGGAACAGCT GGCAATCGAC
CGGGTAAGGG CGCTGTTCGA CGCGGAGTAT GTCAACGTCC AGCCGCATTC GGGCTCACAG
GCCAACGCGG CAGTCTATCT GACCGCCTTG AAGCCGGGAG ATACTCTGCT GGGGATGTCG
CTCGCGCATG GCGGGCATCT GACACATGGC GCTTCGGTCA ATCTCAGCGG CAAGATTTTC
AATGCCGTTT CCTATGGCCT TCGTTCTGAT ACCGAAGAGC TGGACTATGA CGAGGTTGCG
CGCCTGGCGC ACGAGCATAA GCCCAAGCTG ATCGTGGCAG GGGCTTCGGC TTACTCGCTG
GTGATAGACT GGAAGCGCTT CCGTAAGATT GCCGACGATA TAGGGGCCTA TCTGTTCGTG
GATATGGCGC ATTATGCGGG ACTGGTTGCG GCAGGATACT ATCCCAATCC CGTCGGCATC
GCCGATTTCG TCACGAGCAC CACCCACAAG ACGCTGCGCG GTCCGCGGGG CGGGATCATC
ATGGCCAGGG CCGAGCATGA AAAAGCGCTC AATTCCGCTA TTTTCCCGCA AACCCAGGGG
GGGCCGTTGA TGCATGTCAT TGCCGCCAAG GCAGTAGCCT TCAAGGAAGC CGCCAGCCAG
GAGTTCAAGG ACTACCAGGA ACAGGTAATC GACAATGCGC GCGTAATGGC GAAAGTGTTG
CAGGAGCGCG GATTGCGCAT CGTCTCGGGG CGCACCGACT GCCACATGTT TCTGGTGGAC
CTCCGCCCCA AGTATATTAC CGGCAAGCAG GCCGCCGAAT CGCTGGAAGT GGCGCATATC
ACCGTCAACA AAAATGCGAT TCCCAACGAC CCGCAGAAAC CCTTCGTCAC CAGTGGGATT
CGAATCGGCT CTCCCGCCAT CACCACCCGC GGCTTTGCCG AATTCGAATC CGAACAACTG
GCTCATCTGA TAGCTGATGT ACTGGAAGCG CCGACCGATT CCTCGGTGCT CACGGAGGTT
GCACGCCAGG CAAAAGCGCT ATGTGCAAAA TTTCCGGTTT ACCAAGGGTA A
 
Protein sequence
MLSSYNTLET VDPDLWQAIK GEMQRQEEYI ELIASENYAS PAVMQAQGSV LTNKYAEGYP 
GKRYYGGCEY VDVVEQLAID RVRALFDAEY VNVQPHSGSQ ANAAVYLTAL KPGDTLLGMS
LAHGGHLTHG ASVNLSGKIF NAVSYGLRSD TEELDYDEVA RLAHEHKPKL IVAGASAYSL
VIDWKRFRKI ADDIGAYLFV DMAHYAGLVA AGYYPNPVGI ADFVTSTTHK TLRGPRGGII
MARAEHEKAL NSAIFPQTQG GPLMHVIAAK AVAFKEAASQ EFKDYQEQVI DNARVMAKVL
QERGLRIVSG RTDCHMFLVD LRPKYITGKQ AAESLEVAHI TVNKNAIPND PQKPFVTSGI
RIGSPAITTR GFAEFESEQL AHLIADVLEA PTDSSVLTEV ARQAKALCAK FPVYQG