Gene Nmul_A0210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0210 
Symbol 
ID3784587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp222914 
End bp224416 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content57% 
IMG OID637810282 
Productglycine dehydrogenase subunit 2 
Protein accessionYP_410910 
Protein GI82701344 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.37883 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGATAT TTGAACATTC TCGCCCTGGC CGGCGCAATT ATTCCCAATC TCCGAAAGCT 
GCGGAAGCAA CGGATATTCC GGAGAAGCTG TTGCGCAAGA CCCTGCCGCT GCTGCCTGAA
GTGTCGGAGA TGGACGCAGT GCGCCACTAC ACCCGGCTGT CGCAAAAGAA TTTTTCGATA
GACACGCATT TTTATCCGTT GGGTTCCTGC ACGATGAAGT ACAACCCAAG GGCGTGCAAC
TCCCTCGCGA TGCTGCCCCA GTTTCTGGCG CGCCATCCCA GATCGCCGGA AAGCACCGGC
CAGGGATTTC TTGCGTGCAT GTACGAGCTG CAGGAAATTC TGAAAGATGT AACCGGTATG
GCCGGGGTAA GCCTTACGCC CATGGCGGGA GCACAGGGCG AGCTGATCGG CATCGCAATG
ATACGCGCCT ATCACGAGTC GCGGGGGGAC ACGGCCCGAA CCGAAATCAT CGTTCCCGAT
GCGGCGCATG GCACCAACCC TGCTACAGCG GTCATGTGCG GCTACAAAGT CGTCGAGATC
GCCACCGACA AGGAAGGTAA TGTGGACATG GCTGCCCTGA AAGCCGCGGT CGGCCCGAAA
ACGGCTGGGC TGATGCTGAC CAATCCTTCC ACGCTCGGTG TATTTGAAGA GAACGTCGCC
GAGATGAGCA GGATTGTCCA CCAGGCAGGC GGATTGCTCT ATTACGATGG CGCAAACCTG
AACGCCATAC TTGGTAAGGT CAAGCCAGGC GACATGGGCT TTGACGTCAT TCACATCAAT
CTTCACAAGA CGTTTTCCAC TCCGCACGGC GGTGGAGGCC CGGGCTCGGC CCCGGTGGGC
GTCGCACCGA GGCTGTTGCC GTTTATGCCT GTACCTATAG TGGCATTTGA AAACGGAACC
TATCGCTGGC AGACGGAGAA GGATATACCG CAATCGATCG GAAGGCTTTC GGCGCACATG
GGTAACGCAG GCGTTCTGTT GCGCGCCTAT GTGTACGTGC GTCTGCTGGG GGCGGAAGGC
ATGCATCGAG TGGCCGAGTT TGCGGCGCTT AACGCCAACT ACCTGATGGC TGAACTGCGA
AAGGCGGGTT TCGAGATCGC CTATCCCAAC CGCAGGGCCA GTCATGAATT TATCGTTACC
CTGAAGGACC TGAAGGAAAA AACTGGCGTC ACCGCGATGA ATCTCGCCAA GCGTCTGCTG
GATAAGGGTT ACCATGCTCC AACGACCTAT TTTCCGCTCC TCGTGCCGGA ATGCCTGCTG
ATTGAACCCG CTGAAACCGA ATCAAAGGAG ACGCTGGACG CCTTCGTAAC AGCAATGAAG
GAAATCCTGG AGGAAACCCG TACGCAGCCG GACCTGGTGA AAAGCGCGCC TCATACCACG
CCCGTCCGCC GGCTGGATGA CGTAAAGGCC GCACGCGAGC TGGATCTGGC CTGGAAAGCA
CCCACGCGCA ACATAACCAG GACTGAAACC CTCACCCCGA TCCCAACCGT AAGCGTAGCT
TAA
 
Protein sequence
MLIFEHSRPG RRNYSQSPKA AEATDIPEKL LRKTLPLLPE VSEMDAVRHY TRLSQKNFSI 
DTHFYPLGSC TMKYNPRACN SLAMLPQFLA RHPRSPESTG QGFLACMYEL QEILKDVTGM
AGVSLTPMAG AQGELIGIAM IRAYHESRGD TARTEIIVPD AAHGTNPATA VMCGYKVVEI
ATDKEGNVDM AALKAAVGPK TAGLMLTNPS TLGVFEENVA EMSRIVHQAG GLLYYDGANL
NAILGKVKPG DMGFDVIHIN LHKTFSTPHG GGGPGSAPVG VAPRLLPFMP VPIVAFENGT
YRWQTEKDIP QSIGRLSAHM GNAGVLLRAY VYVRLLGAEG MHRVAEFAAL NANYLMAELR
KAGFEIAYPN RRASHEFIVT LKDLKEKTGV TAMNLAKRLL DKGYHAPTTY FPLLVPECLL
IEPAETESKE TLDAFVTAMK EILEETRTQP DLVKSAPHTT PVRRLDDVKA ARELDLAWKA
PTRNITRTET LTPIPTVSVA