Gene Nmul_A1927 details


Gene Information

Locus tag: Nmul_A1927
Symbol:
ID: 3784223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2216478
End bp: 2218331
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 56%
IMG OID: 637812013
Product: hypothetical protein
Protein accession: YP_412614
Protein GI: 82703048
COG category: [E] Amino acid transport and metabolism
              [S] Function unknown
COG ID: [COG0665] Glycine/D-amino acid oxidases (deaminating)
        [COG4121] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03197] tRNA U-34 5-methylaminomethyl-2-thiouridine biosynthesis protein MnmC, C-terminal domain
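
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count, assuming the CDS includes exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2216478, 2218331)
print(length)                  # 1854
print(protein_length(length))  # 617
```

Under these assumptions both values agree with the record: 1854 bp and 617 aa.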


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.224364
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTTGACT GGCAAAACGG ACAACTGTAT TCCACTCGCT TCGGTGACGT CTACTTCTCA 
AGAGACTCGG GACTGGAAGA AAAACAGTAC GTCTTCCTGC AGGGCAACCG GCTTGCGGAC
CGTTTTGAGT CCTTGCAGCC TGATACCGCG TTTTCCATTG GAGAAACGGG ATTCGGCACA
GGATTGAGCT TTTTATGCAC TTGGCGGCTA TTCATCCAGA TTGCACCCCT TCGGACCAGT
CTTGATTTTT TCAGTGTTGA AAAATATCCG CTTGATGAAA AAGAACTGAG CGCAGCGCTC
GCGCTTTGGC CCGAACTGGG CCCATACGCC GATGAACTTA TGCTGCGCTG GCAGCGGCGT
GTACCCGGAT GGAATCGGTG GAGCTTCGCC GGAGGAAGAG TGCGTCTCAC GCTGGCAATA
GAGGACGTGA CCCGGGCGCT GCCTGAAACG CACGGTATCG ATGCATGGTT TCTTGACGGC
TTTTCACCGG CGCGAAACCC GGAAATGTGG ACACTCCAGA TTTTTCACTG GATTGCGCGG
GCGTCGCGAG CAGGCGCAAC CTTTGCGACC TATACCAGTG CCGGCGTTGT TCGTCGCGGT
TTGGAACAAG CAGGGTTTCA GGTCAAAAAA ATATCCGGCT TTGGCCATAA GCGTGAAATG
CTGCAAGGTG ACCTTCCTGG CCCCCCTCCC GTTCGACTGG CTCCCACCAC CGCGATCGTT
ATCGGAGGGG GAATAGCAGG GTGCGCCGCT GCTTCGGCGC TGGCCAGTCG TGGACTTATA
GTTGAACTTC TGGAATCACA CACCCTTGGC GCGGGTGCGT CAGGCAACCC GATTGGTATA
CTGCACGCCC GCCTGAGTGC AGGAATGAAC GCCCTGCACC GCTTTGTGCT GGCATCCTAC
GGACATGCGC TCGCCTTGCT TGACGAAAAA ATACCCGTCG ATGGCGTCAT GCGGAGTGAA
TGCGGAGAAC TGCAGCTGTC ATTCTCCGCC GAAGAAGCAA GACGAATCGG GAAGCTTGCG
ACCCTCGACT GGCCCGCGCA TGTTTTCCGA CCAGTAGATG CGGCTGAAGC ATCGGCCCTT
GCGGGAATTG AGCTTTCATA TGGTGGCCTT TGGTTTCCCG GTAGCGGTTG GCTTGCTCCG
CCTCAACTTT GTGTAGCCTT GCTTGGCAGT CAGGCTATCA CCCTGTATAC CGGTCGCACG
GTAAAATCAC TTACCCCAAC GAGTCACGGG TGGCGTGTGC AAGCGGAAGA TCAGAGGAAG
CAAGCGTGGT CTCTGGAGGC CGAGATAGTT GTGGTTTGCA CCGGATATCA GGTGAAATCG
CTTCCAGCAT TGGCAAATCT GCCGCTAACC CCGGTACGGG GACAGCTTAC CTTGATCCCT
GCAACAACCG CAAGCCAGAA TCTCCGCACC ATCGTATGCG GGAGTGGCTA TTTCTCCCCT
GCTGTTGCAG GACGACATAT GGTGGGAGCA ACCCATCGTT TTAACGATAC ATCGATTAAC
CTGAATGTAT CGGAGCATGC GGAAAACTTA TCCAGACTGC GAGAAATTTC TCCTGTCCTC
CGCAGGTTGA GTGACGAGGT AAGTCAAGAT ATCAGGCAGC TTGAGCAATT GGATGGACGC
ACATCTATCA GGGGGTCTGT TCCAGGCGCC ATGCCGCTCG TCGGCGAACT TTTGCCCGGA
CTGTATACCA GCCTCGGCCA TGGAACGCGT GGACTGATTA CCGCGGGAAT TTCAGCCGAA
TTGGTCGCGG CAACCGCCTG CGGGCAACTG CTGCCGTTGC CATTGTCCGT TGTCAATGCG
CTCTCGCCTG TCCGAAGAGC TTCTCCCGCT ATTCCGGTTT CAATCAAGGG ATAG
 
Protein sequence
MLDWQNGQLY STRFGDVYFS RDSGLEEKQY VFLQGNRLAD RFESLQPDTA FSIGETGFGT 
GLSFLCTWRL FIQIAPLRTS LDFFSVEKYP LDEKELSAAL ALWPELGPYA DELMLRWQRR
VPGWNRWSFA GGRVRLTLAI EDVTRALPET HGIDAWFLDG FSPARNPEMW TLQIFHWIAR
ASRAGATFAT YTSAGVVRRG LEQAGFQVKK ISGFGHKREM LQGDLPGPPP VRLAPTTAIV
IGGGIAGCAA ASALASRGLI VELLESHTLG AGASGNPIGI LHARLSAGMN ALHRFVLASY
GHALALLDEK IPVDGVMRSE CGELQLSFSA EEARRIGKLA TLDWPAHVFR PVDAAEASAL
AGIELSYGGL WFPGSGWLAP PQLCVALLGS QAITLYTGRT VKSLTPTSHG WRVQAEDQRK
QAWSLEAEIV VVCTGYQVKS LPALANLPLT PVRGQLTLIP ATTASQNLRT IVCGSGYFSP
AVAGRHMVGA THRFNDTSIN LNVSEHAENL SRLREISPVL RRLSDEVSQD IRQLEQLDGR
TSIRGSVPGA MPLVGELLPG LYTSLGHGTR GLITAGISAE LVAATACGQL LPLPLSVVNA
LSPVRRASPA IPVSIKG
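
The sequences above are printed in blocks of 10 characters separated by spaces and line breaks. A minimal sketch of how such a listing could be joined into one string and its GC fraction computed is shown below; the helper names are hypothetical, and the short input is a toy example rather than the full gene sequence.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same layout as the listing (not the full gene).
toy = "ATGC TTGA\nGG"
seq = join_blocks(toy)
print(seq)               # ATGCTTGAGG
print(gc_fraction(seq))  # 0.5
```

Applied to the full gene sequence, len(join_blocks(...)) would be expected to match the Gene Length field, and gc_fraction the reported GC content up to rounding.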