Gene Nmul_A2156 details


Gene Information

Locus tag: Nmul_A2156
Symbol:
ID: 3784396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2450361
End bp: 2451848
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 58%
IMG OID: 637812244
Product: tyrosinase/peptidase
Protein accession: YP_412841
Protein GI: 82703275
COG category:
COG ID:
TIGRFAM ID:
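
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information record above.
start_bp = 2450361
end_bp = 2451848
gene_length_bp = 1488
protein_length_aa = 495


def span_length(start, end):
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(start_bp, end_bp) == gene_length_bp)
    print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under these assumptions, both checks agree with the recorded Gene Length and Protein Length.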


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAACCC GAAAGAATCA ATCCACGTTG ACAGCCGCGG AGAAGGCGGC CTTTGTTGGG 
GCGGTAAAAG CACTCAAGGC AAATGGCTCC TATGATGTAT TCGTGGCCCA GCACCGTACC
GCCTTTCTTG CCGGCGTGAA CGATCCGGCA CATGGCGGTC CTGCCTTTCT GCCTTGGCAC
AGGGAGTATC TCCGCCGGTT CGAGCGTGCC CTTCAGCAGA TCGATCCCAG TGTTTCCATC
CCCTATTGGG ACTGGACAGT TGATCGCACG ACGAATGCTT CCATCTGGAA TGCGAATTTC
ATGGGTGGAA ATGGAACGGG CCCCGGCGGA CGCGTGATGA CGGGGCCGTT CGCCTTTTCC
ACGGGAGAGT GGACGCTTAC TGTTCTGGAC CCCGGTGACA CGGATAATTT TCTCACCCGT
GCCTTCGGCG CCATGGGAGC GTTGCCCACC CAACAGGGAG TGAATGCCGC CATCAATATC
GTGCCCTATG ATTCAGCGCC CTGGAATCGT AACAGCAGCA TGAATACGAG TTTTCGAAAC
CATCTCGAGG GGATTATCCA CAATCCCGGC CACATGTGGG TAGGCGGCTC GATGATGGCT
ATGTCCTCCC CCAACGATCC GGTGTTCTGG CTGCATCATT GCAATATCGA TCGGTTATGG
GCAGTATGGC AGCGGGAAAA TCCGGGGCAG AATTATCGTC CGCCGAGCGG CACGGCGGGC
GTGGTGAACG GCCATGGACT GGATGACCCG ATGCCGCCCT GGAACAACGA AGCTTCGCCG
CCTACGCCCC GGGATGTTCT CGATCACCAT GCGCTTGGCT ACACGTACGA TGACGAGGAA
GAAGAACCTC CGCAGATCGT ACCCCTGACC CTTGATGCGG CTCCGTTTGC CGCTTCCATA
GGCCAGGCGG GAGAAGTGGA CACATATAGC TTCGTTGCCT CAAGCCAGGG GAATTATATT
ATCGAAACCG AGGGTTCCAC CGATGTAGTG GCCGCCCTGT ATGGTCCGGA TGATGCCAAT
GCGCTCGTTG CCGAGGATGA CGACAGCGGC GTCGGCCGGA ATCCGCGCAT TGCACGAGAC
CTGGCGCCGG GAACATACTA TGTTCGCATA AGGCACTATA GCGGCTCATC CACTGGAAGC
TACCGTATTT CAGTACGAGG GTCAGGAGGC CCGCAGCCGG GTATCCAGAC CATTCAGATA
AATGGTCCGG CAGTGCAGGG CACACTCTCC GCCAATGAGA GGGATCTGTA CACCTTTACT
GTCAGCACGT CCGGCTCCCA TACGATAGAA ACCGCTGGTA GCACTGATTG CTTCCTCACG
TTATTCGGCC CCGACAGCCA GACTGCCGTC ATTGCCCAGG ATGACGACAG TGGCCCGGGA
ACCAATTCGC GCATCGTGCA AAACCTCGGG GCCGGTGTCT ATTATGTTCA GGTCAGGCAT
TACAGCCCGA CCGGTACAGG GGCGTATAGT GTTTCCGTCA GAACATGA
 
Protein sequence
MGTRKNQSTL TAAEKAAFVG AVKALKANGS YDVFVAQHRT AFLAGVNDPA HGGPAFLPWH 
REYLRRFERA LQQIDPSVSI PYWDWTVDRT TNASIWNANF MGGNGTGPGG RVMTGPFAFS
TGEWTLTVLD PGDTDNFLTR AFGAMGALPT QQGVNAAINI VPYDSAPWNR NSSMNTSFRN
HLEGIIHNPG HMWVGGSMMA MSSPNDPVFW LHHCNIDRLW AVWQRENPGQ NYRPPSGTAG
VVNGHGLDDP MPPWNNEASP PTPRDVLDHH ALGYTYDDEE EEPPQIVPLT LDAAPFAASI
GQAGEVDTYS FVASSQGNYI IETEGSTDVV AALYGPDDAN ALVAEDDDSG VGRNPRIARD
LAPGTYYVRI RHYSGSSTGS YRISVRGSGG PQPGIQTIQI NGPAVQGTLS ANERDLYTFT
VSTSGSHTIE TAGSTDCFLT LFGPDSQTAV IAQDDDSGPG TNSRIVQNLG AGVYYVQVRH
YSPTGTGAYS VSVRT
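
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce this record: it assumes the amino-acid assignments of translation table 11 match the standard code (table 11 is generally described as differing only in its permitted start codons), it does not handle alternative start codons, and the helper names `translate` and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the display blocks above."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence shown above.
    print(translate("ATGGGAACCC GAAAGAATCA ATCCACGTTG"))
```

Pasting the full gene sequence into `translate` and `gc_content` would allow a comparison with the protein sequence and the GC content field listed in the record.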