Gene Nmul_A0916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0916 
Symbol 
ID3786461 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1042708 
End bp1043715 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content57% 
IMG OID637810998 
ProductAraC family transcriptional regulator 
Protein accessionYP_411611 
Protein GI82702045 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATATTC ATATTCTTGC GCTCGACCAG GTTTTCGACA CCGGATTATC AACGCTCCTG 
GACACCCTGA GTATCGCGAA CGATCTCGCG GTTTCTGCCA ACGCAGTGAC ACGATTTGAC
CTGACGATCG CGGGGGTGCG CCGGAATATT CGTACCAGTC AGGGATTTTC TGTACCGGTA
GTGCCAGCGG CGCGATGCAG CCCGCCGGAT GTGGTATTGA TTCCGGCGCT TGGGGCAAAA
ATGCCGGAAA CGCTGCGGTT GGCGCTTGAG CGGCCGGATG TGTGCGAGGC GGGCGACCTT
TTGCGGCAGT GGTCCAAAGA GGACGTTCTT ATCGGCGCCG CCTGCACTGG AACCTTCGTT
CTCGCCGATA CTTTGCTTCT CAATGACCGG AGCGCTACCA CATCATGGTG GCTTAGCCCC
TTGTTTCGGG AACGTTATCC CCGCGTGCGC CTGGAGGAAT CGCGCATGGT GGTAAGCTCG
CCCGGGTTGG TTACTGCGGG TGCTGCACTG GCACATATCG ATCTGGCGCT TTGGCTCATA
CGCCAAAGCA GTCCCACGCT CGCAGAAATG ACAGCGCGTT ATCTGCTGAT AGAACCACGA
GCGTCACAGG CAGTTTTTGC AATTCCTGAT CACCTTGCAC ATGCCGATCC ACTGGTTCAG
CAATTCGAAC GCTGGGCTCG CCACAGGCTG GGTGAACGTT TCTCCCTGAG CGAAGCAGCC
AGTGCGACAG GCACAAGCGA GAGAACGCTT TCGCGGCGGC TAAAGGCTGT TCTGGGAAAA
TCCCCGCTTT CTTATTTTCA GGATCTTCGT ATTGAGCGCG CTGTATATCT CCTGGGGACG
AGCAACGATA ATGTAGACGC GATTGCTGCC CAGGTGGGTT ATGCGGATGG TACAACCTTG
CGCACCCTTC TTCGCCGCAG GGTCGGTCGA ACGGTGAGCG AGCTTCGAGC CAGAACCCGG
GAGATTTCCA GTTCGTTCAA CGACTCTCAA GCACAGGATA TCGAGTGA
 
Protein sequence
MHIHILALDQ VFDTGLSTLL DTLSIANDLA VSANAVTRFD LTIAGVRRNI RTSQGFSVPV 
VPAARCSPPD VVLIPALGAK MPETLRLALE RPDVCEAGDL LRQWSKEDVL IGAACTGTFV
LADTLLLNDR SATTSWWLSP LFRERYPRVR LEESRMVVSS PGLVTAGAAL AHIDLALWLI
RQSSPTLAEM TARYLLIEPR ASQAVFAIPD HLAHADPLVQ QFERWARHRL GERFSLSEAA
SATGTSERTL SRRLKAVLGK SPLSYFQDLR IERAVYLLGT SNDNVDAIAA QVGYADGTTL
RTLLRRRVGR TVSELRARTR EISSSFNDSQ AQDIE