Gene Nmul_A1667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1667 
Symbol 
ID3785654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1904464 
End bp1905621 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content56% 
IMG OID637811753 
Producthydrogenase formation HypD protein 
Protein accessionYP_412357 
Protein GI82702791 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0409] Hydrogenase maturation factor 
TIGRFAM ID[TIGR00075] hydrogenase expression/formation protein HypD 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTTA CCGCTCAGGA GTGGCTAAAA AAAATCCATG AGCTGCCACT CGATCGCCCT 
GTACGTATCA TGAACGTATG CGGAGGCCAT GAGCGTTCGA TTACCATGGC AGGAATCCGA
AGCGCCCTGC CGGAATGTGT CGAACTCATC GCCGGGCCAG GGTGCCCGGT CTGTGTCTGC
CCTGAAGAAG ATGTATACCA GGCGATTCAG CTCGCGCTGC GCACCGATAT GATTCTGGTC
ACTTTCGGTG ATATGCTGCG CGTGCCCGTG AACGTGCCGA AGAAAGAAGT GCGATCCCTG
GAGCAGGCCA AGGCAGCAGG AGCGGATGTG CGTCCCATTG CCAGTCCGCG CGAAGCAGTC
AGGATAGCTC AGCAAAACGC GAAACGACAA GTCGTTTTCT TTGCTGCGGG TTTTGAGACC
ACCACAGCCC CGGTAGCAGC CATGCTGCTG GAGGGGGTAC CGGACAATTT ATCCATCCTG
TTGTCAGCGC GGCGCACATG GCCTGCGGTC GCAATGCTGC TTGATTCTGA TGCACCAGGC
TTCGATGGGT TGGTGGCGCC CGGTCATGTT TCCACTGTCA TGGGGCCGGA GGAGTGGAAT
TTCGTCTTCG AAAAGCATGA CATTCCCACT GCCGTTGCCG GCTTCCAGCC CGTGTCACTG
CTAGCCGCCA TGTATTCCGT ATTACGCCAA CTGCTTGAAG GGAAGCGTTT TCTGGATAAT
TGTTATCCTG AGTTAGTGCG GCCCGGGGGA AATCGAGCCG CACAGGCGCA ACTCGCGGAA
GCATTGAATG ACACGGATGC CAACTGGCGC GGCATTGGTG TTATCCCATC TTCCGGTTTC
AGTCTCCAAA AGCGCTTCGC AAAGAACGAT GCGCGACTTC AATTCCCCGA TTTCGATACA
GAGAACCGCA AGCGCGCTGG CCAGATGCCG CCCGGTTGCG AATGCGCGAG CGTAGTCCTT
GGAAGAATAA ATCCAAACCA GTGCAAGATT TATGGCCATG CCTGCACACC GAAAACACCT
GTGGGCCCGT GCATGGTGTC GGACGAAGGT GCTTGCCGCA TCTGGTGGGC AGCAGGCGTA
CGGGAGAACA CGGCCACTGG AGTGAAGACG GTGGCAGATA GCTCTTCTAT TCCGGTGTTA
CCTGAGAAGC CAGAATAA
 
Protein sequence
MTLTAQEWLK KIHELPLDRP VRIMNVCGGH ERSITMAGIR SALPECVELI AGPGCPVCVC 
PEEDVYQAIQ LALRTDMILV TFGDMLRVPV NVPKKEVRSL EQAKAAGADV RPIASPREAV
RIAQQNAKRQ VVFFAAGFET TTAPVAAMLL EGVPDNLSIL LSARRTWPAV AMLLDSDAPG
FDGLVAPGHV STVMGPEEWN FVFEKHDIPT AVAGFQPVSL LAAMYSVLRQ LLEGKRFLDN
CYPELVRPGG NRAAQAQLAE ALNDTDANWR GIGVIPSSGF SLQKRFAKND ARLQFPDFDT
ENRKRAGQMP PGCECASVVL GRINPNQCKI YGHACTPKTP VGPCMVSDEG ACRIWWAAGV
RENTATGVKT VADSSSIPVL PEKPE