Gene Nmul_A2379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A2379 
Symbol 
ID3784970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp2706455 
End bp2707552 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content57% 
IMG OID637812468 
Producthypothetical protein 
Protein accessionYP_413060 
Protein GI82703494 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.444977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATCA ATCTTCTGGA TTTTGATGCG AAAGGGCTTA CCGGTTTCTG TGCGGAAATC 
GGTGAGAAGC CGTTCCGCGC CCGTCAACTG CTGCGCTGGA TACACCGGAC CGGTGAAGCC
GATTTCGATG CCATGAGCGA CCTCGCGAAA GGATTGCGGG AAAAGCTGGC GGCTGCCGCC
GTGATAGAAC CGCCGAAAGT CATCAGCGAT CATACCGCTT CCGATGGTAC TCGCAAATGG
CTGCTGTCGG TCGGCGCGGG TAATGGAATC GAAACAGTCT ACATACCGGA AACGAGCCGG
GGGACGCTGT GTATTTCCAG CCAGGTAGGG TGTGCGCTCG CATGCGCATT TTGTTCCACC
GGGAGGCAAG GGTTCAACCG TAATCTGACG GTGGCGGAAA TCATCGGCCA ATTGTGGTGG
GCGAATAAAG CGCTGACGGA AACGTTCACG AGCGAGGCGG GACGCGAGCG TCCCATCACC
AATATAGTCA TGATGGGGAT GGGAGAGCCG CTGACGAATT TCGAAAATGT AGTGACCTCG
CTCGACCTGA TGCTGGACGA CAATGCTTAT GGCTTATCAC GGCGCCGGGT GACCGTGAGC
ACGTCGGGAA TAATCCCGGC CATGGACCGT CTCCGCGAGC GTTGCCCCGT CGCGCTGGCG
GTATCCCTGC ATGCTCCCAA CGACGCGTTG CGCGATCAAT TGGTGCCGAT CAACAGGAAA
TACCCGATCA GGGAACTGCT GGGCGCATGC GAGCGCTATC TTCAATCCGC ACCCCGAGAT
TTCATCACTT TTGAATATGT CATGCTGGAT GGCGTGAATG ACAGCGTGGC GCAAGCGCGT
GAATTGGTGC AACTGGTAAG GGACATTCCC TGCAAGTTAA ACCTGATTCC GTTCAATCCT
TTTCCTGATT CAGGTTTCAG GCGTTCTTCC GCAAACGCCG TATCCCGCTT TCGCGATGTG
TTGATGGAGG CAGGATTGGT GACTACGGTA CGCAAGACGC GGGGAGATGA TATTGCCGCG
GCCTGCGGCC AGCTCGCGGG AAAAGTCCTC GACAAGACAC GCCGCGTCCC CCGCAACATT
GCGGAAGCAG CTGGATGA
 
Protein sequence
MSINLLDFDA KGLTGFCAEI GEKPFRARQL LRWIHRTGEA DFDAMSDLAK GLREKLAAAA 
VIEPPKVISD HTASDGTRKW LLSVGAGNGI ETVYIPETSR GTLCISSQVG CALACAFCST
GRQGFNRNLT VAEIIGQLWW ANKALTETFT SEAGRERPIT NIVMMGMGEP LTNFENVVTS
LDLMLDDNAY GLSRRRVTVS TSGIIPAMDR LRERCPVALA VSLHAPNDAL RDQLVPINRK
YPIRELLGAC ERYLQSAPRD FITFEYVMLD GVNDSVAQAR ELVQLVRDIP CKLNLIPFNP
FPDSGFRRSS ANAVSRFRDV LMEAGLVTTV RKTRGDDIAA ACGQLAGKVL DKTRRVPRNI
AEAAG