Gene Nmul_A1993 details


Gene Information

Locus tag: Nmul_A1993
Symbol:
ID: 3785017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2289280
End bp: 2290806
Gene Length: 1527 bp
Protein Length: 508 aa
Translation table: 11
GC content: 56%
IMG OID: 637812082
Product: hypothetical protein
Protein accession: YP_412680
Protein GI: 82703114
COG category: [R] General function prediction only
COG ID: [COG0433] Predicted ATPase
TIGRFAM ID:
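
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon that does not count toward the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2289280
end_bp = 2290806
reported_gene_length = 1527      # bp
reported_protein_length = 508    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1527 0 508
```

Under those assumptions the computed values agree with the reported Gene Length and Protein Length.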


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACCAC CGCTGCTGAT CGCGCAATCC CACCATCCCC TTGGTTTATT GCCCGGCTTC 
GCCAACCGGC ACGGCCTGAT CACGGGTGCG ACCGGCACAG GCAAGACCGT TACGCTTCAG
GTGCTGGCAG AACGCTTCTC GTCGATCGGC GTGCCCGTAT TCATGGCGGA CATAAAGGGC
GATCTGGCAG GTCTCTCCTT TCCTGGCGCC GATTCAGCCA AAATGAAGGA GCGGCTTGCG
CAACTCCACC TTCCGGAACC TGAATGGACT GCATATCCGG TTACGTTTTG GGATATTTAC
GGAAAAGCCG GACATCCGGT GCGCACCACC ATATCCGACA TGGGCCCGCT GCTGTTGGGC
CGGTTATTCA ATCTGAACGA AACGCAGCAG GGTGTGCTGA CACTGGTATT CAAAATTGCG
GATGATAACG GACTTCTGCT GCTTGACGTC AAGGATCTGC GCGCGCTGCT GCAGTTTGTC
GGCGACAAGG CGAAGGACTT CACGGTGCAG TATGGTAACA TTTCCGCCGC CTCGATCGGT
GCCATTCAGC GTTCGCTTCT GCAGATCGAG CAGCAGGGGG GGGATATCCT GTTCGGCGAA
CCGATGCTCG ATATTCATGA CCTGATGCAG GTGGACAGCA GAGGGCGGGG GATGATAAAC
ATTCTCGCAG CCGACAAACT GCTGAACGCT CCCAAGTTGT ATTCCACCTT TTTGCTGTGG
ATACTGTCAG AGCTGTTCGA ACATCTTCCT GAAATAGGCG ATCCCGAGAA ACCGAAATTG
ATATTTTTCT TTGATGAAGC CCATCTTTTA TTCAATGATG CGCCCCGTCC TCTTCTGGAA
AAAATAGAGC AGGTCGTGAG GTTGATACGA TCCAAGGGAG TGGGCGTGTA TTTTGTAACA
CAAAATCCCC TCGATGTTCC TGAAACTGTT CTGGGACAGC TCGGCAATCG CGTGCAGCAT
GCCCTGCGCG CGTTTACGCC GCGCGATCAG AAGGCGGTGC GGTCGGCTGC GCAAACCATG
CGGGCGAATC CCGGTCTGGA TACCGAAAAG GTGATCAGTG AGCTTTCGGT GGGTGAAGCG
CTCGTTTCGT TGCTGGACGA GAAAGGCCGT CCCGCCATGG TTGAGCGGGC GTTTATTTTG
CCGCCTCAAT CGAGGATAGG ACCGGTCACC GATGCGGAAC GCGCAGGTAT CATCAAATCC
TCGATGGTTT TCGGCCATTA CGAGCAACAG GTGGACCGTG AGTCCGCATA CGAAATACTC
GGAAGTCGGG CTCCGATGAC GCAGAATACC GGGAATGAGG AAGCTCCCGT CCGTGCCAGG
GTAAAAAAGA CCGAATCCGA AGAGGGTATG ATGGAGGTAC TGGGTGATAT GCTGCTCGGC
AAAACCGGCC CGCGCGGTGG ATATAAACCT GGCATTCTCG ATACCGCTGC ACGCAGCGCG
GCGCGCTCCA TCGGTTCGCG TGCCGGCCGT GAAATCTTCC GCGGGATTCT GGGAGGTATG
TTCGGCGGCA GTAACCGCAG ACGTTGA
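
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As a minimal sketch, a GC percentage like the one in the GC content field could be computed as follows. The function name `gc_percent` is hypothetical, and the rounding the database applies to its reported value is not known, so the example is run on a short toy string and not on the full sequence.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Toy input: 6 of the 8 bases are G or C.
print(gc_percent("ATGC GGCC"))  # prints: 75.0
```

Passing the full gene sequence text above to the same function would give a value to compare against the reported GC content.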
 
Protein sequence
MAPPLLIAQS HHPLGLLPGF ANRHGLITGA TGTGKTVTLQ VLAERFSSIG VPVFMADIKG 
DLAGLSFPGA DSAKMKERLA QLHLPEPEWT AYPVTFWDIY GKAGHPVRTT ISDMGPLLLG
RLFNLNETQQ GVLTLVFKIA DDNGLLLLDV KDLRALLQFV GDKAKDFTVQ YGNISAASIG
AIQRSLLQIE QQGGDILFGE PMLDIHDLMQ VDSRGRGMIN ILAADKLLNA PKLYSTFLLW
ILSELFEHLP EIGDPEKPKL IFFFDEAHLL FNDAPRPLLE KIEQVVRLIR SKGVGVYFVT
QNPLDVPETV LGQLGNRVQH ALRAFTPRDQ KAVRSAAQTM RANPGLDTEK VISELSVGEA
LVSLLDEKGR PAMVERAFIL PPQSRIGPVT DAERAGIIKS SMVFGHYEQQ VDRESAYEIL
GSRAPMTQNT GNEEAPVRAR VKKTESEEGM MEVLGDMLLG KTGPRGGYKP GILDTAARSA
ARSIGSRAGR EIFRGILGGM FGGSNRRR
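
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates that relationship under stated assumptions and is not the tool that produced this record. It builds the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which is not modeled here). The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Standard amino-acid assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence in frame 0, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()  # drop the block spacing
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGCACCAC CGCTGCTGAT CGCGCAATCC"))  # prints: MAPPLLIAQS
# Last 18 bases of the gene sequence above, ending in the TGA stop codon.
print(translate("AGTAACCGCAGACGTTGA"))  # prints: SNRRR
```

The two outputs match the first ten and last five residues of the protein sequence listed above.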