Gene Nmul_A1079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1079 
SymbolsucC 
ID3784693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1245277 
End bp1246467 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content55% 
IMG OID637811163 
Productmalate--CoA ligase subunit beta 
Protein accessionYP_411774 
Protein GI82702208 
COG category[C] Energy production and conversion 
COG ID[COG0045] Succinyl-CoA synthetase, beta subunit 
TIGRFAM ID[TIGR01016] succinyl-CoA synthetase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00492419 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGATATAC ATGAGTATCA GGCAAAAGAA ATTTTAATGA GGTATGGGGT CAAGATAGCG 
GAAGGGGGGT TGGCATATAC GATAGAAGAA AGTGTGCAGC GTGCCAGGGA AATCGACGGC
AATGTATGGG TGGTGAAGGC GCAGATCCAT TCGGGGGGGC GGGGCAAGGC GGGCGGCATC
AAGGTATGCA GGACTCATGA CGAAGTCCGG GCAGCCTCTG AGGAGTTGCT GGGAAAGATT
CTGGTGACCC GTCAGACCGG AGCCGTGGGG AAGGTGTGTA CACGGGTATA TGTGGAAGCG
GGCACGCACA TTGCCAGGGA GATGTATCTC TGCTTTCTGA TAGACAGGAG TTCGGAGCGC
ATCGTCATGA TAGGCTCGGG GCAGGGTGGA ATGGAAATCG AGGAACTGGC TCACACAAAT
CCTCAGGCCA TCAAGAAGAT TTTTATCGAA CCCGCAGTTG GGTTGCAGGA TTTTCAAGCG
AGAGAGATGG CTTTTGCACT AGGGGTGGAA GCGGCACAAC TGCCTCATGC CGTTAAAACC
ATTCGGGGGT GTTACCGCGC CTTGCGTGAT CTGGATGCGA ACATGGTGGA AATCAACCCC
CTCGTGATCA CTGGGAGCGG CGAACTTCTT GCTCTTGACG CAAAAATGAG CTTCGACGAA
AACGCCTTGT TTCGCCGGCA CGAGGTTGCC GAATTGCGTG ATAAAACACA AGCCGATCCT
CGGGAGGTGG CAGCCTCCGA TCATGGCTTG AGCTACATCG GATTGAACGG TGACATCGGA
TGCATGATAA ACGGCGCCGG GCTTGCCATG GCAACGATGG ATATGATCAA GCTCGCGGGC
GGCGAGCCGG CAAATTTCCT TGATGTGGGA GGAGGGGCGT CCGCGGAGCG TACGGAAAAG
GCGTTTCGCC TGGTTTTGGC TGATAAAGGA GTCAAGGCGA TGCTGGTCAA TATTTTTGCA
GGTATTAATC GCTGCGACTG GATTGCGCAA GGCGTCGTGC AGGCGGTAAG AAATATCGAT
ATGAAAATCC CGCTGGTCGT GCGCTTGTCC GGTACAAATG TCGAGGAGGG CCAGCGGATC
ATTGCCGAAA GCGGTTTGCC GATCATCACA GCGGGAACGC TGGCGGAAGC AGCGGAGAAG
GTTGTCCAGG CGCGCAATGG CGCGGTTGCG GAAGAGTGCA AAGGGATATA A
 
Protein sequence
MDIHEYQAKE ILMRYGVKIA EGGLAYTIEE SVQRAREIDG NVWVVKAQIH SGGRGKAGGI 
KVCRTHDEVR AASEELLGKI LVTRQTGAVG KVCTRVYVEA GTHIAREMYL CFLIDRSSER
IVMIGSGQGG MEIEELAHTN PQAIKKIFIE PAVGLQDFQA REMAFALGVE AAQLPHAVKT
IRGCYRALRD LDANMVEINP LVITGSGELL ALDAKMSFDE NALFRRHEVA ELRDKTQADP
REVAASDHGL SYIGLNGDIG CMINGAGLAM ATMDMIKLAG GEPANFLDVG GGASAERTEK
AFRLVLADKG VKAMLVNIFA GINRCDWIAQ GVVQAVRNID MKIPLVVRLS GTNVEEGQRI
IAESGLPIIT AGTLAEAAEK VVQARNGAVA EECKGI