Gene Nmul_A0515 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A0515 
Symbol 
ID3785644 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp583866 
End bp585746 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content56% 
IMG OID637810597 
Productdihydrolipoamide dehydrogenase 
Protein accessionYP_411215 
Protein GI82701649 
COG category[C] Energy production and conversion
[I] Lipid transport and metabolism 
COG ID[COG1249] Pyruvate/2-oxoglutarate dehydrogenase complex, dihydrolipoamide dehydrogenase (E3) component, and related enzymes
[COG4770] Acetyl/propionyl-CoA carboxylase, alpha subunit 
TIGRFAM ID[TIGR01350] dihydrolipoamide dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCAGC TTACCGAAAT CAAAGTACCG GATATTGGCG ACTTCAAGGA TGTCGCAGTG 
ATTGAAGTGC TGGTGAAACC CGGCGATACA GTGGAAAAGG AAACATCGCT TATCACCCTG
GAAACCGATA AGGCATCCAT TGAAGTTCCT TCACCCCAAA GTGGCATTGT GAAGGAACTT
AAAGTCAAGG TTGGCGACAA GGTTTCGGAA GGCTCGATCA TACTCATGCT GGAGGCCACA
ACAGCAGAAG CGGCTGCTCC TGCTAAAGCC TCTGAACCTG GTGGTGCGAG CAAGCAGGAA
GAGCCGGCTG AGAAGCCCGC TGCCAACGTT GCTCCTCCAT CGGAAAAAGA GGAAGAAGAG
AAAACGCCGG CCTCAGTCGC TGCTCCTGCT CCTGAATCCG TAGCACCACC CGCTGCACCT
TCGATTCCAC AAGGTGATAT TCATGCCGAT ATAGTGGTAT TGGGTGCAGG GCCGGGTGGC
TATACCGCCG CCTTTCGTGC TGCCGATCTC GGCAAGAATG TGGTCTTGAT CGAGCGCTAC
TCCACGCTTG GCGGCGTCTG TCTCAATGTG GGCTGCATTC CCTCCAAGGC GCTGCTGCAT
GTGGCGAAAG TCATCACCGA CGCCGAGGAA ACAGCGCAAC AGGGTATCGC TTTCGCTAAG
CCCGGCATAG AAATAGACAA ACTGCGAGGA TGGAAGGAAT CCATCATCGG CAAACTCACC
AAGGGATTGA CGGGGCTGGC GAAGCAACGC AAGGTGAAGG TAGTGCGGGG AACGGGCAGG
TTCACTTCGC CGAATATGAT CGAAGTGGAA ACCTCCGACG GAAAAAAAAC AGTGTCGTTC
GAGCACTGCA TCATTGCCGC GGGTTCTGCT GCCGCTCGCA TTCCCGGTTT CCCCTACGAT
GACCCGCGGA TCATCGATTC CACTGGCGCG CTCAAGCTGG AGAGCATCCC CAAGCGCATG
CTGATCATCG GGGGCGGCAT TATCGGTCTC GAAATGGCGA CGGTCTACGA TGCGCTCGGC
AGCAGGATCA GTGTCGTCGA ATTGATGGAT CAATTGATTC CCGGCGCGGA TGCCGATCTT
ATCCGGCCGC TGCACAAACG CATACAGAAA CGCTACGAGG CCATCTACCT CAAAACCAAA
GTCACCAGAA TTGAAGCGCT GCAAGAAGGA TTGAGAGTCA CGTTCGAAGG ATCCTCGGAA
GGTGGCGGCC CAGAGGGCAC TGGAGCACCC GAACCGCAAG TATATGACCG GATCCTGATG
GCGGTAGGAC GCCGGCCAAA TGGCCGCGAA ATTGGGGCAG AAAAGGCAGG TATAGCTGTG
AACGAGCGTG GCTTCATCCC GGTGGATAAG CAGTTGCGGA CCAACGTCTC TCACATTTTC
GCCATAGGCG ACATCGCGGG CGAGCCCATG CTGGCGCACA AGGCATCGCA TGAAGGCAAA
CTGGCGGCGG AAATCATCGC CGGCGGGGAG AAGATGAAGT CAGCAGCGTT CGACGCTCGC
GCCATTCCCT CGGTAGCTTA CACCGATCCT GAGATCGCCT GGATGGGCCT CACCGAAACC
GAAGCCAAAA AGCAAGGTAT CGAAATCGAA AAAGCGGTAT TTCCGTGGGC CGTCAGCGGT
CGTGCTTTGG CAATGGCGCG CGACGAGGGC ATGACAAAGT TAATCCTGGA CAAGAAGACG
CGGCGCATTC TCGGCGCGGG CATCGTAGGC ATCAATGCCG GCGAACTGAT ATCCGAAACC
GTACTGGGAT TGGAAATGGG CGCGGATATG GAAGATATCG GCCTTACCAT CCATCCGCAT
CCGACTCTAT CCGAAACAGT GTTCTTTGCC GCTGAGATCG CGGAGGGGAC GATTACCGAT
CTTTATATGC CGAAGAAGTA G
 
Protein sequence
MAQLTEIKVP DIGDFKDVAV IEVLVKPGDT VEKETSLITL ETDKASIEVP SPQSGIVKEL 
KVKVGDKVSE GSIILMLEAT TAEAAAPAKA SEPGGASKQE EPAEKPAANV APPSEKEEEE
KTPASVAAPA PESVAPPAAP SIPQGDIHAD IVVLGAGPGG YTAAFRAADL GKNVVLIERY
STLGGVCLNV GCIPSKALLH VAKVITDAEE TAQQGIAFAK PGIEIDKLRG WKESIIGKLT
KGLTGLAKQR KVKVVRGTGR FTSPNMIEVE TSDGKKTVSF EHCIIAAGSA AARIPGFPYD
DPRIIDSTGA LKLESIPKRM LIIGGGIIGL EMATVYDALG SRISVVELMD QLIPGADADL
IRPLHKRIQK RYEAIYLKTK VTRIEALQEG LRVTFEGSSE GGGPEGTGAP EPQVYDRILM
AVGRRPNGRE IGAEKAGIAV NERGFIPVDK QLRTNVSHIF AIGDIAGEPM LAHKASHEGK
LAAEIIAGGE KMKSAAFDAR AIPSVAYTDP EIAWMGLTET EAKKQGIEIE KAVFPWAVSG
RALAMARDEG MTKLILDKKT RRILGAGIVG INAGELISET VLGLEMGADM EDIGLTIHPH
PTLSETVFFA AEIAEGTITD LYMPKK