Gene Namu_4110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4110 
Symbol 
ID8449733 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4537721 
End bp4538743 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content68% 
IMG OID645043156 
Productdehydrogenase E1 component 
Protein accessionYP_003203388 
Protein GI258654232 
COG category[C] Energy production and conversion 
COG ID[COG1071] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.0915017 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0509347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCA AGACGAAGGA AGCACCGGCG GCCGCGGCCG CCGATGCCGC GGACGACCTG 
GGATCCCGGG ACCGGGATCT GCTGCTGCGC ATCTACCGCG CGATGGTGCT GACCCGGGCG
GTCGAGGACC GGATGGTCGC GATGTACAAG GGTGGCGACC TGCTCGGCTC CCTGTACACC
GGGCACTGGC ACGAGGGGAT CAGCGTGGGC GCGGCGTCCA CGCTGCGCGC GGACGACTAC
ATGGCCCCGA TCCACCGCGA TCTGGGCGCC CACCTGTACC GGGGCATGGA CGCGTGGCAG
GTGATGGCCA GCTTCATGGG CAAGGCCACC TCGCCCACCG GCGGTCGCGA CGGCACCCTG
CACTACGGCC GGCTCGACCT GGGCCATTAC AACCTGCCCA GCCACATCCC GGCCAACTTC
CCGGTGGCCA CCGGCATGGC ATTCGCGGCG AAATACCGTG GGCAGGACAA GGTCTGCCTG
GCGTTCTGCG GTGACGGCTC CACCTCGCGG GCCGATTTCC ACGAGGCGCT GAGCATGTCC
AGCGTGCTGG ACCTGCCCAA CGTGTTCGTC ATCGAGAACA ACCAGTTCGC CTACTCGACG
CCGATCTCGA TGCAGTCCAA GTCGATGCAG TTCTCCGACA AGGCCAAGGC CTACGGCATT
CCCGGGGTCA CTGTGGACGG CACCGACGTG CTCGCCGTGC ACGACGCGGT CGCCGAAGGC
GTCGAGCGGG CCCGCGCGGG TAAGGGACCC TCCATCGTCG AGGGCATCAC GATGCGGATG
CACGGGCACG CCGAGCATGA CCCGGCCGAC TACGTGCCGC CGGCGATGTT CGAGGAATGG
TCGAAGAAGG ACCCGGTCGA GCTGTTCGAG AAGCGGCTGG TGGCCGCCGG TGTCATCGAC
CAGGCGACAG CTGAGGACAC CCGGAAGCAG GCCCGCCAGG CGGCCATCGA CGCACGCAAG
AAGGCCCTCG CCGATCCGAT GCCGACGGCG GAGAACATCG AGGATGGCGT TTATGCCGAC
TGA
 
Protein sequence
MATKTKEAPA AAAADAADDL GSRDRDLLLR IYRAMVLTRA VEDRMVAMYK GGDLLGSLYT 
GHWHEGISVG AASTLRADDY MAPIHRDLGA HLYRGMDAWQ VMASFMGKAT SPTGGRDGTL
HYGRLDLGHY NLPSHIPANF PVATGMAFAA KYRGQDKVCL AFCGDGSTSR ADFHEALSMS
SVLDLPNVFV IENNQFAYST PISMQSKSMQ FSDKAKAYGI PGVTVDGTDV LAVHDAVAEG
VERARAGKGP SIVEGITMRM HGHAEHDPAD YVPPAMFEEW SKKDPVELFE KRLVAAGVID
QATAEDTRKQ ARQAAIDARK KALADPMPTA ENIEDGVYAD