Gene Rru_A2071 details


Gene Information

Locus tag: Rru_A2071
Symbol:
ID: 3835497
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 2395812
End bp: 2397317
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 67%
IMG OID: 637826172
Product: methylmalonate-semialdehyde dehydrogenase [acylating]
Protein accession: YP_427158
Protein GI: 83593406
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01722] methylmalonic acid semialdehyde dehydrogenase
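
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2395812
end_bp = 2397317
gene_length_bp = 1506
protein_length_aa = 501

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and is not part of the protein.
coding_codons = codons - 1

print(span, remainder, coding_codons)  # prints: 1506 0 501
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.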


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCAGCC TTGTCTCCTA TACCCATTTC GTCAATGGCG CCCATACCGC CCCCGCCGGC 
GGCCGCAGCG CCCCGGTGTT CCTGCCGATG ACCGGCGAGA TCCAGGCCCG GGTGGATCTG
GCCAGCCGGG CCGAGGTCGA TGCCGCCGTG GCCCTCGCCG CCCAGGCCCA GCCGGCCTGG
GCCGCCCAGA ACCCGCAAAA GCGCGCCCGG GTGCTGATGC GCTTCCTGGA ACTGGCGCGC
CGCGATAACG AGGCCCTGGC CCTGTTGCTG GCCCGTGAGC ACGGCAAGAC CATCGCCGAT
GCCAAGGGCG ACATCCAGCG CGGCCTGGAA GTCGTCGAAT TCGCCATCGG CATTCCCCAT
CTGCTCAAGG GCGAATACAC CGACTCCGCC GGTCCGGGGA TCGATCTCTA TTCGATGCGC
CAGCCCTTGG GCGTGGTCGC CGGCATCACG CCGTTTAATT TTCCGGCGAT GATCCCGCTG
TGGAAGGCCG CCCCGGCCAT CGCCTGCGGC AACGCCTTCA TCCTCAAGCC CTCCGAGCGC
GATCCCGGGG TGCCGTTGCG TTTGGCCGAA CTGTTCCTGG AAGCCGGCCT GCCGCCGGGA
ATTTTCAATG TGGTCAATGG CGACAAGGAA GCCGTCGACG CCCTTCTTGA CAACCCCACG
GTCAAGGCCA TCGGCTTCGT CGGCTCGACG GCGATCGCCC AATACATCTA TGGGCGGGGC
ACGGCGGCGG GCAAGCGGGT GCAATGCTTC GGCGGCGCCA AGAACCACAT GATCATCATG
CCCGACGCCG ACCTCGATCA GACGGTCGAC GCCCTGATCG GCGCCGGTTA TGGCTCGGCC
GGCGAGCGTT GCATGGCGAT CTCGGTGGCG GTGCCGGTGG GCGAGGCCAC CGCCGAGGCC
CTGATGGACA AACTCATCCC CCGGGTCCGC GCCCTGAAGA TCGGCCCTTC GACCGATCCC
GAGGCCGATT TCGGCCCGCT GGTCACCCGC GAGGCCGTGG ACAGGGTCAC GGCCGCCGTC
GCCCAGGGTG TGGCCGAGGG CGCCGATCTG GTGGTCGATG GCCGGGACTT CTCCCTTCAG
GGCTATGAGA ACGGCTTTTA CATGGGCGGC TGCCTGTTCG ACCGCGTCAC CCCCGCCATG
GCGATCTACC GCGAGGAGAT CTTCGGCCCG GTGCTCAGCG TCGTGCGCGC CGCCGATTAT
GAGCAGGCCC TGCGCCTGCC CAACGAACAC GCCTATGGCA ATGGCGTTGC CATCTTCACC
CGCGACGGCG ACGCGGCGCG CGATTTCGCC GCCCGCGTCG AGGTCGGCAT GGTCGGCATC
AACGTGCCGA TCCCCGTGCC TTTGGCCTAT TTCACCTTCG GTGGCTGGAA GCGCTCGGGC
TTTGGCGATC TCAACCAGCA TGGCCCCGAC GCCGTGCGCT TTTACACCAA AACCAAGACC
GTGACCTCGC GCTGGCCCTC GGGCATCCGC GACGGCGCCG AGTTCGTCAT CCCGACCATG
CGCTAA
 
Protein sequence
MSSLVSYTHF VNGAHTAPAG GRSAPVFLPM TGEIQARVDL ASRAEVDAAV ALAAQAQPAW 
AAQNPQKRAR VLMRFLELAR RDNEALALLL AREHGKTIAD AKGDIQRGLE VVEFAIGIPH
LLKGEYTDSA GPGIDLYSMR QPLGVVAGIT PFNFPAMIPL WKAAPAIACG NAFILKPSER
DPGVPLRLAE LFLEAGLPPG IFNVVNGDKE AVDALLDNPT VKAIGFVGST AIAQYIYGRG
TAAGKRVQCF GGAKNHMIIM PDADLDQTVD ALIGAGYGSA GERCMAISVA VPVGEATAEA
LMDKLIPRVR ALKIGPSTDP EADFGPLVTR EAVDRVTAAV AQGVAEGADL VVDGRDFSLQ
GYENGFYMGG CLFDRVTPAM AIYREEIFGP VLSVVRAADY EQALRLPNEH AYGNGVAIFT
RDGDAARDFA ARVEVGMVGI NVPIPVPLAY FTFGGWKRSG FGDLNQHGPD AVRFYTKTKT
VTSRWPSGIR DGAEFVIPTM R
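
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned, translated, and checked for GC content is given below. It uses only the Python standard library. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built from the standard amino acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model).

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGTCCAGCC TTGTCTCCTA TACCCATTTC GTCAATGGCG CCCATACCGC CCCCGCCGGC"
dna = clean(first_line)
print(translate(dna))  # prints: MSSLVSYTHFVNGAHTAPAG
```

The translated fragment agrees with the first twenty residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.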