Gene Rru_A2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRru_A2981 
Symbol 
ID3836426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodospirillum rubrum ATCC 11170 
KingdomBacteria 
Replicon accessionNC_007643 
Strand
Start bp3432473 
End bp3433426 
Gene Length954 bp 
Protein Length317 aa 
Translation table11 
GC content66% 
IMG OID637827095 
Product2-desacetyl-2-hydroxyethyl bacteriochlorophyllide 
Protein accessionYP_428063 
Protein GI83594311 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.383171 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACACCC TCGCCGTCGT CATCCAAGAA CCCGAGCGTC TGACGCTCAG CCGGCTGGAT 
CTCACCGATC CGGCGCCTGG CGACGTGGTC GTGGATGTCG AATGGAGCGG GATCAGCACC
GGAACCGAAC GGTTGCTGTG GTCGGGGCGG ATGCCGCCCT TCCCGGGGAT GGGATATCCG
TTGGTGCCTG GATACGAGTC GGTCGGTCGG GTGATCGCCG TTGGGTCGCA GGCGCGGGCC
AAGGTGGGGA CGCAGGTCGG TGATCGGGTG TTCGTTCCCG GTGCGCGGTG TTATGGCGCG
GTGAACGGCC TGTTCGGTGG CGCCGCGTCG CGGGTGGTTG TGCCCGCCGA TCGGGTGGTC
GCCCTTCCCG AGGGGTTGGA CGACAAGGGT GTGTTGTTGG CGCTGACGGC CACGGCCTAC
CACGCCATGG TCATCGCCGG GGATCTCCGG CCCGAGTTGA TCGTCGGTCA CGGCGTCCTC
GGTCGGTTGC TCGCCCGGCT GGTGGTCGGG GTCGGCGGTA CGGCGCCGAC GGTTTGGGAG
CGTAATCCGC AGCGTCGGAG CGGGGCGATC GGTTATGCGG TCGTCGATCC GGCGGAGGAT
CCGCGTAAGG ATTATCGCTG CATTTGTGAT GTCAGCGGCG ACGCGACGAT CCTTGATACC
CTGGTGGCGC GCTTGGCGCG TGGCGGTGAG ATCGTTCTGG CGGGGTTCTA TGAATCGGCC
CTGTCGTTCA CTTTCCCGCC GGCTTTCATG CGCGAAGCCA GGATCCGTGT GGCGGCCGAA
TGGCGGCCCG AAGATTTGGC GGCGGTGATC GACATGATCG TTGATGGGCG GATGTCGCTC
GATGGTCTGA TCACCCACCG CGAGGAGGCC CCGCAGGCCG CCTCGGCTTA TCGGACGGCG
TTCACCGACC CGTCTTGTCT GAAGATGGTT TTGGATTGGA GAGCTTGCTC ATGA
 
Protein sequence
MDTLAVVIQE PERLTLSRLD LTDPAPGDVV VDVEWSGIST GTERLLWSGR MPPFPGMGYP 
LVPGYESVGR VIAVGSQARA KVGTQVGDRV FVPGARCYGA VNGLFGGAAS RVVVPADRVV
ALPEGLDDKG VLLALTATAY HAMVIAGDLR PELIVGHGVL GRLLARLVVG VGGTAPTVWE
RNPQRRSGAI GYAVVDPAED PRKDYRCICD VSGDATILDT LVARLARGGE IVLAGFYESA
LSFTFPPAFM REARIRVAAE WRPEDLAAVI DMIVDGRMSL DGLITHREEA PQAASAYRTA
FTDPSCLKMV LDWRACS