Gene RPB_3930 details

Gene Information

Locus tag: RPB_3930
Symbol:
ID: 3911737
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4483569
End bp: 4485113
Gene Length: 1545 bp
Protein Length: 514 aa
Translation table: 11
GC content: 70%
IMG OID: 637885834
Product: hypothetical protein
Protein accession: YP_487534
Protein GI: 86751038
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
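
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the record above.
START_BP = 4483569
END_BP = 4485113
GENE_LENGTH_BP = 1545
PROTEIN_LENGTH_AA = 514


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))   # 1545
print(coding_codons(GENE_LENGTH_BP))   # 514
```

Under these assumptions the listed gene length (1545 bp) and protein length (514 aa) agree with the coordinates.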


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.12121
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.390263
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGGTG CCGAAAGTCT GGTGCGGACG CTGGTCCATG GCGGCGTGGA CGTCTGCTTC 
ACCAATCCCG GCACCTCGGA AATGCACTTC GTCGCCGCGC TCGACCGCGT CGAGGGCATG
CGCTGCGTGC TCGGCCTGTT CGAGGGCGTG GTCACCGGCG CCGCCGACGG TTATTTCCGG
ATGAAAGGCA CGCCGGCCTC GACGCTGCTG CATCTCGGCC CCGGCCTCGC CAACGGCCTC
GCCAATCTGC ACAACGCCAA GAAGGCGAGT TCCGGCATCG TCAACATCGT CGGCCAGCAC
GCCACCTACC ACATCGACTA CAACGCGCCG CTGACCTCCG ACATCGAGGG CCTGGCGCGG
CCGATGTCGG CCTGGGTCCG CACCTCGCCG GACGCGCAAT CGGTGGCGCG CGACGGCGCC
GCCGCGATTG CCGCCGCGAA GAGCGCGCCG CCGCAGATCG CCACCCTGAT CCTGCCCGCC
GACACCGCCT GGGGCGAGGC CGACGGCATC GCCGAGGTGC CGCAAGACAC CCAGCGCCCG
AGCTATTCGC CGCACGCGGT GGAAGCCGCG GCGCGCGTGC TGCGCTCCGG CGAGCCGACG
CTGCTGCTGC TGACCGGCGG CGCGCTCACC GAACACGGCC TCGAGCTCGC CGCGCGGATC
GCCGGCAAGA CCGGCTGCCG TGTGATGGGC CAGACCTACA ATCCGCGGAT GGCGCGCGGT
CGCGGCCGCT ATGCGATCGA GCGGATTCCC TATGTGATCG AAGCCGCGCT GCCGATCCTG
AAGGACTTCC GCCACATCGT GCTGGTCGAG GCCAACGATC CGGTGGCGTT CTTCGCCTAT
CCGAACAAGC CGAGCCTGCT GAAACCGGAC GGCTGCGAGG TACATCGCAT GACCGAGGGC
GGCGAGAATT CCACCGCAGC GCTCGAAGCG CTGGCCGGCG CGCTCGGCGC CAAGGCGGCC
GACGCCCAGC CGCAGACCCA TGTCGAGATC GCGCGGCCGA GCGGCGCGCT GACCCATGCC
TCGATCGCCC AGGCGATCGC GATGGCGATC CCGGACAACG CCATCGTGAT CGACGAATCG
ATCACTACCG GCCGCGGCTT CTTTCCGCCG ACGGCGGCGG CGGCGCCGCA CGACTGGCTG
CAGAACATGG GCGGCTCGAT CGGGTTCTCG CCGCCGGTCG CGGTCGGCGC CGCGGTGGCG
TGCCCGGATC GCAAGGTGAT CTGCCTGGTC GGCGACGGCA GCGCGATGTA CACGCTGCAG
GCGCTGTGGA CCCAGGCTCG CGAAAATCTC GACGTCACCA CCGTGGTGTT CGCCAACCGC
AAATATCAGA TCCTGCGCGG CGAGTTCGAC GGCGTCGGCG CCGGCGAGCC GGGCCAGCGC
GCGCAGGACA TGCTGTCGCT GGATCGGCCG AACCTCGACT GGGTGTCGCT GGCCCGGGGC
ATGGGCGTGC CGGCCCGCGC CGTGACCAGC GCCGATGAAC TCAACAAGGC GCTCGACGCC
GGCGTCGCCG GCAGCGGTCC GAATTTGATC GAAGTGCAGA TGTAG
 
Protein sequence
MNGAESLVRT LVHGGVDVCF TNPGTSEMHF VAALDRVEGM RCVLGLFEGV VTGAADGYFR 
MKGTPASTLL HLGPGLANGL ANLHNAKKAS SGIVNIVGQH ATYHIDYNAP LTSDIEGLAR
PMSAWVRTSP DAQSVARDGA AAIAAAKSAP PQIATLILPA DTAWGEADGI AEVPQDTQRP
SYSPHAVEAA ARVLRSGEPT LLLLTGGALT EHGLELAARI AGKTGCRVMG QTYNPRMARG
RGRYAIERIP YVIEAALPIL KDFRHIVLVE ANDPVAFFAY PNKPSLLKPD GCEVHRMTEG
GENSTAALEA LAGALGAKAA DAQPQTHVEI ARPSGALTHA SIAQAIAMAI PDNAIVIDES
ITTGRGFFPP TAAAAPHDWL QNMGGSIGFS PPVAVGAAVA CPDRKVICLV GDGSAMYTLQ
ALWTQARENL DVTTVVFANR KYQILRGEFD GVGAGEPGQR AQDMLSLDRP NLDWVSLARG
MGVPARAVTS ADELNKALDA GVAGSGPNLI EVQM
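
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the standard codon table, whose codon-to-amino-acid assignments are shared by translation table 11 (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and the code is a minimal example, not the method the database used to produce this record.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines used for 10-base grouping; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("ATGAACGGTG CCGAAAGTCT GGTGCGGACG"))  # MNGAESLVRT
# Last 15 bases, ending in the TAG stop codon.
print(translate("GAAGTGCAGA TGTAG"))                  # EVQM
```

Passing the full gene sequence to `translate` and `gc_fraction` would let a reader compare the results with the protein sequence and the GC content listed in the record.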