Gene RPD_2070 details

Gene Information

Locus tag: RPD_2070
Symbol:
ID: 4022552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2317197
End bp: 2318753
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 67%
IMG OID: 637962263
Product: hypothetical protein
Protein accession: YP_569206
Protein GI: 91976547
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
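
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the reported gene span includes the stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(2317197, 2318753)
print(length, protein_length_aa(length))  # prints: 1557 518
```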


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGGCGACA CCATCAACGG TGCACAGTCG GTTGTGCAGA CGCTGATCAA TTGCGGCGTC 
GAAGTCTGCT TCGCCAATCC CGGCACCTCC GAGATGCATT TCGTCGCGGC CCTGGACACG
GTCAGCGGGA TGCGGCCGGT GCTGTGCCTG TTCGAAGGCG TCGTGACCGG CGCCGCCGAT
GGCTACGGTC GGATCGCCGG CAAGCCGGCG GTGACCCTGC TGCATCTCGG ACCCGGTCTC
GCCAACGGCC TCGCTAATCT GCACAACGCC CGCCGCGCCG CGACGCCGAT CGTCAACATC
GTCGGCGACC ACGCCGCGCA TCATCTGCAA TACGACGCAC CGCTGACCTC CGACATCGTC
GGCTTTGCTC GGCCGGTTTC GAACTGGATC GTCGAGTCAA AGAGTGCAGG CGCGGTCGCC
AGCGACGTTG CGCGCGCGGT GCAGGCGGCG AAAGCTGCGC CGGGCGCAAT CGCAACACTG
ATCATGCCCG CCGACGTTGC CTGGAACGCC GCGGCGCGCG CCGCGCAGCC GCTGGTTGAT
TATGGGCCGG CGCGCGTGCA CGCCGACACG ATCGAGGCCG TCGCGAAGCT GCTGTCGAAC
GGCAAGAAGT CAGCGCTGTT GCTACGCGGC AATGCGTTGC TGGGATCGGG ACTTGAAGCG
GCAGGGCGGA TCCAGGCGAA ATGCGGCGCC CGGCTGATGT GCGACACCTT CGCGCCGCAA
ACGGAACTCG GCGCCGGCCG CGTGCCGCTC GAGCGCATCC CGTATTTCTC GGAACAGATC
ACTGCGTTCC TGAAAGACGT TGAGCAGTTG GTGCTCGTCG GCGCCAAACC GCCGGTGTCG
TTCTTCGCCT ATCCGGGCAA GCCGAGCTGG GGCGCGCCCG ACGGCTGTCA GTTCGAGTAT
CTGGCGCAGC CCCACGAGGA CGGCGCGCAG GCGCTGCGAG ACCTCGCCAC CGCGCTCGAC
GCGCCGGCGG AGCCCAAGAC GCGCACGCAA CTCGCTTTGC CAGATCTGCC CAAGGGCAAG
CTGAATTCGC TCGGCGTCGC GCAGGTGATC GCGCATCATA CGCCCGACCA CGCGATCTAC
GCCGAGGAAG CGGCGACGTC GGGCCTGCCG TTTCAGATGA TCGTGCCGCG GGCCCGGCCG
CATACGCATC TGCCGCTCAC CGGCGGTTCG ATCGGGCAGG GGCTGCCGCT TGCGATCGGC
GCGGCGATCG CCGCGCCCGA CCGCAAAGTT GTGTGTCCGC ATGGCGACGG CGGCGCGGCC
TATACGATGC AGGCGCTGTG GACGATGGCG CGCGAGCAGC TCGACATCAC CGTCGTGATC
TACGCAAACC GGTCCTACGC CATTCTCAAT GTCGAATTGC AGCGCGTCGG CGCGTCGGGT
GCGGGATCGA AAGCGCTGTC GATGCTCGAC CTGCACAATC CGGAAATGAA CTGGATGAAG
ATCGCCGAAG GCCTCGGGGT CGAGGCGAGC CGCGCCACCA CGGCGGAGGA GTTCGCCGCG
CAATACGCCT CGGCGATGAG CCAGCGCGGC CCGCGCCTGA TCGAAGCGCT GATTTGA
 
Protein sequence
MGDTINGAQS VVQTLINCGV EVCFANPGTS EMHFVAALDT VSGMRPVLCL FEGVVTGAAD 
GYGRIAGKPA VTLLHLGPGL ANGLANLHNA RRAATPIVNI VGDHAAHHLQ YDAPLTSDIV
GFARPVSNWI VESKSAGAVA SDVARAVQAA KAAPGAIATL IMPADVAWNA AARAAQPLVD
YGPARVHADT IEAVAKLLSN GKKSALLLRG NALLGSGLEA AGRIQAKCGA RLMCDTFAPQ
TELGAGRVPL ERIPYFSEQI TAFLKDVEQL VLVGAKPPVS FFAYPGKPSW GAPDGCQFEY
LAQPHEDGAQ ALRDLATALD APAEPKTRTQ LALPDLPKGK LNSLGVAQVI AHHTPDHAIY
AEEAATSGLP FQMIVPRARP HTHLPLTGGS IGQGLPLAIG AAIAAPDRKV VCPHGDGGAA
YTMQALWTMA REQLDITVVI YANRSYAILN VELQRVGASG AGSKALSMLD LHNPEMNWMK
IAEGLGVEAS RATTAEEFAA QYASAMSQRG PRLIEALI
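
Both sequences above are printed in groups of ten residues, so the spacing has to be stripped before any downstream use. The following is a minimal sketch of that cleanup together with a GC-fraction calculation of the kind summarized in the GC content field; the helper names are hypothetical and the demonstration uses only the first line of the gene sequence. Note that the gene begins with GTG, an alternative start codon under translation table 11, which is read as methionine at initiation and matches the leading M of the protein sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used here as a short demonstration.
first_line = "GTGGGCGACA CCATCAACGG TGCACAGTCG GTTGTGCAGA CGCTGATCAA TTGCGGCGTC "
dna = clean_sequence(first_line)
print(len(dna), dna[:3], round(gc_fraction(dna), 2))
```

Applying the same two helpers to the full gene sequence block would give the length and GC fraction for the whole coding region.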