Gene RPB_0077 details

Gene Information

Locus tag: RPB_0077
Symbol:
ID: 3907817
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 80003
End bp: 81256
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 65%
IMG OID: 637881958
Product: aspartate kinase
Protein accession: YP_483700
Protein GI: 86747204
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00656] aspartate kinase, monofunctional class
            [TIGR00657] aspartate kinase
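
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminating stop codon; neither convention is stated in the record itself. The helper names `span_length` and `coding_codons` are illustrative only.

```python
# Values copied from the record above.
start_bp = 80003
end_bp = 81256
gene_length_bp = 1254
protein_length_aa = 417


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons in a CDS whose final codon is a stop.

    Assumes the CDS length counts the stop codon.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions the span 80003 to 81256 covers 1254 bp, and 1254 bp corresponds to 418 codons, one of which is the stop, leaving 417 amino acids.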


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.216369
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0840427
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCCGGC TGGTGATGAA ATTCGGCGGC ACGTCCGTCG CCAATATCGA GCGCATCCAG 
AACGTCGCGC GCCACGTCAA GCGCGAGGTC GATGCCGGCC ATGAGGTTGC CGTGGTGGTG
TCGGCGATGG CCGGCAAGAC CAACGAGCTG GTGGCCTGGT GCACCGAAGC CTCGCCGATG
CACGACGCCC GCGAATACGA CGCCGTGGTG GCGTCCGGCG AGCAGGTGAC GTCGGGGCTT
CTCGCGATCG CCTTGCAGGC GCTCGGCATC CAGGCGCGAT CCTGGCAGGG CTGGCAGTTG
CCGATCCGCA CCAGCGACGC CCATGCCTCG GCTCGGATCG TCGAGATCGA CGGCAGCGAG
ATCGTCAAGC GCTTCGGCGA CCGCAAGGAA GTCGCGGTGA TCGCCGGCTT CCAGGGCATC
AATCCCGAGA CCGGCCGCAT CACCACGCTC GGCCGCGGCG GCTCCGACAC CTCGGCGGTG
GCGATCGCGG CGGCGCTGAA GGCCGACCGC TGCGACATCT ATACCGACGT CGACGGCGTC
TACACCACCG ACCCCCGCGT GGTGCCGAAG GCGAAACGGC TCGACAAGGT GGCGTTCGAG
GAGATGTTGG AACTGGCGTC GCAGGGCGCC AAGGTGCTGC AGGTCCGCTC GGTCGAGCTC
GGCATGGTGC ACAACATGCC GATCTTCGTG CGCTCTTCCT TCGACAAACC CGAAGATATC
GATCCGCACG GCACGCCGCC GGGCACGCTG ATCTGCAGCG AGGAGAAAAT CATGGAGAAC
CACGTCGTCA CCGGCATCGC CTTTTCCAAG GACGAAGCCC AGATCTCGGT GCGCCGGATC
GAGGACAAGC CGGGCGTGGC GGCGTCGATC TTCGGGCCGC TGGCCGACGC CAACATCAAC
GTCGACATGA TCGTGCAGAA CGTCTCGGAA GACGGCAAGA CCACGGATCT CACCTTCACG
GTTCCGGCGT CGGATTTCGC CCGCGCCAAG CAGACGATTA CTTCGGCGCA GGACAAGATC
GGTTACGCCC GGTTCGACAG CGAGACCGAC GTCGCCAAGG TGTCGGTGAT CGGCTCCGGG
ATGCGCAGCC ATGCCGGCGT CGCCGCCCAG GCATTCGCCG CTTTGGCCGC GCGCAACATC
AATATTCGCG CGATCACGAC CTCGGAGATC AAGTTCTCGG TGCTGATCGA CGCCGCCTAC
ACCGAGCTCG CGGTGCGGAC ATTGCATACT TTGTACGGAT TGGATCAAGT TTAG
 
Protein sequence
MGRLVMKFGG TSVANIERIQ NVARHVKREV DAGHEVAVVV SAMAGKTNEL VAWCTEASPM 
HDAREYDAVV ASGEQVTSGL LAIALQALGI QARSWQGWQL PIRTSDAHAS ARIVEIDGSE
IVKRFGDRKE VAVIAGFQGI NPETGRITTL GRGGSDTSAV AIAAALKADR CDIYTDVDGV
YTTDPRVVPK AKRLDKVAFE EMLELASQGA KVLQVRSVEL GMVHNMPIFV RSSFDKPEDI
DPHGTPPGTL ICSEEKIMEN HVVTGIAFSK DEAQISVRRI EDKPGVAASI FGPLADANIN
VDMIVQNVSE DGKTTDLTFT VPASDFARAK QTITSAQDKI GYARFDSETD VAKVSVIGSG
MRSHAGVAAQ AFAALAARNI NIRAITTSEI KFSVLIDAAY TELAVRTLHT LYGLDQV
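
Both sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that step, using only the first and last lines of the gene sequence as sample input; the function names `clean_sequence` and `gc_fraction` are hypothetical helpers, not part of any database tool.

```python
def clean_sequence(blocked):
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna):
    """Fraction of bases in the sequence that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "ATGGGCCGGC TGGTGATGAA ATTCGGCGGC ACGTCCGTCG CCAATATCGA GCGCATCCAG"
last_line = "ACCGAGCTCG CGGTGCGGAC ATTGCATACT TTGTACGGAT TGGATCAAGT TTAG"

fragment = clean_sequence(first_line)
print(len(fragment), fragment[:3])        # block-joined length and start codon
print(clean_sequence(last_line)[-3:])     # final codon of the gene
print(round(gc_fraction(fragment), 2))    # GC fraction of this fragment only
```

Applying `clean_sequence` to the full gene listing and passing the result to `gc_fraction` would give a value to compare against the GC content field; the fragment used here is too short to stand in for the whole gene.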