Gene RPB_2833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_2833 
Symbol 
ID3910626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3226598 
End bp3227605 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content69% 
IMG OID637884733 
Productallophanate hydrolase subunit 2 
Protein accessionYP_486446 
Protein GI86749950 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.636901 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.872567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC TTCTAATCGA CAATGTCGGC CCGGCGACCT CCGTGCAGGA CGCGGGACGC 
CACGGCGCGC AGCGCTACGG TCTGACGCCG AGCGGCGCGA TGGATCGTCT GTCGCTGGCT
GCCGCGAACG TGCTGGTCGG CAACGATGCG TTCGCCGCCG CGGTCGAATT GGGGCCGCTC
GGCGCAACGC TCACCGCGCA CGACGGCGCG GTGCGGCTGG CCCTGACAGG CGCGGAGCGC
TCCGCCACGA TCGGCGACCG CGCGATCGCA CTCAACGAGT CCTTCCTGCT CGCCGACGGC
GAGACATTGA CGCTCGGAAT CGCGCGCAGC CAGGTGTTCA GCTATCTGGC GATCGCTGGC
GGCATCGATG GCGAGCCGAT GTTCGGCAGT CTCGCGGTCA ATGCCCGCGC CGGCCTCGGC
AGTCCCTACC CGCGGCCGCT GCAATCCGGC GACGCCATCC CGGCCGCGTC GGCAGTCGTT
GCGCCCGAAC GTCGCCTCGA TCTGCCGACA CCGCCCGACG GGCCGATCCG CGTCGTGCTC
GGCCCGCAGG ACGACGAATT CGGCGATGCC GTCGCGACCT TCCTCGACAG CGCGTGGAAA
GTGTCGGCGA CCAGCGACCG GATGGGCTAT CGCCTCGAAG GTCCGGAGAT CCGCCATCTG
CACGGCCACA ACATCGTCTC CGACGGCACT GTCGACGGCA GCATCCAGGT TCCCGGCAAT
GGCCAGCCGA TCGTGCTGAT GCCCGATCGC GGCACCAGCG GCGGCTATCC GAAAATTGCC
ACCGTGATCA CCGCCGATCT CGGTCGGCTC GCGCAGCTTC AGCCCGGGCG GCCGTTTCGT
TTCAGATCGG TGAGCATGGA GGAGGCGCAG GCCGAATATC GCGCAATGGC CGGGCTGATC
CGCGCCCTGC CCGACCGGAT CGCGGACGCG CAGCATATGA CGCTCGACCT CGACGCGCTG
CTGACGGCCA ACGTGGCGGG CGCGGCCACC AACGCGCTCG AAATCTGA
 
Protein sequence
MSKLLIDNVG PATSVQDAGR HGAQRYGLTP SGAMDRLSLA AANVLVGNDA FAAAVELGPL 
GATLTAHDGA VRLALTGAER SATIGDRAIA LNESFLLADG ETLTLGIARS QVFSYLAIAG
GIDGEPMFGS LAVNARAGLG SPYPRPLQSG DAIPAASAVV APERRLDLPT PPDGPIRVVL
GPQDDEFGDA VATFLDSAWK VSATSDRMGY RLEGPEIRHL HGHNIVSDGT VDGSIQVPGN
GQPIVLMPDR GTSGGYPKIA TVITADLGRL AQLQPGRPFR FRSVSMEEAQ AEYRAMAGLI
RALPDRIADA QHMTLDLDAL LTANVAGAAT NALEI