Gene RPD_2635 details

Gene Information

Locus tag: RPD_2635
Symbol:
ID: 4023132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB5
Kingdom: Bacteria
Replicon accession: NC_007958
Strand:
Start bp: 2954269
End bp: 2955276
Gene Length: 1008 bp
Protein Length: 335 aa
Translation table: 11
GC content: 68%
IMG OID: 637962833
Product: allophanate hydrolase subunit 2
Protein accession: YP_569765
Protein GI: 91977106
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
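
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 2954269
end_bp = 2955276
gene_length_bp = 1008
protein_length_aa = 335

# Assumption: inclusive coordinates, so length = end - start + 1.
computed_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated,
# so the protein has one residue fewer than the number of codons.
computed_protein_length = computed_length // 3 - 1

print("gene length consistent:", computed_length == gene_length_bp)
print("protein length consistent:", computed_protein_length == protein_length_aa)
```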


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.238129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000404715
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGCAGGC TCGTCGTTGA CATCGTCGGC CCGGCCACCT CCGTGCAGGA TGCGGGCCGC 
CACGGCGCGC AACGCTATGG GCTGACCCCG AGCGGCGCGA TGGATCGATG GTCGTTAGCC
GCGGCGAACA CGCTGGTCGG CAACCCCGCA TTCGCCGCTG CGATCGAACT CGGGCCGCTC
GGCGCGGCGT TCACCGCGCG CGACGGCGCG GTGCGGCTCG CGTTATGCGG CGCGGAACGT
CCCGCCGCGA TCGGCAGCGG AGCGATTGCG CTCAACGAGT CGTTTCTGCT CGCTGAGGAC
GAGACGCTGA CGCTCGGCGT CGCGCGCAGC CATGTGTTCA GCTATCTGGC GATCGCAGGC
GGCATCAGCG GCGAACCGAT GTTCGGCAGT CTCGCGGTCA ATGCCCGCGC CGGCCTCGGC
AGCCCCTACC CGCGGCCGCT ACAGCCCGGC GACGTCATTC CGGCAAAGCC AGCGACGATC
GCCGCCGAAC GCCGTCTCGA TCTGCCGAAG CCGTCCGAAG CGCCGATCCG CGTCGTGCTC
GGTCCGCAGG ACGACGAATT CGGCGACGCC GTCGCAACCT TCCTCAATGG CGAATGGAAA
ATCTCCGCGA CCAGCGACCG GATGGGCTAT CGACTCGAAG GGCCGGAGAT CAGGCATTTG
CACGGCCATA ACATTGTCTC CGACGGCACC GTCGACGGCA GCATTCAGGT TCCCGGCAAC
GGCCAGCCGA TCGTGTTGAT GCCCGACCGC GGCACCAGCG GCGGCTACCC GAAGATCGCG
ACCGTGATCT CCGCCGATCT CGGTCGTCTC GCGCAATTCC AGCCCGGGCG GCCGTTCCGT
TTCAAGGCGG TGAGCATGGA CGAGGCGCAG GCCGAGTATC GTGCGATGGC GAAGTTGATC
CGCGCTTTGC CAGATCGTTT GCAGGATGCG CAACAGGGGA TGCTCGACCT CGACGCGCTG
TTCACCGCCA ACGTCGCGGG CGCGGCGGCC AATGCGCTCG ACGGCTGA
 
Protein sequence
MSRLVVDIVG PATSVQDAGR HGAQRYGLTP SGAMDRWSLA AANTLVGNPA FAAAIELGPL 
GAAFTARDGA VRLALCGAER PAAIGSGAIA LNESFLLAED ETLTLGVARS HVFSYLAIAG
GISGEPMFGS LAVNARAGLG SPYPRPLQPG DVIPAKPATI AAERRLDLPK PSEAPIRVVL
GPQDDEFGDA VATFLNGEWK ISATSDRMGY RLEGPEIRHL HGHNIVSDGT VDGSIQVPGN
GQPIVLMPDR GTSGGYPKIA TVISADLGRL AQFQPGRPFR FKAVSMDEAQ AEYRAMAKLI
RALPDRLQDA QQGMLDLDAL FTANVAGAAA NALDG
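
The GC content and protein sequence reported in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: the helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is built from the amino acid assignments of NCBI translation table 11, which match the standard code. It assumes the sequence is given in the coding orientation, as the gene sequence above appears to be.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# The first 21 bases of the gene sequence, copied from the record.
print(translate("ATGAGCAGGC TCGTCGTTGA C"))
```

Passing the full gene sequence to `gc_content` and `translate` would allow a comparison against the reported GC content and the protein sequence listed above.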