Gene Rsph17029_3094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3094 
Symbol 
ID4899083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp108951 
End bp110015 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content74% 
IMG OID640113696 
Producturea amidolyase related protein 
Protein accessionYP_001044966 
Protein GI126463853 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1984] Allophanate hydrolase subunit 2 
TIGRFAM ID[TIGR00724] biotin-dependent carboxylase uncharacterized domain 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.604778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.778308 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTCG AGATTCTGAC CGCAGGCCCC ATGCTGACCG TGCAGGACGC GGGCCGCTTC 
GGCCTGCGCC ACATGGGCGT CTCGCCCGCG GGCCCCATCG ACCGGGCCGC CATGGCGCTC
GCCAATGCGC TCGTGGGCAA TGCGCCCGGC GCCGCGGCGC TGGAATTCGC AGGCCCCGCG
GGCAGCTTCC GCTGCGACCG GCCGGTGCGC TTCGCGGTGG CGGGCGCCGA TTGCCCGATC
CGCATCGACA AGCGCGTGGT GCTGGCGGGC GAGAGCCACC GGCTGAACCC CGGTGAAACC
CTCACCGTGG GCGTGCCCGA AGGCACGGTC TGGGCCTATC TGGCCTTCTC CGGCGCCATC
GCCACGCCCG AGGTGCTGGG CTCGCGCGCG ACGCATCTGC GCTCGGGCCT CGGCGGCCCC
GAGGGGCGGG CGCTGGCGGC GGGCGACCGG CTGCCGTTCG GCCCCGACGA GGCAGACGCG
CCCTGCCTGC GCCCAGACAG CCGTCTGGAC GGCGCGGCGC CCTTCCGCGA GACGGGACCG
ATCCGGCTGA TCCTCGGCCC GCAGGACGGC CATTTCGCCC CCGAGATCGT GGCGCGCCTC
ACCGGATGCG ACTTCACCGT GACCCCGCAG CGCGACCGGA TGGCCATGGT GCTGGGCGGC
ACCGACCTGC CCGCCGCGCG CGGGCACGAC ATCGTCTCGG ACGGCACGGT GCCGGGCTCG
GTGCAGGTGC CGGGCTCGGG GATGCCGCTC GTGCTTCTGG CCGAGAGCCA GACCACCGGC
GGCTATCCCA AGATCGGCAC CGTGGCCTCG GTCGATCTGG CGCGGCTCGC GCAGATGCCG
GTGGGCGCGC AGGTCCGCTT CGCGCTGATC TCGGCCGAGG AGGGCGAGGA TCTCTGGATC
GCGCGGCAGG CGCGGCTCCG GCGGCTTCTC GAGGCGCTGG TGGCCAAGCC CGAGGGCGTC
CTGCGGTCGG ATTACCTCTT GTCCTGCGAT CTCGTCGGCG GCTTCTACGA GCCGGGCGAG
ATCGTGCGTC CCGTCACGAT TCGGGGCCCG GAGGAATGTT CATGA
 
Protein sequence
MSLEILTAGP MLTVQDAGRF GLRHMGVSPA GPIDRAAMAL ANALVGNAPG AAALEFAGPA 
GSFRCDRPVR FAVAGADCPI RIDKRVVLAG ESHRLNPGET LTVGVPEGTV WAYLAFSGAI
ATPEVLGSRA THLRSGLGGP EGRALAAGDR LPFGPDEADA PCLRPDSRLD GAAPFRETGP
IRLILGPQDG HFAPEIVARL TGCDFTVTPQ RDRMAMVLGG TDLPAARGHD IVSDGTVPGS
VQVPGSGMPL VLLAESQTTG GYPKIGTVAS VDLARLAQMP VGAQVRFALI SAEEGEDLWI
ARQARLRRLL EALVAKPEGV LRSDYLLSCD LVGGFYEPGE IVRPVTIRGP EECS