Gene RSP_3449 details

Gene Information

Locus tag: RSP_3449
Symbol:
ID: 3721735
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007494
Strand:
Start bp: 519194
End bp: 520258
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 74%
IMG OID: 640073116
Product: putative allophanate hydrolase subunit 2
Protein accession: YP_354954
Protein GI: 77465451
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1984] Allophanate hydrolase subunit 2
TIGRFAM ID: [TIGR00724] biotin-dependent carboxylase uncharacterized domain
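
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon (the record states neither explicitly). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming it ends in exactly one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(519194, 520258)
print(length_bp)                  # 1065
print(protein_length(length_bp))  # 354
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.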


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCTCG AGATCCTGAC CGCAGGTCCC ATGCTGACCG TGCAGGACGC GGGCCGCTTC 
GGCCTGCGCC ACATGGGCGT CTCGCCCGCG GGCCCCATCG ACCGGGCCGC CATGGCGCTC
GCCAATGCGC TCGTGGGCAA TGCGCCCGGC GCCGCGGCGC TGGAATTCGC AGGCCCTGCG
GGCAGCTTCC GCTGCGACCG GCCGGTGCGC TTTGCGGTGG CCGGGGCCGA CTGCCCGATC
CGCATCGACA AGCGCGTGGT GCTGGCGGGC GAGAGCCACC GGCTGAACCC CGGTGAAACC
CTCACCGTGG GCGTGCCCGA AGGCACGGTC TGGGCCTATC TGGCCTTCTC CGGCGCCATC
GCCACGCCCG AGGTGCTGGG CTCGCGCGCG ACGCATCTCC GCTCGGGCCT CGGCGGCCCC
GAGGGGCGGG CGCTGGCGGC GGGCGACCGG CTGCCGCTCG GCCCCGACGA GGCCGACGCG
CCCTGCCTGC GCCCCGACAG CCGTCTGGAC GGCGCGGCGC CCTTCCGCGA GACGGGACCG
ATCCGGCTGA TCCTCGGCCC GCAGGACGGC CATTTCGCCC CCGAGATCGT GGCGCGCCTC
ACCGGGTGCG ACTTCACCGT GACCCCGCAG CGCGACCGGA TGGCCATGGT GCTGGGCGGC
ACCGACCTGC CCGCCGCGCG CGGGCACGAC ATCGTCTCCG ACGGCACGGT GCCGGGCTCG
GTGCAGGTGC CGGGCTCGGG GATGCCGCTC GTGCTTCTGG CCGAGAGCCA GACCACCGGC
GGCTATCCCA AGATCGGGAC CGTGGCCTCG GTCGATCTCG CGCGGCTCGC GCAGATGCCG
GTGGGCGCGC AGGTCCGCTT CGCGCCGATC TCGGCCGAGG AGGGCGAGGA TCTCTGGATC
GCGCGGCAGG TGCGGCTAAG GCGGCTTCTC GAGGCGCTGG TGGCCAAGCC CGAGGGCGTC
CTGCGGTCGG ATTACCTCTT GTCCTGCGAT CTCGTCGGCG GCTTCTACGA GCCGGGCGAG
ATTGTGCGTC CCGTCACGAT TCGGGGCCCG GAGGAATGTT CATGA
 
Protein sequence
MSLEILTAGP MLTVQDAGRF GLRHMGVSPA GPIDRAAMAL ANALVGNAPG AAALEFAGPA 
GSFRCDRPVR FAVAGADCPI RIDKRVVLAG ESHRLNPGET LTVGVPEGTV WAYLAFSGAI
ATPEVLGSRA THLRSGLGGP EGRALAAGDR LPLGPDEADA PCLRPDSRLD GAAPFRETGP
IRLILGPQDG HFAPEIVARL TGCDFTVTPQ RDRMAMVLGG TDLPAARGHD IVSDGTVPGS
VQVPGSGMPL VLLAESQTTG GYPKIGTVAS VDLARLAQMP VGAQVRFAPI SAEEGEDLWI
ARQVRLRRLL EALVAKPEGV LRSDYLLSCD LVGGFYEPGE IVRPVTIRGP EECS
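
The protein sequence above corresponds to the gene sequence read under translation table 11, and the GC content is a simple base count. The following is a minimal sketch in Python that uses only the first 60 bases of the gene sequence for brevity. The helper names are illustrative, and the codon table is built by hand on the understanding that table 11 assigns the same amino acids as the standard code (it differs in which start codons are permitted).

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAGCCTCG AGATCCTGAC CGCAGGTCCC ATGCTGACCG TGCAGGACGC GGGCCGCTTC"
print(translate(first_line))            # MSLEILTAGPMLTVQDAGRF
print(f"{gc_content(first_line):.0%}")  # GC fraction of this 60-base fragment
```

Applied to the full 1,065 bp sequence, the same functions should reproduce the 354-residue protein, and the GC fraction can be compared with the 74% reported in the record.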