Gene Rsph17029_3091 details

Gene Information

Locus tag: Rsph17029_3091
Symbol:
ID: 4898192
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009050
Strand:
Start bp: 106023
End bp: 107303
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 74%
IMG OID: 640113693
Product: allantoate amidohydrolase
Protein accession: YP_001044963
Protein GI: 126463850
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases
TIGRFAM ID: [TIGR01879] amidase, hydantoinase/carbamoylase family
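
The length fields in this record follow from the coordinates by simple arithmetic. The sketch below shows that relationship, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a stop codon that is not translated. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of amino acids encoded by a CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon does not code for an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(106023, 107303)
print(length)                     # 1281
print(protein_length_aa(length))  # 426
```

Both results agree with the Gene Length and Protein Length values listed above.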


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.60133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.601215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC TGCCCCTGGC TGAGGACGCC CCCGCCCGCG CCATCCGCAG CACCATCGAC 
CGGGTGCTGG CCGAGGTGAA CGCCCTCTCC GAGGGCGGCC CCGGCTGGAC CCGCCCCTCC
TATTCCGATC TGGAAAGCCA TGCCCATGCG CTGATCGAGG CCGAGGCCCG CGCGCTCGGC
TTGAGCGTCA GCCGCGATCA TGCAGGCAAC CTCTTCGCCC GGATGGAGGG GCGCGACCCG
AGCCTTCCGG CCCTCCACTG CGGCTCGCAC CTCGACACGG TGGCCGAGGG CGGCGCCTTC
GACGGGCAGG CGGGCGTGGC CGCGGCGCTG GCCCTCGTCG CGGCCATGCG CGAGGCGGGC
GTCACGCCCG AGGCCGATTT CGTGCTGACC GTGACGCGGG CCGAGGAGAG CGTCTGGTTC
CCCGTCTCCT ATATCGGCTC GCGCGCGGCG CTCGGCCGGC TCCTGCCCGA AGAGCTCGAG
GCGCGCCGCG TCGACACCGG CCGGACGCTG GCCGAGCACA TGCGCGAGCA GGGCTTCGAC
CCCGACGCGC TGATGCGGGC CGAGCCGCCG AAGCCCGCGC GCTTCCTCGA GTTCCATATC
GAGCAGGGCC CCGTCCTCGA CCGGGCGGGC GAGCCCTACG GCATCGTCTC GGCCATCCGC
GGCGGTCTGC GCTATCGCGC GGCAAAGGTG CATGGCACCT GGGCCCATTC CGGCGGTGCG
CCGCGCGCGG GCCGCGCCGA CGCGGTGGTG GCCTTCGCCG ATCTGGTCAT GGCGATGGAC
CGCGCCTGGG AGACGTTCCT GTCGCGCGGC GCGGACCTCA CCGTGACCTT CGGCAAGGTC
GATGCAGCCT CGCCCGCCCA TGCCATGGCC AAGGTGCCGG GCGAGCTCGC CTTCTGCCTC
GACCTGCGCT CCGAGGATGT GACGGTGCTC GAGGCGGCCG ACCGGGTGCT GCGCGAGGAG
ATCCGCCGCA TCGAGACCGA ACGGCCCGGC ATCCGCTTCG ATCTGGGCAC CCAGAGCCGC
AGCCAGCCTG CGCGTCTGTC GCCTGCGATG ATCGACTGGG TGGCGGGCGG CGCCGCGCGC
CGGGGCGATG AGCCGCGCCG GATGCTCTCG GGCGGCGGCC ACGATGCGGC GGCCTTCGCC
AGCGCCGGCT GGGACAGCGT CATGGTCTTC ATCCGCAACT GGAACGGCAG CCATTGCCCC
GACGAGGGGA TGGATCCGGC CGACCTCGCC CGCGCGGTGG AGGCGGTCTT CGCCGCGCTG
TCGGGGAACG GCTCCCCATG A
 
Protein sequence
MTDLPLAEDA PARAIRSTID RVLAEVNALS EGGPGWTRPS YSDLESHAHA LIEAEARALG 
LSVSRDHAGN LFARMEGRDP SLPALHCGSH LDTVAEGGAF DGQAGVAAAL ALVAAMREAG
VTPEADFVLT VTRAEESVWF PVSYIGSRAA LGRLLPEELE ARRVDTGRTL AEHMREQGFD
PDALMRAEPP KPARFLEFHI EQGPVLDRAG EPYGIVSAIR GGLRYRAAKV HGTWAHSGGA
PRAGRADAVV AFADLVMAMD RAWETFLSRG ADLTVTFGKV DAASPAHAMA KVPGELAFCL
DLRSEDVTVL EAADRVLREE IRRIETERPG IRFDLGTQSR SQPARLSPAM IDWVAGGAAR
RGDEPRRMLS GGGHDAAAFA SAGWDSVMVF IRNWNGSHCP DEGMDPADLA RAVEAVFAAL
SGNGSP
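
The sequences above are printed in space-separated blocks of ten, so they need to be cleaned before any computation. The following is a minimal sketch of how one might strip the block formatting, compute a GC fraction, and check that a coding sequence is a whole number of codons ending in a stop codon. The helper names are hypothetical, the stop codons are those of translation table 11, and the example uses only the first two blocks of the gene sequence, so its GC value is not the whole-gene figure.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


def looks_like_complete_cds(seq, stop_codons=("TAA", "TAG", "TGA")):
    """True if seq is a whole number of codons and ends in a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in stop_codons


# First two blocks of the gene sequence above, used only as a short illustration.
first_blocks = clean_sequence("ATGACCGATC TGCCCCTGGC")
print(len(first_blocks), round(gc_fraction(first_blocks), 2))  # 20 0.65
```

Passing the full gene sequence block through `clean_sequence` and then `gc_fraction` would give the value to compare against the GC content field.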