Gene Rsph17029_0104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0104 
Symbol 
ID4895967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp119408 
End bp120682 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content70% 
IMG OID640110687 
Productallantoate amidohydrolase 
Protein accessionYP_001041996 
Protein GI126460882 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0189735 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAGA CCCAGAGCCT TTCCCGCATC GACGCGGATC TTCTGAACGC GCTTATGGAG 
AAAGTGTCCG AGTTCGGCTC GACCGGCGAC GGCGGCATCG ACCGCCCGGC GCTGACCGAC
GCCCACCGGG ACGCGCGCGA CTGGTTCCGG TCCGAACTCG AGGCGCGCGG CTATACCGTG
CTCGTCGACG AGATCGGCAA TCTCTTCGGG CGGATCGATC TGGCGGGGCC CGGGGCGCCG
CTGGTGATGA TCGGCTCGCA TCTCGACAGC CAGCCCCGGG GCGGGCGGTT CGACGGCGCC
TATGGGGTGA TCGCCGCGCT CGCCGCCATC GAGACCTTCC GCCGCGACGG CGGCACGCCG
CGCTGCAACT ATGTCATCGC CGACTGGATG AACGAGGAGG GGGCGCGGTT CCAGCCGAGC
CTCCTCGGCT CGTCGGTCTT CGCGGGCCTC ATCGAGCTCG ACTGGGCGCT GGGGCGGCGC
GACCGTGACG GGCGGAGCGT GGGCGAGGAA CTGGTCCGCA CCGGCTACAA GGGCACCGAC
GCGGCGCCGC GCCCGGATCT CTATCTCGAA CTCCATATCG AGGGCGACGC CAAGATGGAG
ACGGCGGGCG CCCGGATCGC CCCGTTCCTG CGGCACTGGG GCGCGCTGAA GGTCCGGATC
GAGGTCACGG GCGAGCAGAA CCATACCGGC CCCACGCCGA TGGAAGACCG CAAGGATGCG
GTTCTGGGCG CGGCCTATAT CATCGCCGAG GTGCGGCGGC TGGCGGATGT GGCCGAGGAT
ACGCTCTTCA CCTCGGTGGC GCGGGTCGAC ATCTCGCCCA ATTCGCCCAA CATCGTCCCG
GGCAAGGCGG TCCTGTTCTG CGAGCTCCGC GCGCCCGAAC CGGCGATGCT CGACTGGTCG
GAGGCAAGCC TCCGCGCGGC CCTGCCGGAG CTTGCCGCCA AGGCCGCCAC CCGTGCCGAG
ATCGTCTCGA TCGACCGCCG ACCGGCCGGG AAGTTCGACC CGCGCCTCGC CCGGCTGACC
GAACGCGTGG CAGACGACTT CGGCCTGCCC CGGATGCAGC TCGACACGAT CGGCGGCCAT
GACGCGGTGG CGCTGAACGC GATCCTGCCG AGCATCGTCT TCGCCGTGCC GTCGGTCGGT
GGCGTGATCC ACCGCAACGA CGAATATACC AGCCCCGAGG ATCTGGCGGC GGGCGGCGAC
GTGCTGACCG ACATGGTCCG CCGCATCGAC CGCGCGGGCG CCGATCTCGA CCTCGCGCTC
GGGGCGAATG CATGA
 
Protein sequence
MSETQSLSRI DADLLNALME KVSEFGSTGD GGIDRPALTD AHRDARDWFR SELEARGYTV 
LVDEIGNLFG RIDLAGPGAP LVMIGSHLDS QPRGGRFDGA YGVIAALAAI ETFRRDGGTP
RCNYVIADWM NEEGARFQPS LLGSSVFAGL IELDWALGRR DRDGRSVGEE LVRTGYKGTD
AAPRPDLYLE LHIEGDAKME TAGARIAPFL RHWGALKVRI EVTGEQNHTG PTPMEDRKDA
VLGAAYIIAE VRRLADVAED TLFTSVARVD ISPNSPNIVP GKAVLFCELR APEPAMLDWS
EASLRAALPE LAAKAATRAE IVSIDRRPAG KFDPRLARLT ERVADDFGLP RMQLDTIGGH
DAVALNAILP SIVFAVPSVG GVIHRNDEYT SPEDLAAGGD VLTDMVRRID RAGADLDLAL
GANA