Gene Rsph17029_2381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_2381 
Symbol 
ID4897833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp2517139 
End bp2518554 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content69% 
IMG OID640112978 
Productargininosuccinate lyase 
Protein accessionYP_001044255 
Protein GI126463141 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.746379 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACG CGCCCGACCC CTCCTCCGCC GCCAACGCCA TGTGGGGCGG CCGCTTCGCC 
GCGGGCCCCG ATGCGATCAT GCAGGCGATC AACGCCTCGA TCGGGTTCGA CAAGCGGCTC
TACGCGCAGG ATATCCGCGG CTCGCGGGCC CATGCCGCGA TGCTCGCGGC GCAGGGCATC
CTGACTTCTA GGGATGCCGA GGCCATCGGG GAAGGCCTTC TCACCGTTTT GTCAGAGATC
GAGGCCGGCG GCTTCCCCTT CCGGGTCGAG CTCGAGGACA TCCACATGAA CGTCGAGGCC
CGCCTGAAGG AGCTGATCGG CGAGCCCGCC GGCCGGCTGC ATACGGCGCG CTCGCGCAAC
GATCAGGTGG CGGTCGATTT CCGTCTCTGG GTCCGCGACC AGTGCGATGC GGCCATCTCG
GGGATCGAGG CGCTGATGCG GGCCTTCCTC GCGCAGGCCG AGGCAGGAGC CGACTGGGTG
ATGCCGGGCT TCACGCATCT GCAGACCGCG CAGCCGGTCA CCTGGGGCCA TCACATGCTG
GCCTATGTCG AGATGCTGGC GCGCGACCGC TCGCGCTTCA AAGATGCCCG CGCGCGCATG
AACGAATGCC CGCTGGGCGC CGCGGCGCTG GCGGGCACGG GCTTCCCCAT CGACCGGCAC
ATGACGGCGG CGGCGCTCGG CTTCGACCGG CCCACCGCCA ACAGCCTCGA TTCGGTCTCG
GACCGCGACT TCGCGCTCGA GTTCCTGTCG GCCTCGGCGA TCTGCGCCCT GCATCTGTCG
CGCTTCGCGG AAGAGCTGGT GATCTGGTCC TCGGCCCAGT TCCGCTTCGT GCGCCTGTCG
GACCGCTGGA CCACCGGCTC GTCGATCATG CCGCAGAAGA AGAACCCCGA TGCGGCGGAA
CTGCTGCGGG CCAAGATGGG CAGGGTGCTG GGCGCGGCCG TCGCGCTCTT CACCGTGATG
AAGGGCCTGC CGCTGACCTA TTCCAAGGAC ATGCAGGAGG ACAAGGAGCA GGTCTTCGAC
GCGGCCGACA CGCTGATGCT GGGGCTTGCC GCCATGACCG GCATGGTGGG CGACATGCAG
GCCAACCGCG AGAGCCTCGC CGCCGCCGCG GCCTCGGGCT TCTCGACGGC GACCGATCTG
GCGGACTGGC TGGTGCGCGA GCTTAACCTG CCGTTCCGCG ACGCCCATCA TGTGACGGGC
ACGCTCGTGG CCCGCGCCGA GGCCCGGGGC TGCGACCTGC CCGATCTCTC GCTCGCCGAG
ATGCAGGAGG TGCATCCGGG CATCCGCGAG GACGTGTTCG CCGTCCTCGG CGTCGAGAAT
TCCGTGCGCA GCCGCACCTC CTACGGCGGC ACCGCACCCG ACAATGTCCG CGCGCAGGCC
GCGCGCTGGA AAGAGCTGCT GGGAGATGCC GCATGA
 
Protein sequence
MSDAPDPSSA ANAMWGGRFA AGPDAIMQAI NASIGFDKRL YAQDIRGSRA HAAMLAAQGI 
LTSRDAEAIG EGLLTVLSEI EAGGFPFRVE LEDIHMNVEA RLKELIGEPA GRLHTARSRN
DQVAVDFRLW VRDQCDAAIS GIEALMRAFL AQAEAGADWV MPGFTHLQTA QPVTWGHHML
AYVEMLARDR SRFKDARARM NECPLGAAAL AGTGFPIDRH MTAAALGFDR PTANSLDSVS
DRDFALEFLS ASAICALHLS RFAEELVIWS SAQFRFVRLS DRWTTGSSIM PQKKNPDAAE
LLRAKMGRVL GAAVALFTVM KGLPLTYSKD MQEDKEQVFD AADTLMLGLA AMTGMVGDMQ
ANRESLAAAA ASGFSTATDL ADWLVRELNL PFRDAHHVTG TLVARAEARG CDLPDLSLAE
MQEVHPGIRE DVFAVLGVEN SVRSRTSYGG TAPDNVRAQA ARWKELLGDA A