Gene Rsph17029_2182 details

Gene Information

Locus tag: Rsph17029_2182
Symbol:
ID: 4896150
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 2312418
End bp: 2313584
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 72%
IMG OID: 640112776
Product: aminotransferase, class V
Protein accession: YP_001044057
Protein GI: 126462943
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR03402] cysteine desulfurase NifS
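
The length fields above can be cross-checked from the coordinates. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that contributes no amino acid; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2312418
end_bp = 2313584

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon, so it is not counted
# in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1167
print(protein_length)  # 388
```

Both results agree with the Gene Length and Protein Length reported in the record.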


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.186921
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATGGAGC GGGTCTATCT CGACAACAAT GCCACGACCC GCCTCGCGCC CGAGGCGCTT 
CAGGCCATGC TGCCCTTCCT GACCGAGGAG TTCGGCAATC CCTCGTCGCT GCACGGGCAG
GGGCGCGCGC CCGCCCGCGC CCTGATGGCC GCGCGGCGCG CGGTGCTGGA GCTGATCGGC
GCCGAGGCCG ACAGCGAGAT CCTCTTCACC TCCGGCGGCA CCGAGGCCGA CACGACGGCG
ATCCGCTCGG CGCTGGCGGC GGATCCGTCG CGGCGCGAGA TCGTGACCTC GACGGTCGAA
CATGCGGCCG TCCTCGCGCT CTGCGACCAT CTGGAGCGGC AGGAAGGGGT GACGGTGCAC
CGCATCCCGG TGGACGGCGA CGGCCGGCTC GACATCGAGG CCTATCGCGC GGCCCTCTCG
CCCCGGGTGG CGCTGGTCTC GCTCATGTGG GCCAACAACG AGACCGGCAC GGTCTTTCCC
ATCGAGGGGC TGGCCGAGCT TGCGCATCGG GCGGGGGCGC TCTTTCACAC CGACGCGGTG
CAGGCGGTGG GCAAGGTGCC CATAGGGCTG CGCGGGACCG AGATCGACAT GCTGTCGCTC
AGCGCGCACA AGTTCCACGG CCCGAAGGGG GTGGGGGCGC TCTGGCTGCG CAAGGGCGTG
CCGTTCCAGC CCCTGATCCG GGGCGGCAGG CAGCAGCGCG GGCATCGCGC GGGCACCGAG
AACATTCCCG GCATCGTGGG CCTCGGCCGC GCGGCGGAGC TGGCGCTGGG GCTGGATCAC
GGGGCGGTGC GGCTCCTGCG CGACCGGCTG GAGCAGGGGA TCCTCGCCCG TGTGCCCAAG
GCGCGCGTTC TGGGCGATCC GCTCGACCGG CTGCCCAACA CCTCCTGTGT GGCCTTCGAC
TTCGCCGAGG GCGAGGCGAT CGTGATGCTT CTCGACCGGG CGGGGATCTG CGTCTCGTCG
GGCGCGGCCT GCGCTTCTGG CGCGATGGAG CCGAGCCATG TGATCCGCGC CATGAAGGTG
CCCTTCACCG CCGCGCATGG CGCGATCCGC TTCTCGCTCT CGCACTGGAC GACCGCGGCC
GAGATCGACC GCCTGCTCGA GGTGCTGCCG CCCATCGTCG ACCAGCTGCG CGCGCTCTCG
CCCTTCGGGG CCGAGGAGGT GAAGTGA
 
Protein sequence
MMERVYLDNN ATTRLAPEAL QAMLPFLTEE FGNPSSLHGQ GRAPARALMA ARRAVLELIG 
AEADSEILFT SGGTEADTTA IRSALAADPS RREIVTSTVE HAAVLALCDH LERQEGVTVH
RIPVDGDGRL DIEAYRAALS PRVALVSLMW ANNETGTVFP IEGLAELAHR AGALFHTDAV
QAVGKVPIGL RGTEIDMLSL SAHKFHGPKG VGALWLRKGV PFQPLIRGGR QQRGHRAGTE
NIPGIVGLGR AAELALGLDH GAVRLLRDRL EQGILARVPK ARVLGDPLDR LPNTSCVAFD
FAEGEAIVML LDRAGICVSS GAACASGAME PSHVIRAMKV PFTAAHGAIR FSLSHWTTAA
EIDRLLEVLP PIVDQLRALS PFGAEEVK
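
The relationship between the gene and protein sequences can be illustrated by translating codons directly. This is a rough sketch, not the database's own pipeline: it builds a codon lookup in the NCBI base order (T, C, A, G), whose amino acid assignments are the same for translation table 11 as for the standard table, and applies it to the first and last blocks of the gene sequence copied from above. The helper name `translate` is hypothetical, and the sketch assumes the gene starts with ATG, so the alternative start codons of table 11 are not handled.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, with the
# third position varying fastest. Table 11 uses the same amino acid
# assignments as the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and final 27 nt of the gene sequence, copied from the record.
first_block = "ATGATGGAGC GGGTCTATCT CGACAACAAT"
last_block = "CCCTTCGGGG CCGAGGAGGT GAAGTGA"

print(translate(first_block))  # MMERVYLDNN
print(translate(last_block))   # PFGAEEVK
```

The two outputs match the first ten and last eight residues of the protein sequence listed above.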