Gene Rsph17029_1654 details


Gene Information

Locus tag: Rsph17029_1654
Symbol:
ID: 4895631
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009049
Strand:
Start bp: 1745867
End bp: 1747216
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 69%
IMG OID: 640112247
Product: SAF domain-containing protein
Protein accession: YP_001043536
Protein GI: 126462422
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4091] Predicted homoserine dehydrogenase
TIGRFAM ID:
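
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes one terminal stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming a single stop codon ends the CDS."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(1745867, 1747216)
print(length)                  # 1350
print(protein_length(length))  # 449
```

Under these assumptions both results agree with the reported Gene Length (1350 bp) and Protein Length (449 aa).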


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.448781
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCTCG TCGACAAGGC GCTCGCGCGC CGCGCGGCCG AGGGACGGCC GATCCGGGTG 
GGGATGATCG GCGCGGGCTT CATGGGTTCG GGCATCGCGC TGCAGATCGC GAAGTCGGTG
CCGGGGATGC AGCTCGTGGC CATCGCCGCG CGGCGGACCG AGCAGGCGGC GGCGGCCTTC
GAGGCGAGCC GCACCGGCGA GACGGTCCGG CATGTGGACA GCCAGGCCGA GCTCGAGGCC
GCCATCGTGG CGGGCGAGCC CGCCGTGACG GCCGATCCCT CGCTCATCGG CTGCGCCCGC
GGGATCGAGG CCATCATCGA GGTCACGGGC TCGATGGATT ACGCGCTTCA GGCGGTCGAA
CCGGCCATTG CCACCGGCAA GCATGTGATC CTGATGAACG CCGAGCTCGA CGGCACCATC
GGCCCGCTGC TGAAGAAGCG CGCCGATGCG GCCGGGGTGG TGCTCACCAA CTGCGACGGC
GACCAGCCGG GCGTGCAGAT GAACCTGATC CGCTTCGTGA AGGGGATCGG GGTCAAGCCG
GTGCTCTCGG GCAACATCAA GGCGCTGCAG GACGAATATC GCACACCCGA GACCCAGCGC
GGCTTTGCCG AGAAATGGGG CCAGAACGTC CATATGGTGA CCTCCTTCGC GGATGGCACC
AAGATCTCCT ACGAGCAGGC CATCGTGGCC AACGGCACGG GGATGCGGGT CGCGAAGCGC
GGGATGATCG GGATCGATCC GACGAACAAG AACCCGACGC TGCCGCTGCG CCCGCTCGAG
GACTATGTGG CGATGTTCGC GCCGCATCTC GATCCCGAGG GGCCGGGCAT CGTCGACTAT
ATCGTCGGCG CCCGACCCGG GCCGGGGGTG TTCGTGATCG GCACTCACGA CGATCCGCGG
CAGAAGCACT ACCTCAACCT CTACAAGCTG GGCGAGGGGC CCTATTACCT CTTCTACACG
CCCTATCACC TCTGCCATTT CGAGGTGCCG ATGACGGTGG CGCGGGCGGT GCTGTTCGGC
GATGCGGCGC TGGCGCCGCT GGGCGCGCCT CAGGTGGGCG TGGTGGCGGT GGCCAAGCGC
GACCTGAAGG CGGGCGAGGT GCTCGACGGG ATCGGCGGCT TCACCGCCTA CGGCCAGTGC
GAAAACATGG AGACCTTCGT CTCGGAGCAG CTTCTGCCGA TGGGGCTGGC GGAGGGCGGC
GTGCTGCTGC GGGACATTCC CCGGGATCAG GCGCTGACCT TCGCGGATGT GCGGCTGCCC
GTCGGCCGCC GCATCGACGC GCTCTATGAA GAGATGGAGC GCGAGTTCGG CCTCGCGCGT
CCGGCCGAGG CCCATGCCCT GACGGTCTGA
 
Protein sequence
MILVDKALAR RAAEGRPIRV GMIGAGFMGS GIALQIAKSV PGMQLVAIAA RRTEQAAAAF 
EASRTGETVR HVDSQAELEA AIVAGEPAVT ADPSLIGCAR GIEAIIEVTG SMDYALQAVE
PAIATGKHVI LMNAELDGTI GPLLKKRADA AGVVLTNCDG DQPGVQMNLI RFVKGIGVKP
VLSGNIKALQ DEYRTPETQR GFAEKWGQNV HMVTSFADGT KISYEQAIVA NGTGMRVAKR
GMIGIDPTNK NPTLPLRPLE DYVAMFAPHL DPEGPGIVDY IVGARPGPGV FVIGTHDDPR
QKHYLNLYKL GEGPYYLFYT PYHLCHFEVP MTVARAVLFG DAALAPLGAP QVGVVAVAKR
DLKAGEVLDG IGGFTAYGQC ENMETFVSEQ LLPMGLAEGG VLLRDIPRDQ ALTFADVRLP
VGRRIDALYE EMEREFGLAR PAEAHALTV
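
The gene and protein sequences above can be compared programmatically. The following is a minimal sketch, not the database's own method: it strips the 10-base spacing, computes GC content, and translates the reading frame. It uses the standard codon assignments, which translation table 11 shares for amino acids (the tables differ only in permitted start codons). All helper names are hypothetical.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks used for display; uppercase the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGATCCTCG TCGACAAGGC GCTCGCGCGC"))  # MILVDKALAR
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and the GC content field in the record.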