Gene Rsph17025_2072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_2072 
Symbol 
ID5082777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp2114538 
End bp2116163 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content70% 
IMG OID640483635 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_001168268 
Protein GI146278109 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.770044 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.178392 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTAAAGG AGCGGCTCTG GATCTACGAC ACGACGCTGC GCGACGGGCA GCAGACACAG 
GGCGTGCAGT TCTCGACCTC GGACAAGCGG CAGATCGCGC TGGCGCTCGA TGCGCTGGGG
GTGGACTACA TCGAAGGCGG CTGGCCGGGG GCAAATCCGA CCGACAGCGA CTTCTTCGCC
CATGCGCCGC AGCTCGGGGC GCGGCTCTCG GCCTTCGGGA TGACCAAGCG CGCGGGCCGC
TCGGCCGAGA ATGACGATGT GCTCGCGGCG GTGCTGGATG CGGGGACAGG CACGGTCTGC
CTCGTCGGCA AGGCGCATGA GTTCCATGTG ACCACGGCGC TCGGCGTGAC GCTGGAGGAG
AACCTCGAGT CGATCCGCGC CTCGGTCGCC CATGTGGTGG GCAAGGGGCG CGAGGCGATC
CTCGATGCCG AGCATTTCTT CGACGGCTAC AAGGCGAACC CGAGGTATGC GCTCGACTGC
CTGAAGGCCG CGCTGGAGGC GGGGGCGCGC TGGGTCGTGC TCTGCGACAC CAACGGCGGC
ACGCTTCCGG CCGAGGTGGG GCGGATCGTG GCCGAGGTGA TCGCGGCGGG CGTGCCGGGG
GACCGGCTCG GCATCCACAC CCACGACGAC ACGGGCACGG CGGTGGCGGC GACGCTGGCG
GCGGTCGATG CGGGCGCGCG GCAGGTGCAG GGCACGCTGA ACGGGCTGGG CGAGCGGTGC
GGCAACGCCA ACCTCACCGC GCTGATCCCG ACGTTCCTGC TCAAGGAGCC CTACGCCAGC
CGGTTCGAGA CCGGCATCTC GCGCGAGGCG CTGGCCGGGA TGGTGCGGAT CAGCCGGATG
CTCGACGACA TCCTGAACCG GGTGCCGCGC CGCGCCTCGG CCTATGTCGG CGCCTCGGCC
TTTGCGCACA AGGCGGGGCT GCATGCCTCG GCGATCCTCA AGGACCCCGC GACCTACGAA
CATATCGACC CCGCGCTGGT GGGCAACGTG CGGGTGATCC CGATGTCGAA CCAGGCGGGC
CAGTCGAACC TGCGCGCCCG CCTCGCCGCC GCCGGGATCG AGGTTCCGGC CGGCGATCCG
CGCCTCGGCC GCATCCTCGA GGTGATCAAG GCGCGCGAGG ATCAGGGTTA TGCCTACGAT
TCCGCCCAGG GCAGCTTCGA GCTGGTGGCG CGGCGGGAAC TGGGGCTCAT GCCCTCGTTC
TTCGAGGTGA AGCGGTATCG CGTGACGGTC GAGCGGCGGC GGGTCGGCGA GGGCACCATG
ACGCTCTCGG AGGCCGTGGT GGTCGTGATC ATCGACGGGC AGCGGGTGCT GTCGGTCTCG
GAGAGCCTGG ACGAGAACGG GACCGAACGC GGCCCCGTCA ACGCGCTGTC GAAGGCGCTG
GCCAAGGATC TGGGGCGCTG GCAATCGGTG ATCGACGACA TGCGGCTTGT CGATTTCAAG
GTGCGGATCA CCCAGGGCGG CACCGAGGCC GTCACGCGCG TCATCATCGA CAGCGAGGAC
GGACAGGGGC GGCGCTGGTC CACCGTCGGC GTCTCGCCCA ACATAGTGGA TGCCTCGTTC
GAGGCGCTGC TGGACGCGAT CAACTGGAAG CTCGTGCGCG ACGCACGGCG CGGGGAGGGA
TCATGA
 
Protein sequence
MVKERLWIYD TTLRDGQQTQ GVQFSTSDKR QIALALDALG VDYIEGGWPG ANPTDSDFFA 
HAPQLGARLS AFGMTKRAGR SAENDDVLAA VLDAGTGTVC LVGKAHEFHV TTALGVTLEE
NLESIRASVA HVVGKGREAI LDAEHFFDGY KANPRYALDC LKAALEAGAR WVVLCDTNGG
TLPAEVGRIV AEVIAAGVPG DRLGIHTHDD TGTAVAATLA AVDAGARQVQ GTLNGLGERC
GNANLTALIP TFLLKEPYAS RFETGISREA LAGMVRISRM LDDILNRVPR RASAYVGASA
FAHKAGLHAS AILKDPATYE HIDPALVGNV RVIPMSNQAG QSNLRARLAA AGIEVPAGDP
RLGRILEVIK AREDQGYAYD SAQGSFELVA RRELGLMPSF FEVKRYRVTV ERRRVGEGTM
TLSEAVVVVI IDGQRVLSVS ESLDENGTER GPVNALSKAL AKDLGRWQSV IDDMRLVDFK
VRITQGGTEA VTRVIIDSED GQGRRWSTVG VSPNIVDASF EALLDAINWK LVRDARRGEG
S