Gene Information |
Locus tag | Rsph17029_2181 |
Symbol | |
ID | 4896848 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009049 |
Strand | - |
Start bp | 2311243 |
End bp | 2312418 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 640112775 |
Product | homocitrate synthase |
Protein accession | YP_001044056 |
Protein GI | 126462942 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR02660] homocitrate synthase NifV |
|
|
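The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2311243
END_BP = 2312418
GENE_LENGTH_BP = 1176
PROTEIN_LENGTH_AA = 391


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in an unspliced CDS, minus one for the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1176
print(expected_protein_length(GENE_LENGTH_BP))  # 391
```

Under these assumptions, both computed values agree with the Gene Length and Protein Length fields.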
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.703624 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.187711 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCGCC AGCAGCCCCG GGCGAGCTTC CTGCCCGAAA GCCCGCTCGC CCCCGTGGCC CTCTGCGACA CGACGCTGCG CGACGGAGAG CAGACGGCGG GCGTGGCCTT CACCCGCGCC GAGAAGCGGG CCATCGCCGA GGCGCTGCAG GCCGCAGGCG TGGCCGAAGT CGAGGTGGGC GTGCCCGCCA TGGGCGAGGA GGAGCGGGCC GACATCCGGG CGGTGGCGGC GGTGCTGAAG ACGGCGGCCC CCGTTGTCTG GTGCCGCCTG CGCGCCGAGG ATCTGGCGGC CGCGCAGCGC ACGGGCGTCG TGCGGCTCCA TATCGGCGTC CCCGTCTCCG AGCGCCAGAT CAGTGCCAAG CTCGGCAAGG ACGCGGCCTG GGTGCGCGAC AAGGTCGAGA AGCTCGTGCG CGCCGCCTCC TGGGCCGGGC ACAAGGTGTC GGTCGGGGCC GAGGATGCCT CGCGCGCCGA TCCGTTCTTT CTGGCCGAGA TCGCCCATGT CGCGGCCGAG GCGGGCGCGA TCCGCTTCCG CATCTCGGAC ACGCTGGGCG TGCTCGACCC GTTCGCTGCG CACGAGCTGG TGGGCCGCGT CGTCACGCGC TGCCCGCTGC CGGTGGAGTT CCACGGCCAC AACGATCTGG GCATGGCCAC GGCCAACAGC CTCGCCGCCG CGCGCGCCGG GGCCTCGCAC CTGTCGGTCA CGGTGAACGG GCTGGGCGAG CGGGCGGGCA ATGCCGCGCT CGAGGAGGTG GCGGCCGCGC TCGAAGCGGC GGGCCGCGCC ACCGGCGTCG CGCTGGGCCA GCTCTGCGCC CTCTCGGAGC TGGTGGCCCG CGCCTCGGGC CGTCCTCTCT CGCCGCAGAA GCCCATCGTG GGCGAGGGGG TCTTCACCCA TGAATGCGGC ATCCATGTCG ACGGGCTGAT GAAGGACCGC GCCACCTACG AGAGCGCGGA CCTGCGCCCC GAGCGGTTCG GCCGCAGCCA CCGCATCGCC ATCGGCAAGC ATTCCTCGGC CGCCGGGCTC GCCCGGGCGC TGGCCGAGGC GGGGCTTCCC GCCGACGCGG CGACGCTCGC GGCCCTGATG CCCGCGCTGC GCGACTGGGC GGCCACCGCC AAGCGCGCGG CCGCCCCCGA GGATCTTGCG GCGCTCCTTG CCGCGCAAAC CGAAACCGCC CGTTGA
|
Protein sequence | MSRQQPRASF LPESPLAPVA LCDTTLRDGE QTAGVAFTRA EKRAIAEALQ AAGVAEVEVG VPAMGEEERA DIRAVAAVLK TAAPVVWCRL RAEDLAAAQR TGVVRLHIGV PVSERQISAK LGKDAAWVRD KVEKLVRAAS WAGHKVSVGA EDASRADPFF LAEIAHVAAE AGAIRFRISD TLGVLDPFAA HELVGRVVTR CPLPVEFHGH NDLGMATANS LAAARAGASH LSVTVNGLGE RAGNAALEEV AAALEAAGRA TGVALGQLCA LSELVARASG RPLSPQKPIV GEGVFTHECG IHVDGLMKDR ATYESADLRP ERFGRSHRIA IGKHSSAAGL ARALAEAGLP ADAATLAALM PALRDWAATA KRAAAPEDLA ALLAAQTETA R
|
| |
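The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch and not the database's own method. It assumes the listed gene sequence is already the coding strand (5' to 3'), and it relies on table 11 sharing its amino acid assignments with the standard genetic code (the two differ in which start codons are permitted). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional T, C, A, G base order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that separate 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons are not handled; this record starts with ATG.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGTCGCGCC AGCAGCCCCG GGCGAGCTTC"
print(translate(first_blocks))  # MSRQQPRASF
```

Passing the full Gene sequence field to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields.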