Gene Information |
Locus tag | RPB_3371 |
Symbol | |
ID | 3911173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3855210 |
End bp | 3856808 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637885274 |
Product | putative alpha-isopropylmalate/homocitrate synthase family transferase |
Protein accession | YP_486978 |
Protein GI | 86750482 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein |
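The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (which is not translated). The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the length
    includes one stop codon that is not translated."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3855210, 3856808)
print(length, protein_length_aa(length))  # prints: 1599 532
```

Both results agree with the Gene Length (1599 bp) and Protein Length (532 aa) fields in the record.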
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00710178 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0275974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGGG AACGACTGTA TCTCTACGAC ACCACGCTGC GCGACGGCGC GCAGACCAAC GGCGTCGACT TCACGCTGCA CGACAAGCGG TTGATCGCCG GCCTGCTCGA CGACCTCGGC ATCGACTATG TCGAGGGCGG CTATCCGGGC GCCAATCCGC TCGACACCGA GTTCTTCGCC ACCGAGCAGA AGCTCGAGCG CGCGACCTTC GCCGCCTTCG GCATGACGCG ACGCCCGGGC CGCTCGGCCT CGAACGATCC CGGAGTCGCG CTGCTGCTCG ACGCCAAGGC GGATGCGATC TGCTACGTCG CCAAATCGTC GGAATATCAG GTCCGCGTCG CGCTCGAGAC CACCAACGAC GAGAACATCG CCTCGATCCG CGACAGCGTC GCGATCGCCA GGGAGCGCAG CCGCGAAGTG CTGGTCGATT GCGAGCATTT CTTCGACGGT TACAAGGAGA ACCCGGCGTT CGCGCTGGAC TGCGCCAAGG CGGCCTACGA GTCCGGCGCG CGCTGGGTGG TCTTGTGCGA CACCAATGGC GGCACCATGC CGGACGAGGT CGAGGCGATC GTCGGCGAGG TGGTCAAACA CATCCCCGGC AGCCATGTCG GCATCCACGC CCATAACGAC ACCGAGCAGG CGGTGGCGGT GTCGTTCGCC GCGGTGCGCG CCGGCGCGCG GCAGATCCAG GGCACGCTGA ACGGACTCGG CGAGCGCTGC GGCAACGCCA ATCTGGTGTC GATGATCCCG ACGCTGAAGC TGAAGAAGGA ATTCGCCGAC AAGTTCGAGA TCGGCGTCTC CGACGACAAG CTGGCCACGC TGGTGCAGGT GTCGCGCGCG CTCGACAATA TCCTGGACCG CGCGCCCAAT CCGCACGCGC CTTACGTCGG CGGCAGCGCC TTCGTCACCA AGACGGGCAT CCATGCCTCG GCGGTGATGA AGGACCCGCA CACTTACGAG CACGTCACGC CCGAATCCGT CGGCAACCAC CGCAAGGTGC TTGTGTCGGA TCAGGCCGGC AAGTCGAATG TGGTCGCGGA GCTGTCGCGC ACCACCATCG AGTTCGACCG CAACGATCCG AAACTCGGCC GGCTGATCGA GAAGATGAAG GAGCGCGAGG CGGCGGGCTA CGCCTACGAG TCCGCCAACG CGTCGTTCGA TCTGCTGGCG CGCAGCACGC TCGGGCAGGT GCCGGAATTC TTCCATGTCG AGCAGTTCGA CGTGAATGTC GAGCAGCGCT ACAATTCGCA CGGCCAGCGC GTCACGGTGG CGATGGCGGT GGTCAAGGTC GTGGTTGACG GCGAAACGCT GATCTCGGCC GCCGAGGGCA ATGGCCCGGT CAACGCGCTC GACGTCGCGC TGCGCAAGGA CCTCGGCAAG TATCAGAAAT ACATCGAGGG CCTGAAGCTG GTCGACTACC GCGTCCGTAT CCTCAATGGC GGCACCGAGG CGGTGACGCG CGTGCTGATC GAGAGCGAGG ACGAACTCGG CGAGCGCTGG ACCACGATCG GCGTGTCGCC GAACATCATC GACGCGTCGT TTCAGGCGCT GATGGATTCG GTGGTCTACA AGCTTGTGAA GTCGAAAGCC CCGGTGTGA
|
Protein sequence | MSRERLYLYD TTLRDGAQTN GVDFTLHDKR LIAGLLDDLG IDYVEGGYPG ANPLDTEFFA TEQKLERATF AAFGMTRRPG RSASNDPGVA LLLDAKADAI CYVAKSSEYQ VRVALETTND ENIASIRDSV AIARERSREV LVDCEHFFDG YKENPAFALD CAKAAYESGA RWVVLCDTNG GTMPDEVEAI VGEVVKHIPG SHVGIHAHND TEQAVAVSFA AVRAGARQIQ GTLNGLGERC GNANLVSMIP TLKLKKEFAD KFEIGVSDDK LATLVQVSRA LDNILDRAPN PHAPYVGGSA FVTKTGIHAS AVMKDPHTYE HVTPESVGNH RKVLVSDQAG KSNVVAELSR TTIEFDRNDP KLGRLIEKMK EREAAGYAYE SANASFDLLA RSTLGQVPEF FHVEQFDVNV EQRYNSHGQR VTVAMAVVKV VVDGETLISA AEGNGPVNAL DVALRKDLGK YQKYIEGLKL VDYRVRILNG GTEAVTRVLI ESEDELGERW TTIGVSPNII DASFQALMDS VVYKLVKSKA PV
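The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text might be cleaned up and sanity-checked is shown below. The helper names are hypothetical, and the CDS check is a simplification: it only accepts ATG as a start codon, although translation table 11 also allows alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the spaces between 10-character blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, stop codon at end."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First block of the gene sequence above, used only as a short illustration.
first_block = clean_sequence("ATGAGTCGGG")
print(gc_fraction(first_block))  # prints: 0.6

# Toy sequence, not taken from the record.
print(looks_like_complete_cds(clean_sequence("ATG AAA TGA")))  # prints: True
```

Applying `gc_fraction` to the full cleaned gene sequence would give a value comparable to the GC content field in the record.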