Gene RPB_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3371 
Symbol 
ID3911173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3855210 
End bp3856808 
Gene Length1599 bp 
Protein Length532 aa 
Translation table11 
GC content65% 
IMG OID637885274 
Productputative alpha-isopropylmalate/homocitrate synthase family transferase 
Protein accessionYP_486978 
Protein GI86750482 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00977] 2-isopropylmalate synthase/homocitrate synthase family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00710178 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0275974 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGGG AACGACTGTA TCTCTACGAC ACCACGCTGC GCGACGGCGC GCAGACCAAC 
GGCGTCGACT TCACGCTGCA CGACAAGCGG TTGATCGCCG GCCTGCTCGA CGACCTCGGC
ATCGACTATG TCGAGGGCGG CTATCCGGGC GCCAATCCGC TCGACACCGA GTTCTTCGCC
ACCGAGCAGA AGCTCGAGCG CGCGACCTTC GCCGCCTTCG GCATGACGCG ACGCCCGGGC
CGCTCGGCCT CGAACGATCC CGGAGTCGCG CTGCTGCTCG ACGCCAAGGC GGATGCGATC
TGCTACGTCG CCAAATCGTC GGAATATCAG GTCCGCGTCG CGCTCGAGAC CACCAACGAC
GAGAACATCG CCTCGATCCG CGACAGCGTC GCGATCGCCA GGGAGCGCAG CCGCGAAGTG
CTGGTCGATT GCGAGCATTT CTTCGACGGT TACAAGGAGA ACCCGGCGTT CGCGCTGGAC
TGCGCCAAGG CGGCCTACGA GTCCGGCGCG CGCTGGGTGG TCTTGTGCGA CACCAATGGC
GGCACCATGC CGGACGAGGT CGAGGCGATC GTCGGCGAGG TGGTCAAACA CATCCCCGGC
AGCCATGTCG GCATCCACGC CCATAACGAC ACCGAGCAGG CGGTGGCGGT GTCGTTCGCC
GCGGTGCGCG CCGGCGCGCG GCAGATCCAG GGCACGCTGA ACGGACTCGG CGAGCGCTGC
GGCAACGCCA ATCTGGTGTC GATGATCCCG ACGCTGAAGC TGAAGAAGGA ATTCGCCGAC
AAGTTCGAGA TCGGCGTCTC CGACGACAAG CTGGCCACGC TGGTGCAGGT GTCGCGCGCG
CTCGACAATA TCCTGGACCG CGCGCCCAAT CCGCACGCGC CTTACGTCGG CGGCAGCGCC
TTCGTCACCA AGACGGGCAT CCATGCCTCG GCGGTGATGA AGGACCCGCA CACTTACGAG
CACGTCACGC CCGAATCCGT CGGCAACCAC CGCAAGGTGC TTGTGTCGGA TCAGGCCGGC
AAGTCGAATG TGGTCGCGGA GCTGTCGCGC ACCACCATCG AGTTCGACCG CAACGATCCG
AAACTCGGCC GGCTGATCGA GAAGATGAAG GAGCGCGAGG CGGCGGGCTA CGCCTACGAG
TCCGCCAACG CGTCGTTCGA TCTGCTGGCG CGCAGCACGC TCGGGCAGGT GCCGGAATTC
TTCCATGTCG AGCAGTTCGA CGTGAATGTC GAGCAGCGCT ACAATTCGCA CGGCCAGCGC
GTCACGGTGG CGATGGCGGT GGTCAAGGTC GTGGTTGACG GCGAAACGCT GATCTCGGCC
GCCGAGGGCA ATGGCCCGGT CAACGCGCTC GACGTCGCGC TGCGCAAGGA CCTCGGCAAG
TATCAGAAAT ACATCGAGGG CCTGAAGCTG GTCGACTACC GCGTCCGTAT CCTCAATGGC
GGCACCGAGG CGGTGACGCG CGTGCTGATC GAGAGCGAGG ACGAACTCGG CGAGCGCTGG
ACCACGATCG GCGTGTCGCC GAACATCATC GACGCGTCGT TTCAGGCGCT GATGGATTCG
GTGGTCTACA AGCTTGTGAA GTCGAAAGCC CCGGTGTGA
 
Protein sequence
MSRERLYLYD TTLRDGAQTN GVDFTLHDKR LIAGLLDDLG IDYVEGGYPG ANPLDTEFFA 
TEQKLERATF AAFGMTRRPG RSASNDPGVA LLLDAKADAI CYVAKSSEYQ VRVALETTND
ENIASIRDSV AIARERSREV LVDCEHFFDG YKENPAFALD CAKAAYESGA RWVVLCDTNG
GTMPDEVEAI VGEVVKHIPG SHVGIHAHND TEQAVAVSFA AVRAGARQIQ GTLNGLGERC
GNANLVSMIP TLKLKKEFAD KFEIGVSDDK LATLVQVSRA LDNILDRAPN PHAPYVGGSA
FVTKTGIHAS AVMKDPHTYE HVTPESVGNH RKVLVSDQAG KSNVVAELSR TTIEFDRNDP
KLGRLIEKMK EREAAGYAYE SANASFDLLA RSTLGQVPEF FHVEQFDVNV EQRYNSHGQR
VTVAMAVVKV VVDGETLISA AEGNGPVNAL DVALRKDLGK YQKYIEGLKL VDYRVRILNG
GTEAVTRVLI ESEDELGERW TTIGVSPNII DASFQALMDS VVYKLVKSKA PV