Gene Information |
Locus tag | RPB_3193 |
Symbol | |
ID | 3910994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3651197 |
End bp | 3652693 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637885095 |
Product | inosine 5'-monophosphate dehydrogenase |
Protein accession | YP_486800 |
Protein GI | 86750304 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0516] IMP dehydrogenase/GMP reductase; [COG0517] FOG: CBS domain |
TIGRFAM ID | [TIGR01302] inosine-5'-monophosphate dehydrogenase |
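
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that contributes no amino acid; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 3651197
end_bp = 3652693

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
remainder = gene_length % 3
codons = gene_length // 3

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length)     # 1497, matching "Gene Length"
print(remainder)       # 0
print(protein_length)  # 498, matching "Protein Length"
```

Under these assumptions the reported gene length (1497 bp) and protein length (498 aa) are consistent with the listed coordinates.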
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.291902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.962231 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGTCAT TGAGCGGATC ACGGCAGTTT CAGGAAGCAT ACACGTTCGA CGACGTGTTG CTGAAGCCCG GCCCGTCGGA CGTGCTGCCG TCCGACGTCG ATATTCGCTC CCGGATCACC CGCGCGATTT CGCTCAACAT CCCGATCATC GCCTCGGCGA TGGACACCGT GACCGAGGCG CGGATGGCGA TCGCGATGGC GCAGGCCGGC GGCATCGGCG TGATCCACCG CAATTTCGAT CCCGAGGGCC AGGCCGCGCA AGTCCGCCAG GTCAAAAAGT TCGAATCCGG CATGGTGGTC AATCCGCTGA CCATCGCGCC CGAGGCCAAG CTCGCCGACG CGCTGGCGCT GATGACTCAG TACGGCTTCT CCGGCATCCC GGTGGTGACC GGCGCGCAGG GCGATGGTCC CGGCAAGCTG GTCGGCATCC TCACCAATCG CGACGTCCGC TTCGCCACCG ATCCGGCGCA GAAGGTCTCG GAGCTGATGA CGCACGAGAA CCTCGTCACC GTGCGTGAAG GGGTGAGCCA GGACGAGGCC AAGCGGCTGC TGCATCAGCA CCGCATCGAG AAGCTGCTGG TGGTCGACGA TCAATATCGC TGCGTCGGCC TGATCACCGT CAAGGACATG GAAAAGGCGG TCGCGCATCC GCTGGCGTCG AAGGACGCGC AGGGCCGGCT GCGGGTCGCC GCCGCGACCA CGGTCGGCGA GGGCGGCTAT GAGCGCACCG AGCGACTGAT CGAGGCCGGC GTCGACCTCG TCGTGGTCGA CACCGCGCAC GGCCATTCGG CGCGCGTGCT GGAAGCGGTG ACGCGGATCA AGCGGATCTC CAATGCGGTC CAGGTGATCG CCGGCAATAT CGCCACCCGC GACGGCGCGC AGGCGCTGAT CGATTCCGGC GCCGACGCCA TCAAGGTCGG CATCGGCCCG GGCTCGATCT GCACCACCCG GATCGTCGCC GGCGTCGGCG TGCCGCAGCT CACCGCGATC ATGGACGCGG TCGAGGCGGC CAAGAAGGCC GACATTCCGG TGATCGCCGA TGGCGGCATC AAGTACTCGG GCGACCTCGC CAAGGCGCTC GCCGCCGGCG CCGACATCGC GATGGTCGGC TCGCTGCTCG CCGGCACCGA CGAGACGCCC GGCGAAGTGT TCCTGTGGCA GGGCCGCTCC TACAAGGCCT ATCGCGGCAT GGGCTCGGTC GGCGCGATGG CGCGCGGCTC GGCCGATCGC TACTTCCAGC AGGACATCAA GGACACGCTG AAACTGGTGC CCGAAGGCAT CGAGGGCCAG GTGCCGTACA AGGGCCCGGT CGGCAACGTG ATGCATCAAC TCGCTGGCGG CCTGCGCGCC GCGATGGGCT ATGTCGGCGC GCGGACCCTG ACCGAATTCC ACGACAAGGC CGAGTTTGTC CGCATCACCG GCGCCGGCCT GCGCGAAAGC CACGTCCACG ACGTCACCAT CACCCGCGAG AGCCCGAACT ATCCGGGCGG GGTGTGA
|
Protein sequence | MASLSGSRQF QEAYTFDDVL LKPGPSDVLP SDVDIRSRIT RAISLNIPII ASAMDTVTEA RMAIAMAQAG GIGVIHRNFD PEGQAAQVRQ VKKFESGMVV NPLTIAPEAK LADALALMTQ YGFSGIPVVT GAQGDGPGKL VGILTNRDVR FATDPAQKVS ELMTHENLVT VREGVSQDEA KRLLHQHRIE KLLVVDDQYR CVGLITVKDM EKAVAHPLAS KDAQGRLRVA AATTVGEGGY ERTERLIEAG VDLVVVDTAH GHSARVLEAV TRIKRISNAV QVIAGNIATR DGAQALIDSG ADAIKVGIGP GSICTTRIVA GVGVPQLTAI MDAVEAAKKA DIPVIADGGI KYSGDLAKAL AAGADIAMVG SLLAGTDETP GEVFLWQGRS YKAYRGMGSV GAMARGSADR YFQQDIKDTL KLVPEGIEGQ VPYKGPVGNV MHQLAGGLRA AMGYVGARTL TEFHDKAEFV RITGAGLRES HVHDVTITRE SPNYPGGV
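
The sequences above are displayed in space-separated blocks of ten characters, so the spaces must be stripped before any analysis. The sketch below is a hedged illustration of this using only the first 30 bases of the gene sequence. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is deliberately partial: it holds only the codons that occur in this excerpt, not the full translation table 11.

```python
# First three 10-base blocks of the gene sequence, as displayed above.
excerpt = "ATGGCGTCAT TGAGCGGATC ACGGCAGTTT"

# Partial codon table: only the codons present in the excerpt.
# A real analysis would need the complete translation table 11.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "GCG": "A", "TCA": "S", "TTG": "L", "AGC": "S",
    "GGA": "G", "CGG": "R", "CAG": "Q", "TTT": "F",
}


def clean(seq: str) -> str:
    """Remove display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons using the partial table above."""
    usable = len(seq) - len(seq) % 3
    return "".join(PARTIAL_CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


dna = clean(excerpt)
print(len(dna))          # 30
print(translate(dna))    # MASLSGSRQF
print(gc_fraction(dna))  # GC fraction of the excerpt only, not the whole gene
```

The translated excerpt matches the first ten residues of the protein sequence, which indicates that the gene sequence is shown in its coding orientation even though the gene lies on the minus strand.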