Gene Information |
Locus tag | RPB_0939 |
Symbol | |
ID | 3909793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1085686 |
End bp | 1086873 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637882832 |
Product | nitrile hydratase regulator |
Protein accession | YP_484560 |
Protein GI | 86748064 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component |
TIGRFAM ID | |
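The length fields in this record can be cross-checked from the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the stated 1188 bp) and that the final codon of this unspliced CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 1085686
END_BP = 1086873


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1188
print(protein_length(length_bp))  # 395
```

Both values agree with the Gene Length and Protein Length fields above.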
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.108797 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTTCGA TCGGGGATTG CGGAGGCGGT GGCGCCGGGC TTGCGCCGCC GCTCGGGTTG CTTGAGAGGG CATCTTCGCT CGCCGACGCG GCGTGGGTGG AAGGGCCGGG CGACGGCGCC GGTGCCGCGG CGCGTCGGGC GCGCAACAAA TTGCGCATCG CCAATTTCGT CACTTTTCAG GGCGCGCCGG GAATCTGGGG GCCTGCCTCG AGCAATGCCG CGCTGCTGGC TGCCGCCGAA ATCAACAAAC GCGGCGGGAT CCTCGGCCGC GAAATCGAGT TGGTGATGTG CGACGCCGGC GGTCCGATCG AGGATGTCGC GCGGCGGGTG GCGCAGGCGG TCGATTTCGA CGACGTCGAC ATCGTGATGG GCTCGCATAT CAGTGCGGTC CGCGTCGCGC TGCGCAAGGC GATCCGCGGC CGCGTGCCGT ACATCTACAC GCCGGTCTAT GAAGGCGGCG AACGCACGCC CGGCGTGATG GCGATCGGCG AGACGCCGCG CTGGCAGAGC CGGCCGGCGA TCGACTGGCT CACCCAGGTC AAGAAGGCGC AGCGCTGGTA TCTGATCGGC AGCGACTATG TCTGGCCCTG GCTGTCGCAT CGCGCGGTCA AGACATACAT TAAGAACGCC GGCGGCCAGG TGGTCGGCGA GGAATTCGTG CCGCTCGGCG AGGACGATCA CGAGCGCCAT CTCGCGCGTA TTCGCGCCGC GCGTCCCGAC GTGGTGCTGA TCTCGCTGAT CGGCGCCGAC AGCGTCACCT TCAATCGCGC CTTCGCCGAA TGCGGGCTGG CCGGTGGCAC TCTGCGGCTT GCCGGCGCGA TGGACGAGAC CGTGCTGCTC GGCATCGGCG CCGACAACAC CGAGAACCTG TTCTGCGCCT CGGGCTATTT CGGCTGCCAC GACTCCAGCG CCAACGATCA ATTCCGCGCT GCCTGCCTGA GGGCTTTCGG GCCGACCGCG CCGCCTATCG GATCTGTCGG ACAATCCAAC TACGAAGGCT TGCGATTCCT GGAGGCCGTC GCCGACAAGG CGCAGACGCT GGCCGCGCGT CCATTGCTCT CCGCGGCCAA GAACGTCGTC TACAACGGCG CGCGCGGCGC CGTGACGATC CGCGACGGCC GTGCGCGGAT GACGATCCAT CTCGCCGAAG CCGACGGCCT CGATTTCAAG CTGATCCGCA CGTTCTGA
Protein sequence | MLSIGDCGGG GAGLAPPLGL LERASSLADA AWVEGPGDGA GAAARRARNK LRIANFVTFQ GAPGIWGPAS SNAALLAAAE INKRGGILGR EIELVMCDAG GPIEDVARRV AQAVDFDDVD IVMGSHISAV RVALRKAIRG RVPYIYTPVY EGGERTPGVM AIGETPRWQS RPAIDWLTQV KKAQRWYLIG SDYVWPWLSH RAVKTYIKNA GGQVVGEEFV PLGEDDHERH LARIRAARPD VVLISLIGAD SVTFNRAFAE CGLAGGTLRL AGAMDETVLL GIGADNTENL FCASGYFGCH DSSANDQFRA ACLRAFGPTA PPIGSVGQSN YEGLRFLEAV ADKAQTLAAR PLLSAAKNVV YNGARGAVTI RDGRARMTIH LAEADGLDFK LIRTF
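The relationship between the two sequences can be illustrated with a small translation sketch. Translation table 11 assigns the same amino acids to codons as the standard code but accepts additional start codons such as GTG, which is why the gene begins with GTG while the protein begins with M. The code below is a simplified illustration, not the pipeline that produced this record: the function name is hypothetical, and it assumes the input starts at a valid start codon, so the first codon is always reported as M.

```python
# Codon table laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Assumption: the sequence begins at a valid start codon, so the
    first codon is emitted as M regardless of its usual meaning.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore any trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        aa = "M" if index == 0 else CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGCTTTCGA TCGGGGATTG CGGAGGCGGT"))  # MLSIGDCGGG
```

The printed residues match the first block of the protein sequence in this record.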