Gene Information |
Locus tag | RPB_4057 |
Symbol | |
ID | 3911864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4627321 |
End bp | 4629120 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637885961 |
Product | allophanate hydrolase |
Protein accession | YP_487661 |
Protein GI | 86751165 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0154] Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit and related amidases |
TIGRFAM ID | [TIGR02713] allophanate hydrolase |
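As a consistency check on the coordinates above, the following minimal Python sketch recomputes the gene and protein lengths from the start and end positions. It assumes that the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; neither convention is stated in this record, and the function names are hypothetical.

```python
# Values copied from the Gene Information rows of this record.
start_bp = 4627321
end_bp = 4629120
reported_gene_length_bp = 1800
reported_protein_length_aa = 599


def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def encoded_residues(length_bp: int) -> int:
    """Number of amino acids encoded, ASSUMING one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return length_bp // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = encoded_residues(gene_length)

# Compare the recomputed values with the ones reported in the record.
print(gene_length == reported_gene_length_bp)
print(protein_length == reported_protein_length_aa)
```

Under these assumptions the 1800 bp span corresponds to 600 codons, of which the last is the stop codon, leaving 599 residues.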
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTGAAA CCATCGCCGA CATTCTCGCC GCGCATCGCG CCGGCACCAC GACGCCCGCG CAGACCATCG CGCGCTGCTA TCAGCGCATC CGCGCTCATG CCGATCCCGC GCTGTTCATC ACGCTGCGCG ACGAGGCGGA CGCCGTGGCC GAAGCCGTGG CGCTCGCCGC CCGTGACCCG TCGCTGCCGC TCTACGGCGT GCCGGTCGCC GTCAAGGACA ATATCGACGT CGCCGGCCTG CCGACCACCG CGGCCTGTCC GGCTTTCGCG TATCAGCCGG CGCAGGATTC CACCGCGGTT GCGAAGCTGC GCGCCGCAGG CGCGATCATC ATCGGCAAGA CCAATCTCGA TCAGTTCGCC ACCGGCCTGG TCGGCGTGCG CTCGCCCTAC GGCATTCCGC GCAACGCGAT GCGCGCCGAT CTGGTGCCGG GCGGCTCGAG TTCGGGTTCG GCGGTCGCGG TCGGCGCCGG CCTGGTGCCG CTGTCCCTCG GCACCGACAC CGCCGGCTCC GGCCGCGTCC CGGCGATGCT CAACAACATC GTCGGGCTGA AGCCGAGCCT CGGCCTGATC TCGACCACCG GCCTGGTGCC GGCGTGCCGC ACGCTGGATT GCATCTCGGT GTTCGCGCTG ACCGTGGACG ACGCGATGAT CGCGCTGCGG GTGATGGGCA CCCCCGACGC CACCGATCCG TATTCGCGCG CTCGGCCGAT CGCGCCGATG TCGGCGATGC CCGACAGGCC ACGGCTCGGC GTACCGCGGC CCGATCAGTT GCAGTTTTTC GGCGATCAGC AATCCGAACA GGCCTATGCC GACGCGCTGC AACGCTGGAC GTCGCTCGGC GCCGAACTGA TCGAGATCGA TGTCGCGCCC TTATACGAGA CCGCGCGGCT GCTCTATGAC GGCCCGTGGG TCGCCGAGCG CTATCTCGCG ATCCGCGAGC TGATCGACGC GACCCCCGAC GCGCTGCATC CGGTGACGCG GCAGATCACG CTCGGCGGCA AGGCGATCAG CGCCGCCGAC ACATTCGCAG CGCTGTATCG ACTGCAGGCG TTGCGCAAGC TCGCGGAGCC CGCCTTCGCG GCGATCGACG CGCTGGTGCT GCCGACGGCG CCGACCGCCT ACACGGTCGA ACAGGTGCTG GCCGATCCGG TCACGCTCAA CAGCCGGCTC GGCACCTACA CCAATTTCGT CAATCTGCTC GACCTTTGCG GGCTCGCGAT CCCGGCGTCG ATCCGCGCCG ACGGCATCCC GTTCGGCGTC ACGCTGCTGG CGCCGGGCGG CCGCGATGCC GAGCTCGCCA GCCTCGGCCG CATGTTCCAC GCCGACACCG CGCTGCCGAT GGGCGCGACG GGCGTCGCCC TGCCGCCGCT GGCGGCGCTG GATGCGGACG CCGGCACGGA CCACATTCAG ATCGCGGTGG TCGGCGCGCA TCTGTCCGGC ATGGCGCTGA ACGGCGAGCT GACGTCGCTG GACGGCCGAC TGTTGCGCGC GACCGCGACG GCGCCGGACT ACAAGCTCTA TGCGCTGAAC GGCACGGTGC CGCCGAAGCC CGGCATGCTG CGCGTCGCAG CAGGCGCAGG CGCGGCGATC GCGCTGGAAA TCTGGTCGCT GTCGCCGGCC GCGTTCGGCC GCTTCGTCGC TGCGATCCCG CAGCCCTTGT CGATCGGCAC CCTGAAGCTG GCGGACGGCG CGCTGGTCAA AGGCTTCCTG GTCGAGCCCG CCGCGCTCGA GGTCGCCCGC GACATCACCC ATTTCGGCGG CTGGCGGGCC TATATGGCGG AGCTGGCGAA AGCGGGGTGA
|
Protein sequence | MTETIADILA AHRAGTTTPA QTIARCYQRI RAHADPALFI TLRDEADAVA EAVALAARDP SLPLYGVPVA VKDNIDVAGL PTTAACPAFA YQPAQDSTAV AKLRAAGAII IGKTNLDQFA TGLVGVRSPY GIPRNAMRAD LVPGGSSSGS AVAVGAGLVP LSLGTDTAGS GRVPAMLNNI VGLKPSLGLI STTGLVPACR TLDCISVFAL TVDDAMIALR VMGTPDATDP YSRARPIAPM SAMPDRPRLG VPRPDQLQFF GDQQSEQAYA DALQRWTSLG AELIEIDVAP LYETARLLYD GPWVAERYLA IRELIDATPD ALHPVTRQIT LGGKAISAAD TFAALYRLQA LRKLAEPAFA AIDALVLPTA PTAYTVEQVL ADPVTLNSRL GTYTNFVNLL DLCGLAIPAS IRADGIPFGV TLLAPGGRDA ELASLGRMFH ADTALPMGAT GVALPPLAAL DADAGTDHIQ IAVVGAHLSG MALNGELTSL DGRLLRATAT APDYKLYALN GTVPPKPGML RVAAGAGAAI ALEIWSLSPA AFGRFVAAIP QPLSIGTLKL ADGALVKGFL VEPAALEVAR DITHFGGWRA YMAELAKAG
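The sequences above are displayed in blocks of ten residues separated by spaces. A minimal Python sketch, assuming the spaces are display formatting only, shows how such a string could be cleaned and how a GC fraction like the one reported under GC content could be computed. The helper names are hypothetical, and only the first three blocks of the gene sequence are used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups residues into blocks of ten."""
    return "".join(raw.split())


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, copied from the record.
prefix = clean_sequence("GTGACTGAAA CCATCGCCGA CATTCTCGCC")

print(len(prefix))
print(round(gc_fraction(prefix), 3))
```

Applied to the full gene sequence, the same function would give a value to compare with the reported GC content of 71%.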