Gene Information |
Locus tag | Rpal_1651 |
Symbol | |
ID | 6409308 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1770388 |
End bp | 1772193 |
Gene Length | 1806 bp |
Protein Length | 601 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 642711540 |
Product | allophanate hydrolase |
Protein accession | YP_001990655 |
Protein GI | 192290050 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0154] Asp-tRNA(Asn)/Glu-tRNA(Gln) amidotransferase A subunit and related amidases |
TIGRFAM ID | [TIGR02713] allophanate hydrolase |
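
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration, not part of the database record: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The helper names `span_length` and `expected_protein_length` are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1770388
end_bp = 1772193
gene_length_bp = 1806
protein_length_aa = 601


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value against the value listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the listed Gene Length and Protein Length. The strand (-) does not affect either calculation, since only the span of the interval is used.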
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.426553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAT TCGAAACCAT CAGTGAAATC GTCGCCGCGC ATCGCGCCGG CACCACCACC CCCGCGCAGA CGATCGCGCG CTGCTACCAG CGCATCCGCG CCTATGCCGA TCCGGCACTG TTTATCACCC TGCGCGATGA GGCCGATGCG ATCGCCGAGG CGGTCGCGGT GGCGGCGCGC GACCCGTCGC TGCCGCTGTA CGGCGTGCCG GTCGCAGTGA AGGACAATAT CGACGTTGCC GGCCTGACGA CCACCGCCGC CTGCCCGGCA TTTGCATATC AGCCGGCCCA CGACGCCACC GCGGTCGCCA AGCTGCGCGC CGCCGGCGCG ATCGTGATCG GCAAGACCAA TCTCGATCAG TTCGCCACTG GTCTCGTCGG CGTGCGCTCG CCTTACGGCG TCCCCCGCAA CGCGATGCGC AGCGACCTCG TCCCCGGCGG ATCGAGCTCC GGCTCGGCGG TCGCGGTCGG TGCCGGGCTG GTGCCGCTGT CGCTCGGCAC CGATACGGCG GGCTCGGGCC GGGTGCCGGC GATGCTCAAC AACATCGTTG GGCTGAAGCC GAGCCTCGGG ATGATTTCGA CCACAGGTGT CGTGCCGGCG TGCCGCACGC TCGACTGCGT CTCGATCTTC GCATTGACGA CCGATGACGC GATGACCGCG CTGCGGGTGA TGACGGGGCC AGATGCCGAA GATCCGTTCT CGCGGGAACG GCCGGTGGCG GCAGTCACCG CGATCCCGAA GCGCGTGCGG CTCGGCGTGC CGCCGCAAGA TCAATTGCAG TTCTTCGGCG ATGATCTCGC GGCGCGTGGT TATGAGGAAG CGGTCGAGCG CTGGCGAAAG CTCGGTGCGG AGCTGGTCGA GATCGATGTC GAACCGCTCT ACGAGACGGC GCGGCTGCTG TACGAAGGGC CATGGGTGGC GGAGCGCTAT CTCACCATCC GGGAGTTGCT GGAGACCCAG CCGGACGCGG TGCACCCGGT GACGCGGCAG ATCACCCTCG CCGGTGCCAA GCTGTCCGCG GCAGACACCT TCGCGGCGCT GTACCGGCTG CAGGCGTTAC GTAAGATCGC CGAGCACAGC TTCGCCGGCA TCGACGCGCT GGTGCTGCCG ACCGCGCCGA CCGCCTACAC GGTCGAACAG GTGCTGACCG ATCCGATCAC GCTGAACAGC CGGCTCGGCA CCTACACCAA CTTCGTCAAC CTGCTGGACC TGTGCGGCCT CGCGCTGCCG GCGTCGATCC GCAGCGACGG CATTCCGTTC GGCATCACCC TGCTGGCGCC TGCCGGCCGC GATGCCGAAC TTGCCGGCCT CGGCCGCGTG TTCCACGCCG ACACCGCGCT GCCGATGGGC GCCAGCGGCC AGCCCCAGCC GCCGCTCGCC GAAGTCACTG GCGGTGATGC GCCGGGTGAG ATCGCGATCG CGGTGGTCGG AGCGCATCTT TCCGGTATGC CGTTGAACCG CGAGCTGACC GCACTCGGTG GCCGACTTCT GTCGGCCACC GCCACAGCGC CGGACTACAA ACTCTATGCG CTGAAAGGCA CCGTGCCGCC GAAGCCCGGC CTGCTGCGCG TGGCGCCGGG TAGTGGTTCT GCGATCGCTG TAGAAGTGTG GGCGCTGTCG CCGGCCGCAT TCGGCTCGTT CGTCGCGGCA ATCCCATCGC CGCTGTCGAT CGGTACGCTG ACGCTCGCCG ACGGCACCGC GGTGAAGGGC TTCCTCACCG AGCCGGCGGC GATCGAAGGC GCCCGCGATA TCTCGCATTT CGGCGGCTGG CGCGCCTACA TGGCGGAGCT GGCGGCGACC 
GGCTGA
|
Protein sequence | MSQFETISEI VAAHRAGTTT PAQTIARCYQ RIRAYADPAL FITLRDEADA IAEAVAVAAR DPSLPLYGVP VAVKDNIDVA GLTTTAACPA FAYQPAHDAT AVAKLRAAGA IVIGKTNLDQ FATGLVGVRS PYGVPRNAMR SDLVPGGSSS GSAVAVGAGL VPLSLGTDTA GSGRVPAMLN NIVGLKPSLG MISTTGVVPA CRTLDCVSIF ALTTDDAMTA LRVMTGPDAE DPFSRERPVA AVTAIPKRVR LGVPPQDQLQ FFGDDLAARG YEEAVERWRK LGAELVEIDV EPLYETARLL YEGPWVAERY LTIRELLETQ PDAVHPVTRQ ITLAGAKLSA ADTFAALYRL QALRKIAEHS FAGIDALVLP TAPTAYTVEQ VLTDPITLNS RLGTYTNFVN LLDLCGLALP ASIRSDGIPF GITLLAPAGR DAELAGLGRV FHADTALPMG ASGQPQPPLA EVTGGDAPGE IAIAVVGAHL SGMPLNRELT ALGGRLLSAT ATAPDYKLYA LKGTVPPKPG LLRVAPGSGS AIAVEVWALS PAAFGSFVAA IPSPLSIGTL TLADGTAVKG FLTEPAAIEG ARDISHFGGW RAYMAELAAT G
|
| |