Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_1587 |
Symbol | |
ID | 6409244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1695225 |
End bp | 1697048 |
Gene Length | 1824 bp |
Protein Length | 607 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642711476 |
Product | allophanate hydrolase |
Protein accession | YP_001990591 |
Protein GI | 192289986 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0154] Asp-tRNAAsn/Glu-tRNAGln amidotransferase A subunit and related amidases |
TIGRFAM ID | [TIGR02713] allophanate hydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.556353 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGCTG CTGAACTCAC CACCTTCCCG ACGATCGCGT CGCTGCACAC CGCCTATGCG GCCGGCGCCT CTCCGGCCGA GATCATCGCA GCGACGTATC GCCAGCTCGA AGCCGTGGCC GATCCCGGCA TCTTCATCAC GCTGCGGCCA GAGCACGACG TCCTTGCCGA CGCCGCCGCA CTCGGCCAGT TCGATCCCGA TACGAAGCCA CTGTGGGGCA TCCCATTCGC CGTGAAGGAC AATATCGACG TCGCGTGCCT TCCGACCACG GCCGCCTGCC CCGGCTTCGC GTCCACGCCG AGCGAAACTG CGTTTGCGGT TCAGCGCCTG CTCGATGCCG GCGCGGTCCT GATCGGCAAG ACCAACCTCG ATCAGTTCGC CACCGGACTG GTCGGCGTTC GCACGCCATA TCCAGTGCCG CGCAATGCGA TCGATCCGCG CTACGTGCCC GGCGGGTCGA GCAGCGGCTC GGCCGTGGCG GTGGCGCATG GCCTGGTGAC GTTCGCGCTC GGCACCGACA CCGCAGGCTC CGGCCGTGTC CCGGCCGCGC TCAACAACAT CGTCGGACTG AAGCCATCGC TCGGCAGCGT GTCGTCGCGT GGCATGGTGC CGGCGTGCCG GACCCTCGAT ACGATCTCGG TGTTCGCCGG CACGGTCGAC GACGCCCACG CGATCTATCG CATCATGGCC ACGTTCGACG GCGTCGATCC CTGGTCGCGG CCGCATCCCG CAGCGGCTGC CAACCCTCCA GCCTTGCCGC CCGGTCTGCG CGTCGGTGTG CCGGATACGG TCAGCCGCAA ATTCGCCGGC GACCTGCAGT CCGAACGAGC GTTCGACCTC GCGGTGGCGG ACCTTGCGAC CGTGGTGCCC GCGCCCGCTC GTGCGGTGGA TCTGTCGCCC TTGTTCGAGG TCGCCGATCT GCTCTACAGC GGACCGTGGG TCGCGGAGCG CTATCAGGCG ATCCGACAGG TGATTGAGAC GGCACCGGAG TTGCTGCATC CGGTAACACG GAAGATCATC GGCTCTGCGA CGGCGTTCAG CGCCGCCGAT GCGTTTGGCG GCCTCTACCG ACTGGCCGAA CTGCGCCGCG CCGCTGATGC GATCTGGAGC GGCATCGACG TGCTGATCGT ACCGACCTAT CCCCGCCCGC GCATGGTCGC GGAGCTGGAG GCTGATCCGA TCGGTCCGAA CAGCGAGCTC GGCACCTACA CCAATTTCGT GAACCTGCTC GACCTGTGCG CGCTGGCTGT GCCGAGTCGC TTCCGCGCCG ATGGTTTTCC GTCGGGCGTG ACGCTGATCG CACCGGCGGG CCGCGACGAC CTCCTCGCCG CGCTGGGCGA GCGCCTTCAC GCCGCAAGCG GCGTACACTT AGGCGCCAGC AGCACCAAGG TGCCGGCTTC CATCGAAGTA TCGCCCTCTG CCGTTGCGAA TGAGATCGAG CTGGTGGTGG TCGGCGCACA TCTGTCCGGC ATGCCGCTCA ATCACGAACT CATCAGCCGC GGCGCGCGTT TCCTGCGCGC GATCCCGACC GCGCCCGACT ACAAGCTGTT CGCCCTGCAG GGCGGCCCAC CATTCAGGCC CGGCCTGCTG CGCGTCGCCC CCGGCGAAGG CACCCCAATC GCAACCGAGG TTTGGGCGAT CTCCGCCGAA GGATTCGGCA GCTTCGTCGC CGGCATCCCC GCGCCTCTTG GAATCGGGAC TACGCGCCTC GCCGACGGCA CCGCGCCAAA GGGGTTCATC GTCGAAGCCG AGGGCCTGAA AGGCGCTAGG GATATTTCGT CATTCGGCGG ATGGAGGGCC TACATCAAAA GCCTTGCCGG GTAA
|
Protein sequence | MSAAELTTFP TIASLHTAYA AGASPAEIIA ATYRQLEAVA DPGIFITLRP EHDVLADAAA LGQFDPDTKP LWGIPFAVKD NIDVACLPTT AACPGFASTP SETAFAVQRL LDAGAVLIGK TNLDQFATGL VGVRTPYPVP RNAIDPRYVP GGSSSGSAVA VAHGLVTFAL GTDTAGSGRV PAALNNIVGL KPSLGSVSSR GMVPACRTLD TISVFAGTVD DAHAIYRIMA TFDGVDPWSR PHPAAAANPP ALPPGLRVGV PDTVSRKFAG DLQSERAFDL AVADLATVVP APARAVDLSP LFEVADLLYS GPWVAERYQA IRQVIETAPE LLHPVTRKII GSATAFSAAD AFGGLYRLAE LRRAADAIWS GIDVLIVPTY PRPRMVAELE ADPIGPNSEL GTYTNFVNLL DLCALAVPSR FRADGFPSGV TLIAPAGRDD LLAALGERLH AASGVHLGAS STKVPASIEV SPSAVANEIE LVVVGAHLSG MPLNHELISR GARFLRAIPT APDYKLFALQ GGPPFRPGLL RVAPGEGTPI ATEVWAISAE GFGSFVAGIP APLGIGTTRL ADGTAPKGFI VEAEGLKGAR DISSFGGWRA YIKSLAG
|
| |