Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_2957 |
Symbol | |
ID | 6410627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 3230013 |
End bp | 3231023 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 642712838 |
Product | urea amidolyase related protein |
Protein accession | YP_001991940 |
Protein GI | 192291335 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1984] Allophanate hydrolase subunit 2 |
TIGRFAM ID | [TIGR00724] biotin-dependent carboxylase uncharacterized domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAAGC TTGTAATCGA CGCCGTCGGA CCGGCCACCT CCGTGCAGGA CGCCGGGCGC CACGGCGCAC AGCGTTACGG CCTCCCGCCG AGCGGCGCGA TGGACCGGCT GTCACTTGCC GCCGCCAATG TGCTGGTCGG CAACAGCGCG TTCGCGGCGG CGATAGAACT CGGTCCGCTC GGCGCGAAAC TGAGTGTACG CGACGGCGCG GTGCGGCTGG CGCTGACAGG CGCCGAACGG CCCGCTGCGC TGGATGGGCA GCCGCTCGCG TTCAACGAGT CTTTCACGCT CGCTGAAGGC CAAATCCTCA CCCTCGGCGT CGCGCGCGGC GGCGTATTCA GCTATCTGGG AATTGAAGGC GGCGTCGGCG GCGAGCCGAT GTTCGGCAGT CTTGCGGTCA ACGCGCGCGC CGGTCTCGGC AGTCCCTACC CGCGGCCGCT GCAGGCCGGC GATGCGATCG CCGTCAAGTC TGCAGCGCCG TCGGTCGAAC GACGCCTCGA TCTGCCTGAG CAACAGGATA CGCCGATCCG CGTCGTGCTT GGTCCGCAGG ACGATGAGTT CGGTGCCGCG GTCGAAGCCT TTCTGGCCGG TGAATGGACG ATCTCGGCGA CCAGCGACCG CATGGGCTAT CGTCTCGACG GCCCGCAGAT CTCGCATCTC CACGGCCACA ACATCGTCTC GGACGGCACC GTCGACGGCA GCATTCAGGT GCCGGGGTCG GGCCAGCCGA TCGTGTTGAT GCCGGATCGC GGCACCAGCG GCGGCTATCC GAAGATCGCC ACGGTGATCT CAGCCGATCT CGGCCGACTG GCGCAGCGTC AACCCGGCCG GCCGTTCCGC TTTCAGGCGG TGAGCGTCGA AGAGGCTCAG GACGCTTATC GCACGATGGC CAAGCTGATC CGCTCGCTGC CTGACCTGCT GCGCGATGCG CAGCACGCGA TCATCGACCT CGACGCGCTG CTCTCCGCCA ACGTTGCCGG CACGGCAATC GACGCGCTGG CGGCGGAGTA A
|
Protein sequence | MTKLVIDAVG PATSVQDAGR HGAQRYGLPP SGAMDRLSLA AANVLVGNSA FAAAIELGPL GAKLSVRDGA VRLALTGAER PAALDGQPLA FNESFTLAEG QILTLGVARG GVFSYLGIEG GVGGEPMFGS LAVNARAGLG SPYPRPLQAG DAIAVKSAAP SVERRLDLPE QQDTPIRVVL GPQDDEFGAA VEAFLAGEWT ISATSDRMGY RLDGPQISHL HGHNIVSDGT VDGSIQVPGS GQPIVLMPDR GTSGGYPKIA TVISADLGRL AQRQPGRPFR FQAVSVEEAQ DAYRTMAKLI RSLPDLLRDA QHAIIDLDAL LSANVAGTAI DALAAE
|
| |