Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1734 |
Symbol | |
ID | 3908259 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1980211 |
End bp | 1981380 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883628 |
Product | peptidase M20D, amidohydrolase |
Protein accession | YP_485353 |
Protein GI | 86748857 |
COG category | [R] General function prediction only |
COG ID | [COG1473] Metal-dependent amidase/aminoacylase/carboxypeptidase |
TIGRFAM ID | [TIGR01891] amidohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.169236 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0172136 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTTGA TCAACCGCGT CGCCGACCTG CAACCCGATA TCATGGCCTG GCGTCACGAC CTCCATCAGC ACCCCGAACT GATGTACGAC GTCGACCGCA CCGCCGATTT CGTCGCCCAG CGCCTGCGCG AATTCGGCTG CGACGAGGTG GTGACGGGAC TCGGCCGCAC CGGCGTGGTC GGTGTGATCC GCGGCCGCAA GCCGGCGAGC GGCGACCTCA AGGTGATCGG GCTGCGCGCC GACATGGACG CGCTGCCGAT CGAGGAGGCG ACCGGTCTAC CCTATGCCTC CAAGGTGCCC GGCAAGATGC ACGCCTGCGG CCATGACGGC CACACCGCGA TGCTGCTCGG CGCCGCGCGC TATCTCGCCG AGACCCGCAA TTTCGCAGGC AGTGTAGTGG TGATCTTCCA GCCGGCCGAG GAGGGCGGCG CGGGGGCCGC GGCGATGATC AAGGACGGGC TGATGGACCG CTTCGGCATC GAGCAGGTCT ACGGCATGCA CAACGGCCCC GGCATCCCGG TCGGCTCCTT CGCCATCAGC CCGGGCGCGA TCATGGCCTC GACCGATTCG GTCGACATCC GCATCGAGGG CGTCGGCGGC CACGCCGCGC GGCCGCATAT GTGCGTCGAC TCGGTGCTGG TGGGCGCCCA GCTCGTCACC GCGCTGCAGT CGATCGTGTC GCGCACGGTC GATCCGCTGG AATCGGCGGT GATCTCGATC TGCGAATTCC ACGCCGGCAA CGCCCGCAAC GTCATCCCGC AGATCGCCGA ACTGAAAGGC ACGGTCCGCA CCCTGAAGGC CGAAGTTCGC GACCTGGTCG AGAAGCGCAT CCACGAGGTC GCGGCCGGCG TTGCGCAGTC GACCGGCGCC AGGATCGACA TCGTCTACGA GCGCGGCTAC CCGGTGGTGG TCAACCATGC CGAGCAGACC GAGGTGGCGC AGCGGATCGC CCGCGACATC GCCGGCGAGT CCAACGTGAC GTCGATGCCG CCGCTGATGG GCGCCGAGGA TTTCGCCTAT ATGCTGGAAG CGCGGCCGGG CGCGTTCATC TTCCTCGGCA ATGGCGACAG CGCCGGGCTG CATCACCCGG CCTACAACTT CAACGACGAC GCCATCGTCT ACGGCACCTC GTACTGGATC AAACTGGTCG AGAACCAACT CGCGGCGTGA
|
Protein sequence | MPLINRVADL QPDIMAWRHD LHQHPELMYD VDRTADFVAQ RLREFGCDEV VTGLGRTGVV GVIRGRKPAS GDLKVIGLRA DMDALPIEEA TGLPYASKVP GKMHACGHDG HTAMLLGAAR YLAETRNFAG SVVVIFQPAE EGGAGAAAMI KDGLMDRFGI EQVYGMHNGP GIPVGSFAIS PGAIMASTDS VDIRIEGVGG HAARPHMCVD SVLVGAQLVT ALQSIVSRTV DPLESAVISI CEFHAGNARN VIPQIAELKG TVRTLKAEVR DLVEKRIHEV AAGVAQSTGA RIDIVYERGY PVVVNHAEQT EVAQRIARDI AGESNVTSMP PLMGAEDFAY MLEARPGAFI FLGNGDSAGL HHPAYNFNDD AIVYGTSYWI KLVENQLAA
|
| |