Gene Information |
Locus tag | RPB_1221 |
Symbol | |
ID | 3910156 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1396370 |
End bp | 1397917 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637883115 |
Product | MlrC-like protein |
Protein accession | YP_484842 |
Protein GI | 86748346 |
COG category | [S] Function unknown |
COG ID | [COG5476] Uncharacterized conserved protein |
TIGRFAM ID | |
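The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive, 1-based coordinates, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming exactly one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record (Start bp, End bp).
length = gene_length(1396370, 1397917)
print(length)                  # 1548
print(protein_length(length))  # 515
```

Under those assumptions both results agree with the Gene Length and Protein Length fields.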
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCAA CCAAGTCATC ATCAGAGAAA CAAGACATGA CCCGTATCGC CGTCGGCGGC TTTCTGCACG AGACCAATAC TTTTGCGCCC ACCAAGGCCA CCTGGGAGGC GTTCGTGCAC GGCGGCGGCT GGCCGGCGAT GACGATGGGC GCCGACGTGC TCAAGGTGAT GCGCGGCATC AATGTCGGGC TCGCCGGCTT CGTCGAGGAC GCCGAGCGCA AAGGCTGGGA GTTGGTCCCG ACCATCGCCT GCGGGGCGAG CCCGTCGGCC CACGTCACCG AAGACGCCTT CGAACGCGTC GTGAAGGCGA TGATCGACGG CATCCAGGCC GCCGGCAAAC TCGACGCGGT GTATCTCGAT CTGCACGGCG CCATGGTCAC CGAACATCTC GACGACGGCG AAGGCGAGAT CCTCTCGCGC GTCCGCGAGG TGATCGGCCC CGATTTGCCG CTGGTGGTCA GCATCGATCT GCACGCCAAC GTCACGCCAG CGATGATCGA CCATGCCGAC GCGCTGATCG CCTACCGCAC CTATCCGCAT GTCGACATGG CCGACACCGG CCGCGCCGCG GCGAAGCACC TCGATCTGCT GCTGCAGAGC GGCGCGAAAT ACGCGAAGGC GTTCCGGCAA TTGCCGTTCC TGATCCCGAT CTCGTGGCAA TGCACCTTCG ACGAGCCGAC CAAGGGCATC TACGCCAAGC TCGCCGCGCT GGAGAGCGAC GCGGTGCCGA CGCTGTCGTT CGCGCCGGGC TTCCCGGCCG CCGATTTTCC GGATTGCGGC CCCAGCGTGT TCGCCTATGG CCGCACCCAG GCGGATGCCG ACGCCGCGGC CGACAAAGTC GCCGCTTTGG TGATCGGCCA CGAGAATGAT TTCGACGGCA CCATCCATTC GCCCGACGAC GGCGTGCGGC TGGCGATGCA GATCGCACGC GGCGCGGCAA AGCCGGTGAT CATCGCCGAC ACCCAGGACA ATCCCGGGGC CGGCGGCGAT TCCGACACCA CCGGCATGTT GCGCGCACTG GTGCGCAACG ACGCGCAGCG CGCCGCGATC GGCGTGATCT ACGATCCCGT GTCGGCGCAA GCCGCACACG CCGCGGGCGT CGGCGCCACC GTCAGGCTGG CGCTCGGTGG CAAGTCGGGC ATCGCCGGCG ACGCGCCCTA CGAAGAGAGT TTCGTCGTCG AGCATCTGTC CGACGGCCGC TTCGTCGCAC CCGGTCCTTA CTATGGCGGG CGCGAGATGG AGATGGGGCC GTCGGCGTGC CTGCGCATCG GCGACGTGCG CGTCGTCGTC AGCTCGCACA AGGCGCAGCT CGCCGATCAG GCGATGTATC GCTATGTCGG CATCGAGCCG ACCGAACAGG CGATCCTCGT CGTGAAGAGT TCGGTGCATT TTCGCGCCGA CTTCCAGCCG ATCGCCGAGC GCCTGCTGAT CTGCGCCGCG CCCGGCGCGA TGCCGGCGGA TACCGCGTCG TTGCAGTGGA CACGCCTGCG CCCGGGCGTG CGTGTCAAGC CGAATGGACA ACCGTTTCTC GGCCGCAACG CCAACTAA
|
Protein sequence | MTATKSSSEK QDMTRIAVGG FLHETNTFAP TKATWEAFVH GGGWPAMTMG ADVLKVMRGI NVGLAGFVED AERKGWELVP TIACGASPSA HVTEDAFERV VKAMIDGIQA AGKLDAVYLD LHGAMVTEHL DDGEGEILSR VREVIGPDLP LVVSIDLHAN VTPAMIDHAD ALIAYRTYPH VDMADTGRAA AKHLDLLLQS GAKYAKAFRQ LPFLIPISWQ CTFDEPTKGI YAKLAALESD AVPTLSFAPG FPAADFPDCG PSVFAYGRTQ ADADAAADKV AALVIGHEND FDGTIHSPDD GVRLAMQIAR GAAKPVIIAD TQDNPGAGGD SDTTGMLRAL VRNDAQRAAI GVIYDPVSAQ AAHAAGVGAT VRLALGGKSG IAGDAPYEES FVVEHLSDGR FVAPGPYYGG REMEMGPSAC LRIGDVRVVV SSHKAQLADQ AMYRYVGIEP TEQAILVVKS SVHFRADFQP IAERLLICAA PGAMPADTAS LQWTRLRPGV RVKPNGQPFL GRNAN
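A minimal sketch of how the gene sequence relates to the protein sequence and to the GC content field. It assumes the gene sequence is listed in coding orientation (it begins with ATG even though the gene is on the minus strand), that the spaces are only display grouping, and that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in its permitted start codons. The helper names are hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence as listed in the record.
first_bases = "ATGACAGCAA CCAAGTCATC ATCAGAGAAA"
print(translate(first_bases))  # MTATKSSSEK
```

Passing the full gene sequence to `translate` and `gc_fraction` should, under the same assumptions, give values comparable to the Protein sequence and GC content fields.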