Gene RPB_1221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1221 
Symbol 
ID3910156 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1396370 
End bp1397917 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content67% 
IMG OID637883115 
ProductMlrC-like protein 
Protein accessionYP_484842 
Protein GI86748346 
COG category[S] Function unknown 
COG ID[COG5476] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCAA CCAAGTCATC ATCAGAGAAA CAAGACATGA CCCGTATCGC CGTCGGCGGC 
TTTCTGCACG AGACCAATAC TTTTGCGCCC ACCAAGGCCA CCTGGGAGGC GTTCGTGCAC
GGCGGCGGCT GGCCGGCGAT GACGATGGGC GCCGACGTGC TCAAGGTGAT GCGCGGCATC
AATGTCGGGC TCGCCGGCTT CGTCGAGGAC GCCGAGCGCA AAGGCTGGGA GTTGGTCCCG
ACCATCGCCT GCGGGGCGAG CCCGTCGGCC CACGTCACCG AAGACGCCTT CGAACGCGTC
GTGAAGGCGA TGATCGACGG CATCCAGGCC GCCGGCAAAC TCGACGCGGT GTATCTCGAT
CTGCACGGCG CCATGGTCAC CGAACATCTC GACGACGGCG AAGGCGAGAT CCTCTCGCGC
GTCCGCGAGG TGATCGGCCC CGATTTGCCG CTGGTGGTCA GCATCGATCT GCACGCCAAC
GTCACGCCAG CGATGATCGA CCATGCCGAC GCGCTGATCG CCTACCGCAC CTATCCGCAT
GTCGACATGG CCGACACCGG CCGCGCCGCG GCGAAGCACC TCGATCTGCT GCTGCAGAGC
GGCGCGAAAT ACGCGAAGGC GTTCCGGCAA TTGCCGTTCC TGATCCCGAT CTCGTGGCAA
TGCACCTTCG ACGAGCCGAC CAAGGGCATC TACGCCAAGC TCGCCGCGCT GGAGAGCGAC
GCGGTGCCGA CGCTGTCGTT CGCGCCGGGC TTCCCGGCCG CCGATTTTCC GGATTGCGGC
CCCAGCGTGT TCGCCTATGG CCGCACCCAG GCGGATGCCG ACGCCGCGGC CGACAAAGTC
GCCGCTTTGG TGATCGGCCA CGAGAATGAT TTCGACGGCA CCATCCATTC GCCCGACGAC
GGCGTGCGGC TGGCGATGCA GATCGCACGC GGCGCGGCAA AGCCGGTGAT CATCGCCGAC
ACCCAGGACA ATCCCGGGGC CGGCGGCGAT TCCGACACCA CCGGCATGTT GCGCGCACTG
GTGCGCAACG ACGCGCAGCG CGCCGCGATC GGCGTGATCT ACGATCCCGT GTCGGCGCAA
GCCGCACACG CCGCGGGCGT CGGCGCCACC GTCAGGCTGG CGCTCGGTGG CAAGTCGGGC
ATCGCCGGCG ACGCGCCCTA CGAAGAGAGT TTCGTCGTCG AGCATCTGTC CGACGGCCGC
TTCGTCGCAC CCGGTCCTTA CTATGGCGGG CGCGAGATGG AGATGGGGCC GTCGGCGTGC
CTGCGCATCG GCGACGTGCG CGTCGTCGTC AGCTCGCACA AGGCGCAGCT CGCCGATCAG
GCGATGTATC GCTATGTCGG CATCGAGCCG ACCGAACAGG CGATCCTCGT CGTGAAGAGT
TCGGTGCATT TTCGCGCCGA CTTCCAGCCG ATCGCCGAGC GCCTGCTGAT CTGCGCCGCG
CCCGGCGCGA TGCCGGCGGA TACCGCGTCG TTGCAGTGGA CACGCCTGCG CCCGGGCGTG
CGTGTCAAGC CGAATGGACA ACCGTTTCTC GGCCGCAACG CCAACTAA
 
Protein sequence
MTATKSSSEK QDMTRIAVGG FLHETNTFAP TKATWEAFVH GGGWPAMTMG ADVLKVMRGI 
NVGLAGFVED AERKGWELVP TIACGASPSA HVTEDAFERV VKAMIDGIQA AGKLDAVYLD
LHGAMVTEHL DDGEGEILSR VREVIGPDLP LVVSIDLHAN VTPAMIDHAD ALIAYRTYPH
VDMADTGRAA AKHLDLLLQS GAKYAKAFRQ LPFLIPISWQ CTFDEPTKGI YAKLAALESD
AVPTLSFAPG FPAADFPDCG PSVFAYGRTQ ADADAAADKV AALVIGHEND FDGTIHSPDD
GVRLAMQIAR GAAKPVIIAD TQDNPGAGGD SDTTGMLRAL VRNDAQRAAI GVIYDPVSAQ
AAHAAGVGAT VRLALGGKSG IAGDAPYEES FVVEHLSDGR FVAPGPYYGG REMEMGPSAC
LRIGDVRVVV SSHKAQLADQ AMYRYVGIEP TEQAILVVKS SVHFRADFQP IAERLLICAA
PGAMPADTAS LQWTRLRPGV RVKPNGQPFL GRNAN