Gene RPB_1041 details


Gene Information

Locus tag: RPB_1041
Symbol:
ID: 3909165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1196131
End bp: 1197747
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 65%
IMG OID: 637882934
Product: extracellular solute-binding protein
Protein accession: YP_484662
Protein GI: 86748166
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
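
The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption about this record's convention, although it agrees with the numbers shown) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only and do not come from any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1196131, 1197747)
print(length)                  # 1617
print(protein_length(length))  # 538
```

Both results match the Gene Length and Protein Length fields of this record.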


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCTTT CGCGCTTCGA GATCAACCGC CGGACCGTCC TGCTGACGTC GGCCGCCATC 
GCCGCCAATG TGATCAATCC GATGCGGGCG TTCGCGCAGG AAACGCCGCG CAAGGGCGGC
GTGTTCAACG TTCATTACGG CGCCGAGCAG CGCCAGCTCA ACCCCAGCTT GCAGGCCTCG
ACCGGCGTCT ACATCATCGG CGGCAAGATC CAGGAGCCGC TGGTCGATCT CGACGCCGCC
GGCAATCCGG TCGGCGTGCT GGCCGAGAGC TGGGACTCGA CGCCCGACGG CAAGACCATC
ACCTTCCGGC TGCGCAAGGG CGTCACCTGG CACGACGGCA AGCCGTTCAC CTCCGAGGAC
GTCGCCTTCA CCGCGATGAA CATGTGGAAG AAGATCCTCA ATTACGGATC GACGCTGCAG
CTGTTCCTGA CCGCGGTCGA CACCCCCGAT CCGCAGACCG CGATCTTCCG CTACGAGCGG
CCGATGCCGC TGAACCTGCT GCTGCGCGCG CTGCCCGACC TCGGCTACAT TTCGCCGAAG
CACATCTACG AGACCGGCGA CATCCGCCAG AACCCGGTCA ATCTCGCGCC GATCGGCACC
GGCCCGTTCA AGTTCAACAA ATACGAGCGC GGCCAGTACA TCATCGCCGA CCGCAACGAC
AATTACTGGC GGCCGAACGC GCCCTATCTC GACCGCATCG TCTGGCGGGT GATCACCGAC
CGCGCCGCGG CGGCGGCCCA GCTCGAAGCC GGCAGCCTGC ATCTCAGCCC GTTCTCGGGT
CTCACGATCT CCGACATGGC GCGGCTCGGC AAGGACAAGC GTTTCGTCGT CTCGACCAAG
GGCAACGAGG GCAACGCCCG CACCAACACG CTGGAATTCA ACTTCCGCCG CAAGGAGCTG
TCGGACATCC GCGTCCGCAA GGCGATCGCG CACGCGATCA ACGTGCCGTT CTTCATCGAG
AACTTCCTCG GCGATTTCGC CAAGCTCGGC ACCGGGCCGA TCCCCTCGAC CTCGACCGAC
TTCTATCCGG GCCCGAACAC GCCGCAATAT CCGTACGACA AGAAGAAGGC GATCGCGCTG
CTCGACGAGG CCGGGCTGAA GCCGGCCGCC GGCGGCAACC GCCTCACGTT GCGGCTCTTG
CCGGCGCCGT GGGGCGAGGA CATCTCGCTG TGGGCGACCT TCATCCAGCA GTCGCTGGGC
GAAGTCGGCA TCCAGGTCGA GGTGGTGCGC AACGACGGCG GCGGCTTCCT CAAGCAGGTC
TATGACGAAC ACGCCTTCGA CCTCGCCACC GGCTGGCATC AATATCGCAA CGACCCCGCG
GTCTCGACCA CGGTGTGGTA TCGCTCCGGC CAGCCCAAGG GCGCGCCCTG GACCAATCAA
TGGGGCTGGA AGGACGAGGC GATCGACAAG ATCATCGACG ACGCCGCCAC CGAGGTCGAT
CCGGTCAAGC GCAAGGCGCT GTATGCCGAC TTCGTCACCC GCGCCAATGG CGAGCTGCCG
CTGTGGATGC CGATTGAGCA ATTGTTCGTC ACGGTGATCA GCGCGAAGGC ACGCAATGCC
TCCAATAATC CACGCTGGGC GTCGTCGACC TGGCACGATC TTTGGCTGGC CGAATAG
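
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the GC content field could be recomputed from the listing. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence listing, copied as printed.
first_row = "ATGGCCCTTT CGCGCTTCGA GATCAACCGC CGGACCGTCC TGCTGACGTC GGCCGCCATC"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC percentage of this row only
```

Passing all rows of the listing to `clean_sequence` gives the full gene, whose GC fraction can then be compared with the GC content field.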
 
Protein sequence
MALSRFEINR RTVLLTSAAI AANVINPMRA FAQETPRKGG VFNVHYGAEQ RQLNPSLQAS 
TGVYIIGGKI QEPLVDLDAA GNPVGVLAES WDSTPDGKTI TFRLRKGVTW HDGKPFTSED
VAFTAMNMWK KILNYGSTLQ LFLTAVDTPD PQTAIFRYER PMPLNLLLRA LPDLGYISPK
HIYETGDIRQ NPVNLAPIGT GPFKFNKYER GQYIIADRND NYWRPNAPYL DRIVWRVITD
RAAAAAQLEA GSLHLSPFSG LTISDMARLG KDKRFVVSTK GNEGNARTNT LEFNFRRKEL
SDIRVRKAIA HAINVPFFIE NFLGDFAKLG TGPIPSTSTD FYPGPNTPQY PYDKKKAIAL
LDEAGLKPAA GGNRLTLRLL PAPWGEDISL WATFIQQSLG EVGIQVEVVR NDGGGFLKQV
YDEHAFDLAT GWHQYRNDPA VSTTVWYRSG QPKGAPWTNQ WGWKDEAIDK IIDDAATEVD
PVKRKALYAD FVTRANGELP LWMPIEQLFV TVISAKARNA SNNPRWASST WHDLWLAE
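
The protein sequence can be checked against the gene sequence by translating it codon by codon. Translation table 11 (the bacterial, archaeal and plant plastid code) uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, so a standard lookup table is enough for this gene, which starts with ATG. The code below is a sketch under that assumption. The `translate` function is an illustrative helper, not part of any specific library.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the block spacing and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence listing, copied as printed.
first_row = "ATGGCCCTTT CGCGCTTCGA GATCAACCGC CGGACCGTCC TGCTGACGTC GGCCGCCATC"
print(translate(first_row))    # MALSRFEINRRTVLLTSAAI
print(translate("GCCGAATAG"))  # AE (last three codons; TAG is the stop)
```

The first output matches the first 20 residues of the protein sequence, and the second matches its final two residues.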