Gene RPB_0126 details


Gene Information

Locus tag: RPB_0126
Symbol:
ID: 3908097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 136074
End bp: 137282
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 66%
IMG OID: 637882008
Product: extracellular ligand-binding receptor
Protein accession: YP_483749
Protein GI: 86747253
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
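
The length fields above agree with one another under the usual conventions. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start/end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
print(gene_length(136074, 137282))  # 1209
print(protein_length(1209))         # 402
```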


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGGGC GCTTCGCGGT GAAACTGGTC TCGGTCGCGG TGGCATTGAT GGCGAGCGGT 
TGGCGGGCCG ACGCCGCCGA TCAACCACCG ATCCGGATCG GCGACGTCTC CAGCTATTCG
GCTCTGCCGA TCGGCGCCCG TGGCTATCGG CAGGGCTGGG AGCTCGCGCG CGACGAGATC
AACGGCAAAG GCGGCGTGCT GGGCCGGCAG CTCCAGATCA TCTCCCGCGA CGACGCCGGC
AAGCCCGACG CGGCCATCAC GCAGGCCGCG CAACTGGTCG ACGCGGAGAA CGTGGATCTG
CTGACCGGGA CGATCCTCTC CAATGTCGGC CTCGCCGTCT CCGATTTCGC CAAACGGCGG
AAGATCTTCT TCCTGGCCTC GCAGCCCCTG ACCGACGCGC TGATCTGGGA CAAGCGCAGC
CGCTACACTT ACCGGCTGCG GCCTTCGACC TACACCCAGA CCACCATCCT GGCCCAGGAA
GCCGCCAAGC TGCCGGCGCG GCGCTGGGCC ACGATCGCAC CCAACTACGA GTTCGGGCAG
GCGGCCGTCT CCAACTTCAA GCAGGAATTG CGGCGACTCC GACCTGATGT CGAGTTCATT
TCCGAGCAAT GGCCGCCGCT CGGCAAGATC GATGCGGGCT CGGTCGTGCA GGCGATGGCC
GCCGATAACC CGGAGGCGAT CTTCAACGTC ACCTTCGGGT CCGATCTCGC CCGGCTGGTG
CGGGAGGGAA CCACGCGCAA GACGTTTGCG AACCGCACCG TCGTCAGCCT CCTGACCGGC
GAGCCGGAAT ATCTGGCGCC GCTGAAGGAC GAGGCGCCCG AGGGATGGAT CGTGACCGGC
TATCCGTGGA ACGAGATCGC GACGCCAGAG CACCGCGCCT TCGTCGAGGC CTATCGGGCG
CGCTTCAAGG ATGCGCCGAA CATCGGTGCG CTGATCGGCT ACATCAACAC GCTGGTGCTC
GCCAAAGCGA TCGAAACCGC CGGCTCGACC GAGACCGAAG CGCTTCTCGC GGCGATGGAG
CACTTGAAAA TCGACACGCC GTGCGGGCCG ATCTCGTTCC GTGCCGTCGA CCATCAGTCC
ACGCTGGGCA TCTACATCGG CAAGCTGGCG GTGAGGGATG GCCAGGGGGT GATGACCGAC
TGGCGTTATG TCGACGGCGC CACTCACCAG CCGTCGGACG CCGACGTTCT GGCCAAAATC
AAAGAGTAG
 
Protein sequence
MAGRFAVKLV SVAVALMASG WRADAADQPP IRIGDVSSYS ALPIGARGYR QGWELARDEI 
NGKGGVLGRQ LQIISRDDAG KPDAAITQAA QLVDAENVDL LTGTILSNVG LAVSDFAKRR
KIFFLASQPL TDALIWDKRS RYTYRLRPST YTQTTILAQE AAKLPARRWA TIAPNYEFGQ
AAVSNFKQEL RRLRPDVEFI SEQWPPLGKI DAGSVVQAMA ADNPEAIFNV TFGSDLARLV
REGTTRKTFA NRTVVSLLTG EPEYLAPLKD EAPEGWIVTG YPWNEIATPE HRAFVEAYRA
RFKDAPNIGA LIGYINTLVL AKAIETAGST ETEALLAAME HLKIDTPCGP ISFRAVDHQS
TLGIYIGKLA VRDGQGVMTD WRYVDGATHQ PSDADVLAKI KE
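
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal Python sketch for working with them, assuming the blocks are joined simply by removing whitespace. It uses the standard codon assignments, which translation table 11 shares with the standard code for elongation codons. The helper names `clean`, `gc_percent`, and `translate` are illustrative and are not part of any database tool.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments; '*' marks stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq_text: str) -> str:
    """Join blocked sequence text into one uppercase string."""
    return "".join(seq_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
first_blocks = "ATGGCTGGGC GCTTCGCGGT GAAACTGGTC"
print(translate(first_blocks))  # MAGRFAVKLV
```

Pasting the full gene sequence into `translate` and `gc_percent` is one way to compare it against the protein sequence and the GC content reported in the record.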