Gene RPB_0422 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0422 
Symbol 
ID3909978 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp464874 
End bp466073 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content69% 
IMG OID637882308 
Productextracellular ligand-binding receptor 
Protein accessionYP_484044 
Protein GI86747548 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAC TGCGCGATCG AAATACCACG TCTCTGCGAA CCAGCCGCCG CACGGCGGTC 
GGATTGATCC TCGGCGCGCC GTTGCTCGGC GCCTGCTCGG GGATGCAGCA GACGCTCTCC
AGTCAGTTCG GCCAGCAGCC GACCGCGCCG GAGGCCGCGC AGCAATCGCA ATCCGTCGGC
AACGGTCGGG TCAAGGTCGG TCTCGTGCTG CCGCTGTCTG CGGCCGGAAA TGCCGGCGTC
GCCGCGCAGT CGATGAAGAA TGCCGCCGAG ATGGCGCTCG CCGAGTTCAA CAATCCCGAC
ATCCAGTTGC TGGTGAAAGA CGATGCCGGC AATCCGCAGG GCGCGCAGGC CGCGACCCAG
CAGGCGCTCG ACGAAGGCGC CGAGATCATG CTCGGTCCGC TGTTCGCGCA ATCGGTGCCG
GCTGCCGCCC AGCTGACGCG CGCCCGCGGC ATCTCGATGA TCGCGTTCTC GACGGATTCG
AGCGTCGCCG GCCGCGGCGT CTATCTGTTG AGCTTCCTGC CGGAGTCCGA CGTCAACCGG
ATCATCGGCT ACGCGTCGAG CGTCGGCAAA CGTTCCTATG CGGCACTGCT GCCGGACAAC
GCCTATGGCG GCGTCGTCGA GGCCGCCTTC AAGCAGGTGG TCGGGACCAA GGGCGGCCGC
ATCGCCGCGT TCGAGAAATA CGGCGCCGAC CGAGCCGGCC CGGCGCGGAC CATCGCGCAG
GCGCTGTCCG GCGCCGATTC GCTGCTGCTC GCCGATGACG GTGATGCGCT GGCAAGCGTC
AGCGAGGCGC TGACCGCGGC GGGCGCCGAT CTGCGCCGCG TGCAGTTGCT CGGCACCGGG
CTGTGGGACA ATCCGCGCGT GTTCGCAACG CCGGCGTTGC AGGGTGGACT CTACGCCGCG
CCCGATCCGT CCGGCTTTCG CAGTTTCTCT GGCCGCTACC GCGCCAAATT CGGCCAGGAA
CCCGTCCGCA CCGCGACGCT CGCTTACGAT GCAGTGGCGC TGGTGGCGGC CTTGTCGAAG
ACGCAGGGTG CCAAGCGGTT CTCGGCCGAG GTGTTGACCA ATCCGTCGGG CTTCGCCGGC
ATCGACGGCC TGTTTCGCTT CCGCGCCGAC GGCAGCAACG AGCGGGGCCT CGCGGTGATG
CGCGTTGCGA CCGGCGGCGC CCAGGCGGTG GCTGGATCGC CGAAGAGCTT CGGGGCGTAG
 
Protein sequence
MAELRDRNTT SLRTSRRTAV GLILGAPLLG ACSGMQQTLS SQFGQQPTAP EAAQQSQSVG 
NGRVKVGLVL PLSAAGNAGV AAQSMKNAAE MALAEFNNPD IQLLVKDDAG NPQGAQAATQ
QALDEGAEIM LGPLFAQSVP AAAQLTRARG ISMIAFSTDS SVAGRGVYLL SFLPESDVNR
IIGYASSVGK RSYAALLPDN AYGGVVEAAF KQVVGTKGGR IAAFEKYGAD RAGPARTIAQ
ALSGADSLLL ADDGDALASV SEALTAAGAD LRRVQLLGTG LWDNPRVFAT PALQGGLYAA
PDPSGFRSFS GRYRAKFGQE PVRTATLAYD AVALVAALSK TQGAKRFSAE VLTNPSGFAG
IDGLFRFRAD GSNERGLAVM RVATGGAQAV AGSPKSFGA