Gene RPB_1463 details


Gene Information

Locus tag: RPB_1463
Symbol:
ID: 3908413
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1652516
End bp: 1653532
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 66%
IMG OID: 637883357
Product: extracellular solute-binding protein
Protein accession: YP_485084
Protein GI: 86748588
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
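
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; neither convention is stated in this record. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 1652516
END_BP = 1653532
GENE_LENGTH_BP = 1017
PROTEIN_LENGTH_AA = 338


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def residues_from_cds(length_bp: int) -> int:
    """Amino acids encoded by a CDS whose length includes one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(residues_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: the coordinate range spans 1017 bp, and 1017 bp corresponds to 339 codons, one of which is the stop codon.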


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.46603
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCC GCTTTCGCAC CGCTGCCCTC GCCCTCGCCG CTGTCGTCTT GCCAGCGCAG 
GCGTTCGCAG CCGAACAGGT CAACGTCTAC ACCTATCGCG AGACCAAGCT GGTCCAGCCG
CTGTTCGACG CTTTCACCAA GGACACCGGC ATCGCCGTCA ACGTGATTTC GGCGAGTTCG
GGGCTGGAGC AGCGGATGAA GGCGGAAGGC GCCAACAGCC CGGCCGACGT GCTGCTGACG
GTCGATATCG GGCGGATCGA CGAGGCGGTG GCGGCCGGCG TCACCCAGCC GATCCAGTCG
GCGGTGGTCG ACGAGATCGT GCCGCCGCGC TATCGCGATC CCGACGGCCA CTGGGCCGGC
ATCTCGATGC GGGCGCGGGT GATCTACGCC TCGAAGGACC GCGTCAAGCA AGACGCGATC
ACCTACGAGG AACTGGCCGA TCCAAAATGG AAGGGCAAGA TCTGCATCCG CTCTGGCCAG
CACATCTACA ACAACGCGCT GTTTGCCGCC TATATCGCCA AGCACGGCGA GGAGAAGGCC
GAGGCCTGGC TGCGCGGCCT CAAGGCCAAT CTGGCGCAGA AACCGTCGGG CGGCGACCGC
GAGACGGCGC GCGACGTGGC GGCGGGCAAA TGCGACATCG GCATCGGCAA CACCTACTAC
TGGGCGCTGA TGATGAACGG CGATCCCGAC AAGAAGCCGT GGGCGGAAGC GACCCGCGTG
ATCCTGCCGA CCTTCGAGGG CGGCGGCACC CACGTCAATC TGTCGGGCGT GCTGCTGGCC
AAGAACGCGC CGAACAAGGC CAACGGCGTC AAGCTGATCG AATGGCTGCT CGGCGAGAAG
GCGCAGCAGA TCTACGCCAA CGCCAACTAC GAATATCCGA TCCGCCCCGG CGTGCCGCTC
AACCCGACCA TTGCCGGCTA CGGCAAGCTG ACCGCCGACT CGCTGCCGAT CGCCAAGATC
GCCGCGCAGC GCAAGGCCGC CTCGACGCTG GTCGACAAGG TCGGGTTCGA CAACTGA
 
Protein sequence
MSRRFRTAAL ALAAVVLPAQ AFAAEQVNVY TYRETKLVQP LFDAFTKDTG IAVNVISASS 
GLEQRMKAEG ANSPADVLLT VDIGRIDEAV AAGVTQPIQS AVVDEIVPPR YRDPDGHWAG
ISMRARVIYA SKDRVKQDAI TYEELADPKW KGKICIRSGQ HIYNNALFAA YIAKHGEEKA
EAWLRGLKAN LAQKPSGGDR ETARDVAAGK CDIGIGNTYY WALMMNGDPD KKPWAEATRV
ILPTFEGGGT HVNLSGVLLA KNAPNKANGV KLIEWLLGEK AQQIYANANY EYPIRPGVPL
NPTIAGYGKL TADSLPIAKI AAQRKAASTL VDKVGFDN
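
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a minimal sketch, not the tool that produced this record: it builds the codon table from the standard amino-acid assignments (which translation table 11 shares for every codon; the alternative start codons of table 11 are not modeled), strips the 10-base spacing used in the listing, and stops at the first stop codon. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing; uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of the input.

    Raises KeyError if a codon contains a character other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence as listed in this record.
    first_line = "ATGAGCCGCC GCTTTCGCAC CGCTGCCCTC GCCCTCGCCG CTGTCGTCTT GCCAGCGCAG"
    print(translate(first_line))
```

Run on the first line of the gene sequence, the routine yields the first 20 residues of the listed protein sequence (MSRRFRTAAL ALAAVVLPAQ). Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content reported in the table.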