Gene RPB_4614 details


Gene Information

Locus tag: RPB_4614
Symbol:
ID: 3912431
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5213725
End bp: 5214945
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 62%
IMG OID: 637886518
Product: extracellular ligand-binding receptor
Protein accession: YP_488208
Protein GI: 86751712
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
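
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5213725
end_bp = 5214945

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1221
print(protein_length)  # 406
```

Both results agree with the listed Gene Length (1221 bp) and Protein Length (406 aa).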


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTTAC GCCCGAATTC CGTTTTCGCG TCGGCAGCCG CGGTGGCCGC GGTTATGCTT 
GCGGCCACAT CGGCTGCAGC GGCGGAGAAG AAATACGATC CCGGCGCCAG CGACACTGAA
ATCAAGATCG GCCAGACCGT GCCGCATTCG GGTCCCGGTT CGCTCTATGG CGTGCTCGGC
CGCGTCGGCG AAGCCTATTT CCAGATGCTG AACGACAAGG GCGGCATCAA CGGCCGCAAG
GTCAAATTCC TGACCCTGGA CGATTCCTAC AGCGCGCCGA AGGCGGTCGA AGCCACCCGG
CGGCTGGTCG AGCAGGAAGA GGTGCTGGCG CTGTACGGCT CGCTCGGCAC CGCGCCGCAG
ACGGCTGTCC ACAAATATCT CAACAACAAG AAGGTGCCGC AGCTGCTGCT GAACACCGGC
GCGTCGAAAT GGAACGACCC GAAGAACTTC AAATGGACCA TGGCGGGTCT GCCGCTCTAT
CCGACCGAGG CGCGGATTCT CGCCAAATAC GTGCTCAGCG TGAAACCGGA CGCCAAGATC
GCGATCCTCT ATCAGAACGA CGATTTCGGC CGTGACTTCC TCGGCCCGTT CAAGAAGGTT
TTGGAAGACG CCGGCGGCAA GGCCAAGGTG ATCGCCGAGG CCAGCTACGA TCTGACCGAG
CCGACCATCG ACTCGCAGAT GATCAATCTG TCGAAATCCG GCGCCGACGT GTTCTACAAC
ATCACCACCG GCAAGGCGTC GTCGCAGTCG ATCCGCAAGG TCGTGGAACT CGGCTGGAAG
CCGCTGCAAC TGCTGTCGGC CGGCTCGACC GGGCGCTCGA TTCTCGAGGC CGCAGGCCTC
GACAACGCCA AGGGGATCGT GGCGATCGCC TATACCAAGG ACATCGGATC GCCGAAATAC
GCCGGCGACC CCGACGTGAT GGCGTTCGAG GAATTGCGCA AGAAGTACCT GCCGAACGTC
ACGCCGGACA ATTCGATCGC GTTCTCCGGC TATGCGCAGG CCGCCGCCAT GGCGGAAATT
CTGCGCCGCT GCGGCGACGA TCTGACGCGT GAAAACGTCA TCAAGCAGGC GTCGATGCTG
GGCGGATTCC GCGCTCCGCA CATGCTGCCC GGCGTGAGCT ACTCCTACAA GCCGGACGAC
TACACTTCGA TCAAGACGCT CTACACGATG GAATTCAGCG GCAAGGACTG GATCGTGTCC
GACAAGCCGG TCGCTGAATA A
 
Protein sequence
MNLRPNSVFA SAAAVAAVML AATSAAAAEK KYDPGASDTE IKIGQTVPHS GPGSLYGVLG 
RVGEAYFQML NDKGGINGRK VKFLTLDDSY SAPKAVEATR RLVEQEEVLA LYGSLGTAPQ
TAVHKYLNNK KVPQLLLNTG ASKWNDPKNF KWTMAGLPLY PTEARILAKY VLSVKPDAKI
AILYQNDDFG RDFLGPFKKV LEDAGGKAKV IAEASYDLTE PTIDSQMINL SKSGADVFYN
ITTGKASSQS IRKVVELGWK PLQLLSAGST GRSILEAAGL DNAKGIVAIA YTKDIGSPKY
AGDPDVMAFE ELRKKYLPNV TPDNSIAFSG YAQAAAMAEI LRRCGDDLTR ENVIKQASML
GGFRAPHMLP GVSYSYKPDD YTSIKTLYTM EFSGKDWIVS DKPVAE
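
The gene and protein sequences above are printed in space-separated blocks of 10 characters, so they need light cleaning before any computation. The following is a minimal sketch of how the gene sequence could be translated and its GC fraction computed. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in permitted start codons, which this sketch ignores), and the helper names `clean`, `gc_content` and `translate` are hypothetical, not part of any database tool.

```python
from itertools import product

# Codon table laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGAATTTAC GCCCGAATTC CGTTTTCGCG"
print(translate(first_blocks))  # MNLRPNSVFA
```

Passing the complete gene sequence to `translate` and `gc_content` would allow a comparison against the listed protein sequence and the GC content field.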