Gene RPB_4103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4103 
Symbol 
ID3911910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4672497 
End bp4673702 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content62% 
IMG OID637886007 
Productextracellular ligand-binding receptor 
Protein accessionYP_487707 
Protein GI86751211 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.742112 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0857946 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAGC TGATGGCAGC GGCGTGCGCC GCCGCGTGCA TCGTCGCGTC CGCGCCTGCG 
TATGCCGGGA ATTACGGACC GGGCGTGACC GACACCGAGG TGAAGATCGG GCAGACCATG
CCGTATAGCG GCCCGGCCTC GAGCTTCGCC GCGATCGGGC GCGCGATGAC GGCGTATTTC
GAGAAGCTGA ATGCCGAGGG CGGTGTCAAC GGCCGCAAGA TCAACCTGGT CTCGCTCGAC
GACGGCTACA GTCCGTCGAA GGCGGTGGAG CAGACCCGCC GGCTAGTGGA GAGCGACGAG
GTGCTGGCGA TCGTCGGCAC CTTCGGTTCG CCGACCAATT TTGCGATTCA GAAATATCTC
AACACCAGGA AGGTGCCGGG TCTGTTCCTC GGCACCGGGG CGAACCGCGT CTCCGAGCCG
CAGACCTATC CGTGGTCGAT GGGCTGGCAG CCGAACAACC ACGCCAAGGG CGTGATCTAC
GCCAAATATC TTCTCAAGGA ACGCCCCAAC GCCAAGGTCG CGGTGCTGTA TCAGAACGAC
GATTTCGGCC GCGACTATGC CAAGGGCTTT CGCGACGGGC TCGGCGACAA GGCCGGCAGC
ATGATCGTCA AGGAGCTCAG CTACGAGATC ACCGAGCCGA CGGTCGACTC CCAGATCTTG
CTGTTGAAGT CGACCGGTGC CGATGTGTTC CTGAACATCT CGACGCCGAA GTTCTCCGCG
CAGGCGATCA AAAAGATGAC CGAGACCAAG TGGGAGGCGC TGCACATGCT GAGCGACGCC
GCCGGTTCGA TCTCGAGCAC GCTGGTGCCG GCAGGGCTGG AGAACTCCAA GGGCGTGATC
ACGGTCGCGT TCCGCAAGGA CCCGAACGAT CCGGCCTGGG CCGAGGATCC GGGGATGAAA
CAGTATCTGG CATTCATGAA GCAATACATG CCGAACGCCG ATCCGTCCGA GACGTATTAC
GTGTTTGGCT ATGCCACCGC GCAGACCTTC GAGCATGTGT TGAAGGCCTG CGGCGACGAA
CTGACCCGCG AGAACCTGAT GAAGCAGGCG GCCAGCATCA AGGATCTGGA ACTGCCGATC
CTGCTGCCCG GCATCAAGCT CAACACCAGC GCGACTCACT TCACGCCGAT GAGCCAGGAG
CAGTTGATGC AGTTCGACGG CACCAGGTGG AAGCCGATCG GCGTGGTGAT CGACGCCGCA
AAATAG
 
Protein sequence
MKKLMAAACA AACIVASAPA YAGNYGPGVT DTEVKIGQTM PYSGPASSFA AIGRAMTAYF 
EKLNAEGGVN GRKINLVSLD DGYSPSKAVE QTRRLVESDE VLAIVGTFGS PTNFAIQKYL
NTRKVPGLFL GTGANRVSEP QTYPWSMGWQ PNNHAKGVIY AKYLLKERPN AKVAVLYQND
DFGRDYAKGF RDGLGDKAGS MIVKELSYEI TEPTVDSQIL LLKSTGADVF LNISTPKFSA
QAIKKMTETK WEALHMLSDA AGSISSTLVP AGLENSKGVI TVAFRKDPND PAWAEDPGMK
QYLAFMKQYM PNADPSETYY VFGYATAQTF EHVLKACGDE LTRENLMKQA ASIKDLELPI
LLPGIKLNTS ATHFTPMSQE QLMQFDGTRW KPIGVVIDAA K