Gene RPB_1556 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1556 
Symbol 
ID3908755 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1754050 
End bp1755330 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content67% 
IMG OID637883452 
Productbranched chain amino-acid ABC transporter substrate-binding protein 
Protein accessionYP_485177 
Protein GI86748681 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA AATCACTGTT GAGCACGGCG TCGCTGGCGC TGCTGATCGC CGCGACGTCG 
GCCACCGCGC AGGCGCAGAT CGCGATCGGC CATCTCGCCG ATTATTCCGG CGGCACCTCG
GACGTCGGCA CGCCCTACGG CCAGGCCGTC GCCGACACCT TCGCTTGGGT CAACAAGAAC
GGCGGCGTCG GCGGCAAGCA GCTCAATGTC GACACCAACG ACTACGGCTA CCAGGTGCCG
CGCGCGATCG CGCTGTACAA GAAATGGTCG GGCGGCGACA AGGTCGCGGC GATCATGGGC
TGGGGCACCG CCGACACCGA GGCGCTGACC GGCTTCCTCG CCCAGGACAA GATCCCCGAC
ATGTCGGGCT CCTACGCCGC GGCGCTGACC GACCCCGAAG GCACCAGCGG CAAGGCCAAG
CCGGCGCCGT ACAACTTCTT CTATGGCCCG TCCTATTCCG ATGCGGTGCG CGCCGAACTG
ATGTGGGCCG CCGAGGACTG GAAGGCCAAG GGCAAGACCG GCGCGCCGAA ATTCGTCCAC
ATGGGCGCCA ATCATCCCTA CCCCAACGCG CCCAAGGCCG CCGGCGAAGC GCTCGCCAAG
GAGCTCGGCT TCGAGGTGCT GCCGCCGCTG GTGTTCGCGC TGTCGCCGGG CGACTACAGC
GCCCAGTGCC TCAGCCTGAA GAGCTCGGGC GCCAACTACG CCTATCTCGG CAACACCGCG
GCCTCGAATA TTTCGGTGAT GAAGGCCTGC AAGGCCGCCG GCGTCGACGT GCAGTTCATG
AGCAACGTCT GGGGCATGGA CGAGAACGCC GCCAAGGCCG CGGGCGACGC CGCCGATGGC
GTGATCTTCC CGCTGCGGAC TGCGGTCGCC TGGGGCGGCA ACGCGCCCGG CATGAAGACG
GTGGAGACGA TCTCCAAGAT GTCGGACCCG TCCGGCAATG TGTATCGGCC GGTGCATTAC
GTCGCCGCGG TGTGCTCGGC GATGTACATG AAGGAAGCGC TCGACTGGGC CGCCAAGAAC
GGCGGCGCCA CCGGCGAGAA CGTCGCCAAG GGCTTCTACC AGAAGAAGGA CTGGGTGCCG
GCCGGGATGG AGGGCGTCTG CAACCCGTCG ACCTGGACCG CCAAGGACCA CCGCGGCACG
ATGAAGATCG ACCTGTATCG CGCCAAGGTG TCGGGCGCGA CCGATGGCGA CCTCAAGGAC
CTGATCGCGA AGGGCACGAT CAAGCTCGAA AAGGTCAAGA CCGTCGACCT GCCGCGCAAG
CCGGAATGGT TCGGCTGGTG A
 
Protein sequence
MTMKSLLSTA SLALLIAATS ATAQAQIAIG HLADYSGGTS DVGTPYGQAV ADTFAWVNKN 
GGVGGKQLNV DTNDYGYQVP RAIALYKKWS GGDKVAAIMG WGTADTEALT GFLAQDKIPD
MSGSYAAALT DPEGTSGKAK PAPYNFFYGP SYSDAVRAEL MWAAEDWKAK GKTGAPKFVH
MGANHPYPNA PKAAGEALAK ELGFEVLPPL VFALSPGDYS AQCLSLKSSG ANYAYLGNTA
ASNISVMKAC KAAGVDVQFM SNVWGMDENA AKAAGDAADG VIFPLRTAVA WGGNAPGMKT
VETISKMSDP SGNVYRPVHY VAAVCSAMYM KEALDWAAKN GGATGENVAK GFYQKKDWVP
AGMEGVCNPS TWTAKDHRGT MKIDLYRAKV SGATDGDLKD LIAKGTIKLE KVKTVDLPRK
PEWFGW