Gene RPB_1663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1663 
Symbol 
ID3908650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1892458 
End bp1893654 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content64% 
IMG OID637883557 
Productextracellular ligand-binding receptor 
Protein accessionYP_485282 
Protein GI86748786 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.596073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACACG TCGCACGACT GGCTTTGACG GCACTGCTTT TCGCCTCCGG CGCTGCCTAT 
GCGCAGCAAG GCGAGATCAA GGTGGGAGAG ATCAACTCCT ATTCGGCACT CCCCGGCTTC
ACCGAGCCAT ACCGCAAGGG CATGGAACTG GCGCTGAAGC AGATCAACGA TGCCGGCGGC
ATCAAGGGCA AGAAGCTCGT CGTCATCACC AAGGACGACG GCGGCAAGCC GGGCGACGCG
CTGACCGCCG CCAACGAGCT GGTGTCGCGC GACGGCGTGG TGATGATCGC CGGCGGCTTC
CTGTCGAATA TCGGCCTGGC GCTGTCCGAC TTCGCCAAGC AGAAGAAGGT GCTCTACGTC
GCCGCCGAGC CGCTGACCGA CGCCATCGTC TGGTCGAAGG GCAACGACTA TACTTTCCGG
CTGCGCACAT CGAACTATAT GCAGGCCTCG ATGCTGGCGG AGGAAGCCGC CAAGCTGCCG
GCCAAGAAGT GGGCGACGAT TGCGCCCAAC TACGAATTCG GCCAGTCGTT CGTCGCCGCG
TTCAAGGAGA TTCTCAGCAA GAAGCGTCCC GACGTCGAGT TCGTCGCCGA GCAATGGCCG
CCGCTGAACA AGATCGACGC CGGCCCGGTG CTGCAGGCGA TCGACGCCGC CAAGCCCGAC
GCCATCCTCA ACGCCACCTT CGCCGGCGAC CTGGTCAAGC TGGTGCGCGA GGGCAATACC
CGCGGCGTGT TCAAGGATCG CGCGGTGGTG AGCTATCTCA CCGGCGAGCC GGAATATCTC
GACCCGCTGA AGACCGAAAC GCCGGAGGGC TGGATCGTCA CCGGCTATCC CTGGTACGCG
ATCAGCACGC CGGAGCATCA GGCTTTCCTC GACGCTTACC AACAGCTCAA CAAGGACTAT
CCGCGGCTCG GCTCGGTGGT CGGCTACGCC ACCGTGAAGA CCATCGCCGC CGTGCTGACC
GCCACCGACG ATCACTCCAC CGACGGCCTG GTCAAGGCGA TGAAGAACCT GAAGGTCGAC
ACCCCGTTCG GCGCGGTCGT CTACCGCGCC GGCGATCATC AGTCGACGAT GGGCGCCTAT
GTCGGCAAGA CCACGCAGAA GGACGGCAAG GGCATCATGA CCGACATCAA GTTCAAGAAG
GGCGCCGACT ATCTACCGCC CGAGGCCGAG GCCGCCAAGC TGCGTCCGGC GAACTGA
 
Protein sequence
MKHVARLALT ALLFASGAAY AQQGEIKVGE INSYSALPGF TEPYRKGMEL ALKQINDAGG 
IKGKKLVVIT KDDGGKPGDA LTAANELVSR DGVVMIAGGF LSNIGLALSD FAKQKKVLYV
AAEPLTDAIV WSKGNDYTFR LRTSNYMQAS MLAEEAAKLP AKKWATIAPN YEFGQSFVAA
FKEILSKKRP DVEFVAEQWP PLNKIDAGPV LQAIDAAKPD AILNATFAGD LVKLVREGNT
RGVFKDRAVV SYLTGEPEYL DPLKTETPEG WIVTGYPWYA ISTPEHQAFL DAYQQLNKDY
PRLGSVVGYA TVKTIAAVLT ATDDHSTDGL VKAMKNLKVD TPFGAVVYRA GDHQSTMGAY
VGKTTQKDGK GIMTDIKFKK GADYLPPEAE AAKLRPAN