Gene RPB_4216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4216 
Symbol 
ID3912024 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4788812 
End bp4789891 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content68% 
IMG OID637886119 
ProductABC transporter related 
Protein accessionYP_487818 
Protein GI86751322 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.628741 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGTGA GCCTGGACAA CGTCACCCGG ACGATCGACG GGCTGCCGGC GATCTGCGAC 
GTGTCGCTGA CGCTGGAGCG CGGCACGCTG AGCGTGCTGC TCGGACCGAC GCTGTCCGGC
AAGACCTCGA TCATGCGGCT GCTCGCCGGC CTCGACAAGC CGAATTCCGG TCGCGTCCTG
GTCGACGGCC GGGACGTCAC CGGGGCCGAC GTACGCAAGC GCTCGGTGGC GATGGTCTAT
CAGCAGTTCA TCAACTACCC GTCGCTGACG GTGTACGAGA ACATCGCCTC GCCGCTGCGG
GTGCAGCGCA AACCGCGCGC CGAGATCGAG CAGCGCGTGC AGGAGGCGGC GCAGCTGCTC
AAGCTCGAGC CGTATCTGAA GCGCACGCCC TTGCAGCTCT CCGGTGGTCA GCAGCAGCGC
ACCGCGATCG CCCGGGCGCT GGTCAAGGGC GCGGATCTGG TGCTGCTCGA CGAACCGCTG
GCCAATCTCG ACTACAAGCT GCGCGAGGAA CTGCGCACCG AACTCCCGCG GATCTTCGAA
GCCTCCGGCG CGATCTTCGT CTATGCCACC ACCGAGCCGT CCGAGGCGCT GCTGCTCGGC
GGCCGCACCA TCTGCATGTG GGAGGGCCGG GTGCTGCAGA CCGGGCCGAC ACCGCAGGTC
TATCGTCGGC CCGACACGCT ACGCGTCGCG CAGGTGTTTT CCGATCCGCC GCTCAACATC
GTCGGCGCCG AGAAGAAGAG CGGCACCGTG CATTATGCGG GAGGCGTCAC CGCGCCCGCG
ACCGGCGTCT TCGAAGGCCT CGGCGATGGC GTCTATCGGG TCGGTTTCCG CGCTCACCAG
ATCGCGGTGG CGCGCGGCGA CGCCGACCGC CACGGCTTTC AGACGACGGT CGCGGTGACG
GAAATCACCG GCTCGGAGAG CTTCGTGCAT CTGCGGCGCG GCGACGACAA TTGGGTCGCG
GTGCTGCACG GCGTCCACGA ATTCGAGCCC GGCCAGACGC TCGACGCGGT GCTCGATCCT
GCCAATCTGT TCGTGTTCGA CGCGGCCGAC CGCCTCGTCG CCGCGCCGAA GCCGATGTGA
 
Protein sequence
MSVSLDNVTR TIDGLPAICD VSLTLERGTL SVLLGPTLSG KTSIMRLLAG LDKPNSGRVL 
VDGRDVTGAD VRKRSVAMVY QQFINYPSLT VYENIASPLR VQRKPRAEIE QRVQEAAQLL
KLEPYLKRTP LQLSGGQQQR TAIARALVKG ADLVLLDEPL ANLDYKLREE LRTELPRIFE
ASGAIFVYAT TEPSEALLLG GRTICMWEGR VLQTGPTPQV YRRPDTLRVA QVFSDPPLNI
VGAEKKSGTV HYAGGVTAPA TGVFEGLGDG VYRVGFRAHQ IAVARGDADR HGFQTTVAVT
EITGSESFVH LRRGDDNWVA VLHGVHEFEP GQTLDAVLDP ANLFVFDAAD RLVAAPKPM