Gene RPB_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1040 
Symbol 
ID3909164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1195129 
End bp1196106 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content66% 
IMG OID637882933 
Productbinding-protein dependent transport system inner membrane protein 
Protein accessionYP_484661 
Protein GI86748165 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.314534 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTATCC TGAATCTTGC GGGGCGGCGA CTCGCCGCCT CGATCCCGAC CTTGCTGCTG 
ATCCTGATCG GCATCTTCCT GCTGCTGCAA TTCGCGCCCG GCGACACCGT CGACGCGATG
ATGGCGCAGA TGGGCGGCGG CGACGCCGCG ACCGCGCGCG AGCTGCGGCA GTTCTACGGG
CTCGATCTGT CGATCCCGAT GCAGCTCGGC AACTATCTGT GGCGGCTGGT GCGGTTCGAT
CTCGGCTTCT CGTCGATCTA CGGCAAGCCG GTCGCGACCG TGATCCTGGA GCGGCTGCCG
CCGACGCTGC TGTTGATGAC CGCGTCGCTG TCATTCGCGT TCTTCGCAGG CCTGGTGCTC
GGCGTGATCG CCGCGCGCGG CGTCAACAAA TGGCCGGACA CGCTGATCTC CACGCTCGGC
CTGATCTTCT ACGCGACGCC ATCCTTCTGG TTCGGCCTGA TGGCGATCGT GGTGTTCTCG
GTCTATCTGC AATGGCTGCC GGCCGGCGGC TTCGAGGATA TCGGCGCGGC CTCGACCGGG
CTCGCCCGGA CGCTCGACAT CGCCAGCCAT CTGGTGCTGC CGACGCTGAC GCTGGGGCTG
ATCTTTCTGG CGATCTATCT GCGGATCATG CGCGCCTCGA TGCTCGAAGT GCTCAATCTC
GATTTCGTCC GCACCGCCCG CGCCAAGGGC CTCGACGAGA CCCGCATCGT CGTCCGCCAC
GTGCTGCGCA ACGCGCTCTT ACCGATGGTG ACGCTGATCG GGCTGCAGGC CGGCACCATG
CTGGGCGGCT CGGTGGTGGT CGAAAGCGTG TTCTCGCTGC CGGGCCTCGG CCGGCTCGCC
TATGAGTCCG TGGTGCAGCG CGACCTCAAC ACGTTGCTCG GCATCGTGTT CGTCTCGGCG
CTCTTGGTGA TCGCCGTCAA CTTCCTGGTC GACCTCTTAT ATGCGCGGCT CGACCCGCGC
ATCACCGCCG GGACGTGA
 
Protein sequence
MRILNLAGRR LAASIPTLLL ILIGIFLLLQ FAPGDTVDAM MAQMGGGDAA TARELRQFYG 
LDLSIPMQLG NYLWRLVRFD LGFSSIYGKP VATVILERLP PTLLLMTASL SFAFFAGLVL
GVIAARGVNK WPDTLISTLG LIFYATPSFW FGLMAIVVFS VYLQWLPAGG FEDIGAASTG
LARTLDIASH LVLPTLTLGL IFLAIYLRIM RASMLEVLNL DFVRTARAKG LDETRIVVRH
VLRNALLPMV TLIGLQAGTM LGGSVVVESV FSLPGLGRLA YESVVQRDLN TLLGIVFVSA
LLVIAVNFLV DLLYARLDPR ITAGT