Gene RPD_1814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1814 
Symbol 
ID4022296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2031157 
End bp2032137 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content65% 
IMG OID637962008 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_568951 
Protein GI91976292 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.443976 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGCAT TCACGCTCCG CCGGATGTTG CAGGCGATCG GGGTCATGAT CGTCGTCTGC 
GCGCTGTCCT TCGCGATGTT CCGCTTCGCC GGCGATCCGG TCAGCCAGAT CGTCTCGATC
GACACCTCGA CGGCCGAGCG CGCCGAAATC CGCAAGTCGC TCGGGCTCGA CGACCCAGTG
CTGCTGCAGT TCGGCCGCTA CTTCGTCAAC GCAGCGCAGT TCGACTTCGG CATGTCGTAT
CGCTTCCGCG AGCCGGTCGC CAAGCTGCTG CTGGAGCGAA TGCCGGCGAC GCTGGAGCTC
GCGACCTGCG CGACGGTGCT GGCGATGACG CTCGGCATTC TGCTCGGGGT CTACACCGCG
CTCCGGCGCA ACTCCTGGCT GGCCACGCTG ATGCAGGCGG TCTCGCTGAT CGGCATCTCG
CTGCCGACCT TCCTGATCGG CATCCTGCTG ATCTATCTGT TCGCGGTGGT GCTGGGCTGG
CTGCCGTCCT ACGGCCGCGG CGAGACGGTT CGGTTCGGCT GGTGGACCAC CGGCCTGCTC
ACCACATCCG GCCTCAAATC GCTGATCATG CCGTCGATCA CGCTCGGCCT GTTCCAGATG
ACGCTGATCA TGCGGCTGGT GCGCGCCGAG ATGCTCGAAG TGCTGCGCAC CGACTACATC
CGCTTCGCCC GCGCCCGCGG ACTGACCACC CGCGCCATCC ATTTCGGCCA TGCGCTGAAG
AACACGCTGG TGCCGGTGAT CACCGTCGCC GGCCTGCAAT TCGGCTCGGT GATCGCCTTC
GCGATCATCA CCGAGACGGT GTTCCAGTGG CCGGGCATGG GGCTGCTGTT CGTGCAGGCG
GTGCAGAACG TCGATATTCC GATCATGGCG GCGTATCTGC TGGTGGTGTC GCTGATCTTC
GTCACCATCA ATCTGGTGGT CGACATTCTC TACACGCTGG TCGATCCGCG GCTGCGCGCC
AGCGCCGCAC GACGGACATA G
 
Protein sequence
MLAFTLRRML QAIGVMIVVC ALSFAMFRFA GDPVSQIVSI DTSTAERAEI RKSLGLDDPV 
LLQFGRYFVN AAQFDFGMSY RFREPVAKLL LERMPATLEL ATCATVLAMT LGILLGVYTA
LRRNSWLATL MQAVSLIGIS LPTFLIGILL IYLFAVVLGW LPSYGRGETV RFGWWTTGLL
TTSGLKSLIM PSITLGLFQM TLIMRLVRAE MLEVLRTDYI RFARARGLTT RAIHFGHALK
NTLVPVITVA GLQFGSVIAF AIITETVFQW PGMGLLFVQA VQNVDIPIMA AYLLVVSLIF
VTINLVVDIL YTLVDPRLRA SAARRT