Gene RPD_4371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4371 
Symbol 
ID4024896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4834587 
End bp4835795 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content66% 
IMG OID637964581 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_571489 
Protein GI91978830 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.392335 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGAGC TTCGACGATT GCCGCCGATA TCCCGCCGCG CGCTGCTGAC CGGCGCTGCG 
GGCGCCGCCA CTCTTGCGGC GGCGCCGCGC TTCGCGACTC CTGCGATCGC GCAGGCCGCG
CCGCTCAAGG TCGGGCTGAT GCTGCCTTAC ACTGGCACGT TTGCGAAGCT CGGCCAGTTC
ATCGACGACG GCTTCCGGCT GCGCATCGAG CAGCTCGGCG GCAAGCTCGG CGGCCGCGAG
GTCACCTTCG TGCAGGTCGA CGACGAGTCC AAGCCCGAGG CCGCGACCGA CAACATGAAC
CGGCTGGTCG GCCGCGAGAA AGTCGACGTG GTGATCGGCA CGGTGCATTC CGGCGTGGCG
ATGGCGATGG TCAAGGTCGC GCGCGACAGC GGAACGCTGC TGATCATTCC CAATGCCGGC
GCCAATGATG CGACCGGACC GGCCTGCGCG CCGAACATCT TCCGCACCTC GTTCTCGAAC
TGGCAGACCA CTTTTCCGAT GGGCAAGGTG ATGGCCGATG CCGGCATCAA GAACGTCGTC
ACCATCACCT GGAAATACAC CGCCGGGGCC GAAATGGTCG GCGCCTTCGC CGAGAACTTC
GCCAAGAACG GCGGCAAGAT CGTCGAGGAT CTCACTCTGC CGTTTCCGCA GGTCGAGTTT
CAGGCGCTGA TCACCCGGAT CGCGCAGCTC AAGCCGGACG CGGTGTTCAG CTTCTTCGCC
GGCGGCGGCG CTGTGAAGTT CGTCAAGGAC TACGCGGCCG CCGGCCTCAA CAAGTCGATC
CCGCTCTACG GCGCGGGCTT CCTCACTGAC GGCACGATCG AGGCGCAGGG CGAGGCGGCC
AGCGGCATCA AGACGACGCT GCATTACGCG GATAATCTGG ACAACCCCGC CAACGTCGCC
TTCCTCAAGG CATTCAAGGC CAAGACCAGC AAAGACGGCG ACATCTACGC GGTGCAGGGC
TACGACGCCG CGGCGCTGCT CGATATCGGC CTGACTTCGG TGAAGGGCGA CGCCGCGGCG
CGCGACGCGA TGATCAAGGC GATGGCGGCG GCCAGGATCG ACAGCCCGCG CGGGCCGCTG
TCGTTCAACA AGGCGCACAA TCCGATCCAG AACATCTATC TCCGCGAGGT CCGGAACGGC
CGCAACGAGA TGGTGTCGAT CGCCCAGGCC GAGGTCGATG ACCCGGCGCG CGGCTGTCGG
ATGACGTAG
 
Protein sequence
MSELRRLPPI SRRALLTGAA GAATLAAAPR FATPAIAQAA PLKVGLMLPY TGTFAKLGQF 
IDDGFRLRIE QLGGKLGGRE VTFVQVDDES KPEAATDNMN RLVGREKVDV VIGTVHSGVA
MAMVKVARDS GTLLIIPNAG ANDATGPACA PNIFRTSFSN WQTTFPMGKV MADAGIKNVV
TITWKYTAGA EMVGAFAENF AKNGGKIVED LTLPFPQVEF QALITRIAQL KPDAVFSFFA
GGGAVKFVKD YAAAGLNKSI PLYGAGFLTD GTIEAQGEAA SGIKTTLHYA DNLDNPANVA
FLKAFKAKTS KDGDIYAVQG YDAAALLDIG LTSVKGDAAA RDAMIKAMAA ARIDSPRGPL
SFNKAHNPIQ NIYLREVRNG RNEMVSIAQA EVDDPARGCR MT