Gene RPC_3020 details


Gene Information

Locus tag: RPC_3020
Symbol:
ID: 3973627
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 3316463
End bp: 3317701
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 60%
IMG OID: 637926131
Product: putative urea/short-chain amide transport system substrate-binding protein
Protein accession: YP_532884
Protein GI: 90424514
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID: [TIGR03669] urea ABC transporter, substrate-binding protein, archaeal type
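
The coordinate and length fields in this record are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for RPC_3020.
length_bp = gene_length(3316463, 3317701)
print(length_bp)                  # 1239
print(protein_length(length_bp))  # 412
```

Both results agree with the Gene Length (1239 bp) and Protein Length (412 aa) fields.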


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.692302
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGAA CTGTAGTTCG GGGACTGCAT GCCGCAGTCC TCGCGGGGAC GCTTGTCCTC 
GCCTCAAATA TGGCCTTCGC GGAGACGCCG ATCAAGCTCG GCGTGCTGGA AGATCAATCC
GGCGACTTCG CGGTGGCAAC GATCGGAAAG GTGCACGCGA TTCAGCTCGC CGCGGAGGAG
ATCAACAAGT CCGGCGGCAT CATGGGCCGT CCGCTCGAAC TGGTGATCTA CGACACGCAG
TCGGACAATA CGCGCTACCA GGAATTCATG CGGCGCGTGC TGCAGCGCGA CAAGGCCGAC
GTGGTGTTCG CGGGATTCTC CTCGGCATCG CGCGAAGCCT ATCGCCCGAT CGTCGATCAG
CTCAACGGCT TTGCCTTCTA CAACAACCAG TATGAGGGTG GCGTCTGCGA CGGACATATG
ATCGTCACCG GCGCGGTCCC GGAGCAGCAA TTCTCGACGC TGATCCCCTA CATGATGCAG
GCCTACGGCA AGAAGGTCTA CACGCTCGCC GCCGACTATA ATTTCGGTCA AATCTCGGCC
GAGTGGGTCC GCAAAATCGT CAAGGAAAAC GGCGGCGAAA TGGTCGGCGA GGAATTCATC
CCGCTCGGCG TCTCGCAATT TTCGCAGAGC ATCCAGAATA TCCAGAAAGC CAAGCCGGAT
TTCGTGGTGA CGCTGCTGGT AGGCACCGCG CAAGCCTCGT ACTACGAGCA GGCGGCCTCG
GCCAACGTCA ACCTGCCGAT GGCGTCGTCG GTCAATGTCG GCCAGGGCTA CGAGCACAAG
CGCTTCAAGG CGCCAAGCCT GAAGGACATG TACGTCACCA CCAACTACAT CGAGGAGATC
GACTCCCCGA CGGCCAAGGC GTTCCTGGCA AAGTTCAAGG CCAAATTCCC CAACGAGCCC
TATGTCAATC AAGAAGCCGA GAACTCCTAT CTCGCGGTCT ATCTGTACAA GCAAATGGTC
GAGCGGGCGA AGTCGACCAA GCGCGACGAC ATCCGCAAGG TGATCGCGCA AGGCGACGTC
TGCATGGACG CGCCTGAAGG CAAGGTCTGC ATCGACCCGA AGAGCCAACA CATGTCGCAC
ACCATCTATC TGGCCAAGGT CGGCGCCGAT CATTCCATCA CCTTTCCGAA GGTCTGGGAG
GGCATCAAGC CGTATTGGCT CGGCGACGCC GGGTGCGACC TGACCAAGAA GGATCCGACG
GCGCAGTACA CGCCGTCGAA TCCGCCGCCG AAGCCGTAA
 
Protein sequence
MNRTVVRGLH AAVLAGTLVL ASNMAFAETP IKLGVLEDQS GDFAVATIGK VHAIQLAAEE 
INKSGGIMGR PLELVIYDTQ SDNTRYQEFM RRVLQRDKAD VVFAGFSSAS REAYRPIVDQ
LNGFAFYNNQ YEGGVCDGHM IVTGAVPEQQ FSTLIPYMMQ AYGKKVYTLA ADYNFGQISA
EWVRKIVKEN GGEMVGEEFI PLGVSQFSQS IQNIQKAKPD FVVTLLVGTA QASYYEQAAS
ANVNLPMASS VNVGQGYEHK RFKAPSLKDM YVTTNYIEEI DSPTAKAFLA KFKAKFPNEP
YVNQEAENSY LAVYLYKQMV ERAKSTKRDD IRKVIAQGDV CMDAPEGKVC IDPKSQHMSH
TIYLAKVGAD HSITFPKVWE GIKPYWLGDA GCDLTKKDPT AQYTPSNPPP KP
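
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes the amino-acid assignments of translation table 11 are the same as those of the standard code, since the two tables differ only in which codons may act as start codons. This gene begins with ATG, so no alternative start handling is needed. The names `CODON_TABLE`, `clean`, `translate`, and `gc_fraction` are hypothetical helpers written for this example.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence as printed in this record.
print(translate("ATGAACCGAA CTGTAGTTCG GGGACTGCAT"))  # MNRTVVRGLH
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein sequence and the GC content field to be compared in the same way.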