Gene RPC_1944 details


Gene Information

Locus tag: RPC_1944
Symbol:
ID: 3973576
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2115975
End bp: 2117327
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 63%
IMG OID: 637925055
Product: twin-arginine translocation pathway signal
Protein accession: YP_531820
Protein GI: 90423450
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
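
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the terminal stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2115975
END_BP = 2117327


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1353 450
```

Under these assumptions the result agrees with the listed Gene Length (1353 bp) and Protein Length (450 aa).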


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCGACA ATAAGCTCAT CATCACTCGC AGAACCGCGC TCAAGACCGG GCTTGCCGGC 
GCAGCCGCGC TGGCGACACC GACGTTCTTC ATTCGCGACG CCTGGGCGGA TGAATTCTGC
AATATGCCGA AGGGCAAGGA AGTCACGTTC GGCTTCAACG TGCCGCAGTC CGGTGCGTAT
GCAGACGAAG GCATGGACGA ACTCAAGGCC TACCAGCTAG CGGTCAAGCA TCTCAATGGT
GAGGGCGACG GCGGCATGCT CAAGACCATG AAGCCGCTGG CGCTGAAGGG CAACGGCGTG
CTCGGCAAGA AGGTCGCCTA CGTCTCCGGC GACACCCAGA CCAAGGCGGA CCAGGCGCGC
GCCACCGCCA AGCGCATGAT CGAGAAAGAC GGCGCGATCA TGATCACCGG CGGCTCGTCG
TCTGCCGAGG CGGTCGCGGT GCAGTCGCTG TGTCAGGACG TCGGCGTGAT CTTCATGGCC
GGCCTGACCC ATTCCAACGA CACCACCGGC AAGGACAAGC GCGCCTACGG CTTCCGCCAT
TTCTTCAACG CCTACATGTC GGGCGTGGCA CTCGGCCCGG TGCTCTCGGA AGCCTATGGC
AAGGATCGCG CCGCTTATCA TCTGACCGCC GACTACACCT GGGGCTGGAC CCAAGAAGAG
TCGATGAAGG ACGCCACCGA GAAGCAGGGC TGGAAGACCG TCAAGGCCGT TCGCACGCCG
CTCGGCGCAG CCGACTTCTC GCAATACATC ACGCCGGTGC TGAATTCGGG CGCCGACGTG
CTGATCTTGA ACCACTACGG CAAGGACATG ATCAATTCCT TGACCCAGGC GGTGCAGTTC
GGTCTTCGCG ACAAGATGGC AAACGGCAAG AAGTTCGAGA TCGTCGTGCC GCTGTTCTCC
GAACTGATGG CGCAGGGCGC CGGCGATGCG ATCAAGGGCA TCTACGGCAC CGCGAACTGG
GACTGGAAGC TGGAAGACCC GGCGACCAAT GCCTTCACCA AGTCGTTCGG TGCGGCCTAC
GGCACGCCGC CGTCGCAGGC GGCGCAGACC TGCTACGTTC AGGCGATCCT CTACGCCGAC
GCCTGCGAGC GCGCCGGCAG CTTCAACCCG TCCGCCGTGA TCAAGGCGCT GGAGGGCTTC
GAGTTCGACG GCATGGGCAA CGGCAAGACG CTGTACCGCG CCGCCGACCA TCAGTGCTTC
AAGGACGTGC TGGTCGTGCA GGGCAAGGAC AAGCCCAAGG ACAAGTTCGA CCTGCTCGAG
GTCCGGAAGA TCGTGCCTGC ATCGCAGGTC ACCTACGATC CGAGCATCTT CGGCGGCGAA
CTCGGCTCCA AGGACGCCAA GAAGTGCGGA TAA
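
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) strip the whitespace and compute GC content as a percentage. Only the first line of the sequence is used as sample input; pasting the full sequence would be needed to compare against the GC content reported in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a sample fragment.
first_line = "ATGGTCGACA ATAAGCTCAT CATCACTCGC AGAACCGCGC TCAAGACCGG GCTTGCCGGC"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_content(fragment), 1))
```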
 
Protein sequence
MVDNKLIITR RTALKTGLAG AAALATPTFF IRDAWADEFC NMPKGKEVTF GFNVPQSGAY 
ADEGMDELKA YQLAVKHLNG EGDGGMLKTM KPLALKGNGV LGKKVAYVSG DTQTKADQAR
ATAKRMIEKD GAIMITGGSS SAEAVAVQSL CQDVGVIFMA GLTHSNDTTG KDKRAYGFRH
FFNAYMSGVA LGPVLSEAYG KDRAAYHLTA DYTWGWTQEE SMKDATEKQG WKTVKAVRTP
LGAADFSQYI TPVLNSGADV LILNHYGKDM INSLTQAVQF GLRDKMANGK KFEIVVPLFS
ELMAQGAGDA IKGIYGTANW DWKLEDPATN AFTKSFGAAY GTPPSQAAQT CYVQAILYAD
ACERAGSFNP SAVIKALEGF EFDGMGNGKT LYRAADHQCF KDVLVVQGKD KPKDKFDLLE
VRKIVPASQV TYDPSIFGGE LGSKDAKKCG
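
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch, not the database's own method: it builds the codon table from the standard NCBI ordering of bases (T, C, A, G), whose codon-to-amino-acid assignments translation table 11 shares with the standard code. Table 11 differs in its permitted start codons, which this sketch does not handle; that does not matter here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids in NCBI codon order: first base varies slowest, third base fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGGTCGACA ATAAGCTCAT CATCACTCGC AGAACCGCGC TCAAGACCGG GCTTGCCGGC"))
# MVDNKLIITRRTALKTGLAG
print(translate("CTCGGCTCCA AGGACGCCAA GAAGTGCGGA TAA"))
# LGSKDAKKCG
```

Both fragments match the corresponding start and end of the protein sequence listed above. The last line can be translated on its own because every preceding line holds 60 bases, which keeps it in frame.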