Gene RSc2041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc2041 
Symbol 
ID1220881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp2214561 
End bp2215817 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content64% 
IMG OID637238435 
Productputative urea/short-chain amide-binding signal peptide protein 
Protein accessionNP_520162 
Protein GI17546760 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR03407] urea ABC transporter, urea binding protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.300905 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTC GTGAGCTGCT CAAGCTGTCG GCCGTTGCCG CCGCAGCGCT GTTGTCGGTA 
CCGCATGTCG CGCTGGCCCA GGCCAAGGCG CCGATCAAGG TCGGTGTCCT GCATTCGCTG
TCGGGCACCA TGGCCATCTC GGAGACGTCG CTCAAGGACG TCGCGCTGAT GACGATCGAC
GAGATCAACA AGAGCGGTGG CGTGCTGGGC CGCAAGCTGG AGCCGGTGGT GGTGGACCCG
GCCTCGAACT GGCCGCTGTT CGCCGAGAAG GCGCGCGGCT TGCTGACGCA GGACAAGGTG
GCGGTCACGT TCGGCTGCTG GACTTCGGTG TCGCGCAAGT CGGTGCTGCC GGTGTACGAG
GAACTGAACG GCCTGCTGTT CTACCCGGTG CAGTACGAGG GCGAGGAGAT GTCCAAGAAT
GTCTTTTACA CCGGTGCGGC GCCCAACCAG CAGGCGATTC CCGCGGTGGA ATACCTGATG
AGCAAGGAAG GCGGCGGTGC CAAGCGCTTC TTCCTGCTGG GCACCGACTA CGTGTATCCG
CGCACGACCA ACAAGATCCT GCGCGCCTTC CTGCACAGCA AGGGCGTGAA GGACAGCGAC
ATCGAAGAGG TCTACACGCC GTTCGGCCAT GCCGACTACC AGACCATCGT CGCCAACATC
AAGCGGTTCG CGCAGGGCGG CAAGACGGCG GTGGTGTCGA CCATCAACGG CGACTCCAAC
GTGCCGTTCT ACAAGGAGCT GGGCAACGCG GGCCTGAAGG CCAAGGACGT GCCGGTGGTG
GCCTTCTCGG TGGGCGAGGA GGAGCTGCGC GGCATCGACA CCAAGCCGCT GGTCGGCCAC
CTGGCGGCGT GGAACTACTT CATGTCGGTC AAGAACCCGG TCAACGACGA TTTCAAGAAG
AAGTGGGCGG CCTGGGTCAA GAGCAACAAC CTGCCGGGCG GCGACAAGCG CGTCACCAAC
GACCCGATGG AAGCCACCTA CGTCGGCATC ATGATGTGGA AGCAGGCGGT GGAGAAGGCC
GGCTCGACCG ACGTGGACAA GGTCCGCAAG GCGATGGTCG GCCAGCAGTT CAAGGCGCCG
TCGGGCTTCA TGCTGGCGAT GAACAACAAC CATCACCTGT CCAAGCCGGT GATGATCGGC
GAGGTGCGCG GCGACGGCCA GTTCAACGTG GTGTGGAAGA CGCCGACGGC GATCCGCGCC
AAGCCGTGGA GCCCGTACAT TCCGGGCAAC GAGGGCAAGC CCGATCAGGT GATGTGA
 
Protein sequence
MKRRELLKLS AVAAAALLSV PHVALAQAKA PIKVGVLHSL SGTMAISETS LKDVALMTID 
EINKSGGVLG RKLEPVVVDP ASNWPLFAEK ARGLLTQDKV AVTFGCWTSV SRKSVLPVYE
ELNGLLFYPV QYEGEEMSKN VFYTGAAPNQ QAIPAVEYLM SKEGGGAKRF FLLGTDYVYP
RTTNKILRAF LHSKGVKDSD IEEVYTPFGH ADYQTIVANI KRFAQGGKTA VVSTINGDSN
VPFYKELGNA GLKAKDVPVV AFSVGEEELR GIDTKPLVGH LAAWNYFMSV KNPVNDDFKK
KWAAWVKSNN LPGGDKRVTN DPMEATYVGI MMWKQAVEKA GSTDVDKVRK AMVGQQFKAP
SGFMLAMNNN HHLSKPVMIG EVRGDGQFNV VWKTPTAIRA KPWSPYIPGN EGKPDQVM