Gene RS02473 details

Gene Information

Locus tag: RS02473
Symbol: RSp1071
ID: 1223383
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia solanacearum GMI1000
Kingdom: Bacteria
Replicon accession: NC_003296
Strand:
Start bp: 1351770
End bp: 1354316
Gene Length: 2547 bp
Protein Length: 848 aa
Translation table: 11
GC content: 63%
IMG OID: 637240936
Product: putative hemagglutinin-related protein
Protein accession: NP_522632
Protein GI: 17549292
COG category:
COG ID:
TIGRFAM ID:
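
The start, end, and length fields in this record are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1351770, 1354316)
print(length, protein_length(length))  # prints: 2547 848
```

Under those assumptions the computed values match the Gene Length (2547 bp) and Protein Length (848 aa) fields.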


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.335971
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.337547
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGCCA ACGCGAGCGC CTCGCGCGGC AAGGGCGAGG GCTCGGACGT GAGCTGGACG 
AACACGCACG TGTCGGCCGG CAACACGCTG ACGCTGGAGT CGGGCGGCAA CACGAACCTG
AAGGGGGCGG TGGCCACGGG CAAGCAGGTG GTGGCCAACG TGGGCGGCGA CCTGAACATC
GAAAGCCTGC AGGACACGAG CACGTACCAC ACCAAGGACC AGTCGATCGG CGGCAGCGTG
ACGGTGGGCT TCGGCTTCTC GGGCAGCGCG AACTTCAGCC AGCAGAAGAT CGACAGCGAC
TTCGCCAGCG TGACGGAGCA GTCGGGCATC AAGGCCGGCG ACCGAGGCTT CCAGGTCAAT
GTGCACGGCA ACACCGACCT GAAGGGCGCG GTGATCGCGA GCACCGACAA GGCGGTGCAG
GACGGCGCCA ACAGCCTGAC GACGGCGACG CTCACGCAGA GCGAGATCCA CAACCGTGCC
GAGTACAGCG CGAGCAGCAT CGGCATCGGC GGCGGCTACA GCTCTGGCGG TGGCAGCGGC
AAGAGCGACA ACAGCGGTGG CAAGGGCGCT AGCGCGGACG GCGTCGGCAC CAACCAGCAA
GGCCAGGCCA CCACGGGCGG CGACAAGGTG CCGGGCAGCA ACGCGCCGAC CAGCGGCAAC
TGGAGCGCCA CGCCACCGGT CGTGATGGGT GCGTCGGGCA GCGGCAGCAG CGTGACGGGC
AGCGGCATCA GCGGCGGCGC GATCCACATC ACGGATGGCG CGAAGCAGCA GGCGCTGACG
GGCAAGGACG CCGAGCAGAC GGTGGCGAGC GTCAACCGGG GGGTGTCGAG CGAGCGGGAT
AGCAGCAATG CGCTCAAGCC GATCTTCAAT GAGAAGGAGA TTCAGGCGGG GTTCGAGATC
GTTGGCGCGT TGCAGCGGGA GGCTGGAACG TTCCTGAGCA ACCGCGCGAA GGAAGTGGAC
CAGAAGAACG CTCAGGCCAA GGATGCGGAC GCAAAGGCAG CCGACCCCAG CAATGGCCTG
ACCGACGAGC AGCGTCTTGC ACTGCGCGAC CAAGCCTCGG CGCTGCGCTC GGAAGCGCAA
GCGATCAATG ACAAGTGGGG CGCGGGCGGC ACCTATCGCC AGATCACCTC GGCGCTGATG
GCCGGCGTTG GCGGCAACGT GACGGGCAGC ACGGCACAGT TCGCGCAGAA CATGGTGGTC
AACTATGTGC AGCAGCAGGG GGCGAGCTAC ATCGGCAAGC TGGTCGCGGA CGGCACGCTG
GTCGAAGGCA GCCCGGCCCA CGCCGCGTTG CATGCGATCG TCGCCTGCGC GGGTGCGGCG
GCGAGCAGTC AGAGTTGCGG CTCGGGTGCG CTGGGTGCGG CGGCGAGCAG CTTGCTGACC
GGCCTGTTCA GCGAAACCAG CCCGGACGAG ACCGCCACGC AGCGCGAGGG CAAGCGCAAT
CTCATCACCA GCCTGGTGAC CGGCATCGCC GCCATGAGCG GCGCGGATGC CACCACGGCG
ACCAACGGCG CCATCGCTGC GGTTGATAAC AACTGGCTGG CGACGCAGCA GATCGTGAAG
ATGAAGAAGG AGCTGTCCAA CGCGAAGTCG ACGCTGGAGC AGTTGAAGGT GGCCAGCAAG
TGGGCCTACA TCTCGACGAA GCAGGATGTG CTGACCACCA CGGGCATCGG CAAGGGGCTG
GCGGAGTCGG GCTGGAACGA CGTCAAGGGC GTGGCGGAAT TCCTGGCGCA CCCGATTGAG
GGGCTGAAGG GGCTGAAGCA ACTGATCAGC AGCCCGGATG CCCGGCAGCA GTTGGGTGAT
GCGCTGTTCA AAGAGTTGGA CGCCAAGATC GACCGCATGA GCTACGCCAT CGAAAAGGGT
GGCGATGAGA ATGCGGAGCA GCTTGGCAAG GACTTGGGCG GGCTGCTGTG GCAGGTGGGC
AGTGTTGTGA CCGGCGTGGG CGGGGTTGCG AAGGGGGCAA CCAAGCTGGC CTCAGTGGGT
GTCCGTCTTG GCACCGACAT GATGGAAACG CTGTCCGGTG CAGCGAAGTT CGATCGCCTG
CTTGCCAATG GTGGATTGTT CGCTGCAGAT GGCAAGCCTC TGATGGACTT CCGGAGTTTG
AGCAATCCGC AGAAGAGTAT TGTGGGTGAC ATGCTGGGTG GCGAAAAGGT TAAGCAGCTA
CTGCCGGATG CTCAGAAAAT TGGTCGAACC CCAGGCGTTG GTGAAGCCGG CATTGATGAT
CTCTACAAGG TCAATAAGCC GGGTGTGGAT TACGTAGTCG TAGAATACAA ATTTGGCTCA
TCTAAGCTGG GTAATCCAGC AGATGGTTTG CAGATGAGTG ATGATTGGAT TACAGGAGCG
AAAACAGGCA AGAGTCGTGT TCTCGATTCA TTGAGTGGTG ATAGGGTTGA GGCAGGAAAA
TTTATGGATG CATTTGATGC CGGGCGAGTT GAGAAGTGGC TGGTCCATAC AGATCCCTTC
GGAAACGTCA CCGTAGGTGT GCTAGATAAG AACGGGAAAT TTTTCCCCGA TCCCGTCAAG
GCATCGAAAA TTCTAGGATC AAAATGA
 
Protein sequence
MTANASASRG KGEGSDVSWT NTHVSAGNTL TLESGGNTNL KGAVATGKQV VANVGGDLNI 
ESLQDTSTYH TKDQSIGGSV TVGFGFSGSA NFSQQKIDSD FASVTEQSGI KAGDRGFQVN
VHGNTDLKGA VIASTDKAVQ DGANSLTTAT LTQSEIHNRA EYSASSIGIG GGYSSGGGSG
KSDNSGGKGA SADGVGTNQQ GQATTGGDKV PGSNAPTSGN WSATPPVVMG ASGSGSSVTG
SGISGGAIHI TDGAKQQALT GKDAEQTVAS VNRGVSSERD SSNALKPIFN EKEIQAGFEI
VGALQREAGT FLSNRAKEVD QKNAQAKDAD AKAADPSNGL TDEQRLALRD QASALRSEAQ
AINDKWGAGG TYRQITSALM AGVGGNVTGS TAQFAQNMVV NYVQQQGASY IGKLVADGTL
VEGSPAHAAL HAIVACAGAA ASSQSCGSGA LGAAASSLLT GLFSETSPDE TATQREGKRN
LITSLVTGIA AMSGADATTA TNGAIAAVDN NWLATQQIVK MKKELSNAKS TLEQLKVASK
WAYISTKQDV LTTTGIGKGL AESGWNDVKG VAEFLAHPIE GLKGLKQLIS SPDARQQLGD
ALFKELDAKI DRMSYAIEKG GDENAEQLGK DLGGLLWQVG SVVTGVGGVA KGATKLASVG
VRLGTDMMET LSGAAKFDRL LANGGLFAAD GKPLMDFRSL SNPQKSIVGD MLGGEKVKQL
LPDAQKIGRT PGVGEAGIDD LYKVNKPGVD YVVVEYKFGS SKLGNPADGL QMSDDWITGA
KTGKSRVLDS LSGDRVEAGK FMDAFDAGRV EKWLVHTDPF GNVTVGVLDK NGKFFPDPVK
ASKILGSK
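
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted initiation codons and is read as methionine when it starts a CDS. The sketch below illustrates this on the first 30 and last 27 bases of the gene sequence. It is a hedged example, not the pipeline that produced this record: the `translate` helper and its `is_cds_start` parameter are hypothetical names, and the amino-acid string and start-codon set are the table 11 values as published by NCBI.

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Initiation codons allowed by table 11 (assumed from the NCBI genetic code listing).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and is_cds_start and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence: matches the start of the protein.
print(translate("GTGACGGCCA ACGCGAGCGC CTCGCGCGGC"))  # prints: MTANASASRG
# Last 27 bases (in frame, ending in the TGA stop codon).
print(translate("GCATCGAAAA TTCTAGGATC AAAATGA", is_cds_start=False))  # prints: ASKILGSK
```

Both fragments reproduce the corresponding ends of the protein sequence listed above.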