Gene RS01887 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRS01887 
SymbolRSp0820 
ID1223127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003296 
Strand
Start bp1036232 
End bp1037572 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content68% 
IMG OID637240680 
Productputative hemagglutinin-related protein 
Protein accessionNP_522381 
Protein GI17549041 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2706] 3-carboxymuconate cyclase 
TIGRFAM ID[TIGR02276] 40-residue YVTN family beta-propeller repeat 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCATTGG CGCTGACGCT CGCGGCATCC GCCTGTGGCG GTGGCGGCGA TGCTTCTACC 
GCGGCACCGG CCGGCATGTC CGCCGGTGCG GCCACTTACA GCGTTGGCGG CAGCGTGTCG
GGCCTGGGGC CGGGGTTGTC GCTCCAGTTG CTCAACAACG GCGGCGATGC TGTCACGGTT
GGCGGCAATG GCGGCTTTCG CTTCCCGGGC AAGCTGTCGG CTGGAGCTAC CTACGCGGCG
ACCGTCGGCA CCCGGCCGTC GGGCCAGCAG TGCACGATCG ACAAAGGCAG TGGCACGGTC
GCCAACGCGG ATGTTGCCGA TATTGCAGTG ACGTGCTCGG CACGCCCGCT CTTTGCCTAC
GTGGCGAATG CGGATGACAA CACCGTGCAG GCGTTTGCGC TGGACCCCGC CACGGGGGCC
GCCACCGGTG TGGGCCGTCC TGCGGCCGTG GGGCACGGTC CGGTCTCGCT GGTTGCTGAC
CCTGCGGGCA CAACCCTCTA TGTGGTCAAC GCCTCCGACA ATACGGTGAC CACGCTGGCG
ATCCATCCGG ATACCGGTGC CGTGTCGGTC AGCGCGCCAG CCGTCCGCAC GGGAGCGTCT
CCGCTGAGCA TCGCGCGCAC GCCCGCCGGA CCGTTCGCCT ATACCGCCAA CGCCGGGGAC
AATACGCTGT CGATCTTCAA GATGAGCGCC AAGGCAGAGG CTCCGGCCCC GCGCGGTGTC
GTGCAGGCGG GCTCCAACCC GTACACCGTT GCTGTCAACG GCACGGGCAC GTTCGCCTAT
GTGGTGAACG CGGCCATGGT TTCCGGCGCG CCGTCGGTCA TGGCCTTTGC CATCAACGGT
GCAACCGGCG CGCTGGTGCC GGCAGGCAGC CCGGCGGCAA CGGGCCATGC ACCGTTTTTC
ATTGCACTCC ATCCGGCGGG GAGGTTTGCC TATGTCGCCA ATTTCGCCGA CGACACGCTG
AGCGTCTACG CCATCAACGG GGCCACCGGC GCGCTCGCCC CGGCGGGCAG CCCGGTTGCC
ACCGGCGGCA ATCCGTTCGC GATCGCGATC CATCCGTCAG GCCGCTTCGC TTATGTCGCC
AACGTGTTCT CAGGCACGAT CAGCCTGTTC GCCATCGACG CACGCACCGG TGCGCTGACC
CCCATCGGCA CTGTGCAGGC AGGCGCCAGC CCGGTTGCCA TGACGCTCAA TCCGGCCGGC
ACTGTTGCCT ACGTGTTGAA TGCCGGCGAC GACACCATGG CCATCTACCG GGTGGACGGC
GGCACGGGCG TTCTGAGCGC GGTAGCCACG GTACAGACAG GCCTCACGCC GAGCGCGATG
GCGATCGTGG CCGTGCCTTA G
 
Protein sequence
MALALTLAAS ACGGGGDAST AAPAGMSAGA ATYSVGGSVS GLGPGLSLQL LNNGGDAVTV 
GGNGGFRFPG KLSAGATYAA TVGTRPSGQQ CTIDKGSGTV ANADVADIAV TCSARPLFAY
VANADDNTVQ AFALDPATGA ATGVGRPAAV GHGPVSLVAD PAGTTLYVVN ASDNTVTTLA
IHPDTGAVSV SAPAVRTGAS PLSIARTPAG PFAYTANAGD NTLSIFKMSA KAEAPAPRGV
VQAGSNPYTV AVNGTGTFAY VVNAAMVSGA PSVMAFAING ATGALVPAGS PAATGHAPFF
IALHPAGRFA YVANFADDTL SVYAINGATG ALAPAGSPVA TGGNPFAIAI HPSGRFAYVA
NVFSGTISLF AIDARTGALT PIGTVQAGAS PVAMTLNPAG TVAYVLNAGD DTMAIYRVDG
GTGVLSAVAT VQTGLTPSAM AIVAVP