Gene RSc3162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSc3162 
Symbol 
ID1222025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003295 
Strand
Start bp3418319 
End bp3421357 
Gene Length3039 bp 
Protein Length1012 aa 
Translation table11 
GC content67% 
IMG OID637239580 
Productputative hemagglutinin-related protein 
Protein accessionNP_521283 
Protein GI17547881 
COG category[S] Function unknown 
COG ID[COG4625] Uncharacterized protein with a C-terminal OMP (outer membrane protein) domain 
TIGRFAM ID[TIGR01414] outer membrane autotransporter barrel domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGC GCGCCCATTA TCGGCCCAGG CAGACAGCCA TGACGCTGGC CGTCGCGGCC 
CTGGCTGCGA TGACGCAGAT GCAGACGGCG GAGGCCGCAA CCGTGAGCAG CCCCTTCGGG
GGCATCTACG TCTGGAGCAT TGGCGATCTT TCCGTCACAT CGACCATTTC GGGCGGGGGG
ATCGGCGTGG CAGCCTTCGC CCTCGGTACC CTCGGCACGC TGAGCAACAG CGGCGTCATC
AGCGGCGGCT ACGGTTTCTA CACCTCCGGC GCGGTAGCCG CGCTGCACAA CGCGAGCGGC
GGTACGATCA GCGGCACCGG CAACGACGGC GTCTTCAACA CCAGCAACAG CACCATCGGG
ACGCTGACCA ACGATGGGGC CATCAGCGGC GCCCACTACG GCGTTCTCAA TTACTACGCC
GCGACGATCA ATACGCTGAC GAACAGCGGT GCTATCACTG GCGGCGTTGT CAGCGGCAGC
GGCAGCTGGG CCGGCATCGA CAACGCCGGG TCGATCGGCA CGCTGACCAA CACCAGCAGC
GGCACGATCA GCGCCACCAG CAACGCTTCC GGCGTGTACA ACACATACAT GATCGACACG
CTGTCCAACA GCGGCCTGAT CACCGCCGAC TCGCGTGGCA TCCTCAACAA CGGCACGATC
ACCACGCTGA TTAACAACAG CGGCGGCACC ATCAGCCACA GCGGCACCGC TTTCGGCGAC
GGCATCGACA ACAACGGCGG CACGATCGGC ACGCTCACCA ACAGCGGGGT CATCAGCGCC
GCCGGCAGCA CCGGCTACGG TAACGGCATC GACAACAGCG GCACGATCAC CACGCTGACC
AACAACAGCG GCGGCACCAT CGCCGGCAAC TACACCGGCC TCTACAACAG CGGCACGATC
GGCACGCTGA CCAACCATGG CGCCATCATC GGCGGCTACA GTGCCGGCGT CGCCAATGAC
GGCACCATCG GTACGCTGAC CAACAGCGGC GTCATCACCG GTTCCAACAG CTACAGCGGC
AACGATGGTT TCGGCGTCAA GAACTCGGGC ACGATCACCT CGCTGACCAA CGCCAGCGGC
GGTACCATCA GCGGCACGGT GAGCGGCAGC GGCGCTGCCT ACGGCGTCTT CAACGGCGGC
ACGATCGGCT CGCTCGCCAA CAGCGGCCTG ATCACCGGCA GCACCTACGC GCTGTTCAAC
GATACGGGCG GCACGCTCGG CCCCGTCACC AATGCGGGTG TGATCGCCGG CAACATCGGG
AACCTGTCGA GCAACAACCT GAGCATCAGC GGGGGCACGG GCACGACCTT CGGCACGCTC
ACCGGCTACG GCGGCGGCAG CACCCTCGGC ACCATCGACA ATCCCAATTC GAGCGTGGTG
TTCGCCTCGG GCAATCTGCT GCTCAACGAC AACATCAACC TCGGCAGCGG CACCATCAAC
GCGGTGAACA ACACCGGCTC GGCGGTGGTG CAGGTCAACC GGCCGGTGAC CATCACCGGC
AACTACAACC AGGGCAGCGG CGCGACGCTG CAGATCGGCG TTGCCGGCGG CGCCACCACG
CAAGGCACCC TCGCCACGGA CGCCGGCTAC GGCCGGCTGG TCGTGACCGG CAATACGACG
ATTGCGCAGG GTTCGTCGAT CACGCTGCAG TCCAACGGCT ACGCCTTTGC TGCAGGGCAA
CGCTACGTGG TGGTGGACAC GGCCGGCACG GCGGCCTACA ACGCGGGCAG CCTGCGGTAT
GCGATCAATG GCTACACCTC CACCGTGACG GGCGCGGCCG TGGCCAACGG CAGCAACAGC
GACCTGGTGC TGACCGTGGT CAGCGCGACG TCGCTGTCGT CGCCCGCGAC GCCAAGCCCG
ACCACGAGCA CAACCACCAC CACAACCACA ACGCAGAACC CGGCATCGAT CGCCACCGTA
CCGAACGCGG TGGCCTCGCT CAATGGCCTG TTGAGCTACA CCGGCATCAG CAGCCCCAAT
CTGCTGAATC TGTATAACGC CGCGCTGGGT TCCCTGAGCG AGGGGTCGAC GGCAAGCGCC
AACCAGATCG GCAAGCAACT GGGTGCGACC CAGACCGGGT GGGCGCCCGC TGCGCCGACA
TTCGATGCGC TGAACGTGGT GGGGGCGCAC GTCAACACAC TGCGCCTGGC GCAGGCGGCG
GGCACAACGG GGGTGGCGAC CGGCGACAGC CCGGCGTCGT GGGGCGTGTG GGGTCAGGCA
TTCGGCGGCC ACGCCCAGCA GAACGAACGC GACCAGATCG ATGGCTACAG CGCGAACTAC
GGCGGCTTGC TGATCGGCGC GGACCGTGAG ATCAACGACC GCTGGCGCGC CGGCGGCGTG
TTTAGCTACA GCAACACGGC GATCGACAAC ACCGGCGATA CCTCGGGCAA CTCCGCCCGT
GTCAATGGCT ACGGGCTGAT CGGCTACGCG AGCTATACCG GCAACCCGTG GTACGTGAAC
CTCTCGGGCG CGGCAGTGCA GCAGCGATAC GACACGAGCC GCCTGGTGAG CATGCAAGGC
CTTTCGGGCA CGGCCAGCGG GCATTTCAGC GGCCAGCAGT ATGTGGCACG CACCGAGGCC
GGGTATCCGC TGTCGGTGGG TAGCGTGACG CTCACGCCGC TGGCGAGCCT GACGTACAGC
TACCTGACCC AGGACAGCTA TACGGAGAAC GGCGGCAACG GGGCCGCCCT GTCGGTGGGC
TCCACGCATG TCGGCTCGGT CAAGAGCGGC CTGGGTGCCA AGCTGGAGAA AGGCTTCGCG
ACCCGCTACG GCCAGATCGT GCTGGAGGCG CGGGCACAGT GGATCCACGA GTACGACCAT
GCCAGGCAGG TGACGAGCGC AAGCTTTGCG GCGGACGCAA CGGGCCAGAC GGCGTTCACG
ACGGTGGGCA TGACGCCGGT GTCGGACCTC GCGGATATCT CGCTGGGCGC GACGCTGCTG
CGGGCGAACA ACCTGAGCCT GTCCGCGCGC TACGAACTGC AGGCGGGCCG GGGCTTTGTC
TCGCAGACGG GGAGCGTGCG CCTGCGGCAG CTGTTCTGA
 
Protein sequence
MNQRAHYRPR QTAMTLAVAA LAAMTQMQTA EAATVSSPFG GIYVWSIGDL SVTSTISGGG 
IGVAAFALGT LGTLSNSGVI SGGYGFYTSG AVAALHNASG GTISGTGNDG VFNTSNSTIG
TLTNDGAISG AHYGVLNYYA ATINTLTNSG AITGGVVSGS GSWAGIDNAG SIGTLTNTSS
GTISATSNAS GVYNTYMIDT LSNSGLITAD SRGILNNGTI TTLINNSGGT ISHSGTAFGD
GIDNNGGTIG TLTNSGVISA AGSTGYGNGI DNSGTITTLT NNSGGTIAGN YTGLYNSGTI
GTLTNHGAII GGYSAGVAND GTIGTLTNSG VITGSNSYSG NDGFGVKNSG TITSLTNASG
GTISGTVSGS GAAYGVFNGG TIGSLANSGL ITGSTYALFN DTGGTLGPVT NAGVIAGNIG
NLSSNNLSIS GGTGTTFGTL TGYGGGSTLG TIDNPNSSVV FASGNLLLND NINLGSGTIN
AVNNTGSAVV QVNRPVTITG NYNQGSGATL QIGVAGGATT QGTLATDAGY GRLVVTGNTT
IAQGSSITLQ SNGYAFAAGQ RYVVVDTAGT AAYNAGSLRY AINGYTSTVT GAAVANGSNS
DLVLTVVSAT SLSSPATPSP TTSTTTTTTT TQNPASIATV PNAVASLNGL LSYTGISSPN
LLNLYNAALG SLSEGSTASA NQIGKQLGAT QTGWAPAAPT FDALNVVGAH VNTLRLAQAA
GTTGVATGDS PASWGVWGQA FGGHAQQNER DQIDGYSANY GGLLIGADRE INDRWRAGGV
FSYSNTAIDN TGDTSGNSAR VNGYGLIGYA SYTGNPWYVN LSGAAVQQRY DTSRLVSMQG
LSGTASGHFS GQQYVARTEA GYPLSVGSVT LTPLASLTYS YLTQDSYTEN GGNGAALSVG
STHVGSVKSG LGAKLEKGFA TRYGQIVLEA RAQWIHEYDH ARQVTSASFA ADATGQTAFT
TVGMTPVSDL ADISLGATLL RANNLSLSAR YELQAGRGFV SQTGSVRLRQ LF