Gene RSp1002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRSp1002 
Symboltek 
ID1223314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRalstonia solanacearum GMI1000 
KingdomBacteria 
Replicon accessionNC_003296 
Strand
Start bp1262422 
End bp1264131 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content65% 
IMG OID637240867 
ProductTEK signal peptide protein 
Protein accessionNP_522563 
Protein GI17549223 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00217229 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
GTGCAAGCAC AGTTCAAGAA GCGAGCATTG GTTTTAGGGG TAAGCGCATC CGCCATCGCG 
GCGCTGGTCG CATGCGGCGG CGGTGGCTCG GAATCGGGGG CCTCGCTGTC CCCGAGCCAA
GTCCAGGGCA AGGCTGTCGA CTTCTATCTA TCGCAGGCGA ACGTCGTCTT CACGGACTGC
AACAACCAGA CGACGACCAC GGACAACGAG GGCAACTTCA CCATCCCGAG CAATTGCGCC
AAGAGCGCGA TCACGGTGTC GGGCGGTACC GATATCGGCA CTGGGCTGCC GTTCACCGGC
GTGCTGCAGG CGCCCGCCGC CGATCTGACA CAGGGCGGCA CGGCGCTGGT GTCGCCGATG
ACCACCCTGC TGGCGCAGGT CGGTACGGGC CAGAGTTCGG TCCTGGCGAG CAAGCTGGGC
CTGCAGGCCA GCGACCTGCT GAACAAAGAC CCGATGAACG ATTCGGGGCT GCTCCAGAAC
GCCGTGGTCA TGCAGCAGCT GATCGATCAG ATCGCCAAGG CGCTGACCGG GCTGTCCCAG
AGCACGGGCG GCACGTTGAC CCCGACTGCT GCAGCCGCAG CGGCCGCAGC GGCAGTCGCC
AGCGTGCTGG TCGGCTCGAC CGGCTCCGCT GATCTGAGCG ATCCGACCCT GATTGCGAAC
GCTATTGTGA CGGCGGTGAA GAACAGCGCC GCCTCGTTGC CCGCCAGTGT GGTGGCCAAC
GTCGACGCGA TCGCCGCCAA CCTGGCCGCG CTGATTGCGC CGGTGATTGC CGGCAATGTG
GCCAACGTCA ATGACAGCCT CGACAGCGTG GAACTGAGCG CCACGCCGTC CGAGACGCTG
GCATCGCTGC AGAAGGCCGG CTCGATGCAC GCCGTGGTCG ATAGCGTGCA GTCGAGCGCG
TCCAACCTGC TGGCGGCGGC GATCTCGCCG GCAGCGCTGC GCGATACCTT GCTGTCGGAC
AGCCTTTCCG GGCTGGGTAC CGCAGTCGCG GAGGGTGACG AAGACACCAT CACCGAGGCC
GCGGCCACGC TGGGCAGCAA TGTCGACAGC AACAACCTGG GCAACATCAT CAGCCGCGTC
AAGTACAAGA ACTTCCTGCG TGTCGACAGC ATCAGCGTCA ATGACACGGT GGTTCCGGTT
GCCAGCGCGA TCACGCTGCG GGGCGGCACG ATTTCGAGCC TGAAGACCAG CGTGACGCAG
GTGGGCAGCC CGTTCGGCTA CAACAATTCG GAAATCCGGG CCGGCGTGCG GTATCGCTAT
AACGGCAACG AACTGAATGC CGTGATCCAG CGGATCGTGC TGACCTTCAA CAGCAGCAAC
AAGCTGGTGG CCGCGCAGGT GCCCGCCGGT ACCAACTTCG AGTTCGTCCT GAAGGGCGAT
ACCAACACCC GCCTTTCGGT CACGAGCACG GGCGACAACC TGCTCGACGG CAGCACCGGG
CAACTCGTTC TGCCGATCGA CAAGCTCCAG GCCAAGCTCA AGAATTCGGG CATCCTGACC
GCAGCGCAGG TCGACGCGCT GACGCCGAAG GCACCCGCCC GGGTCGACAT GGCGCTTGCG
ATTGCAGGCA CGTCTGGCCA GATGGTTCGG GTGAGGGCGG CGACCGGCCA CGGCAATCGC
ACGAAGTCGC TGCCGGTGAT TCGCATCAAC GCCGGAGATA GCTCGGTGGT CGGCTACGGG
AAGCGGAGCG TCGTCACGCT GCTGCCCTGA
 
Protein sequence
MQAQFKKRAL VLGVSASAIA ALVACGGGGS ESGASLSPSQ VQGKAVDFYL SQANVVFTDC 
NNQTTTTDNE GNFTIPSNCA KSAITVSGGT DIGTGLPFTG VLQAPAADLT QGGTALVSPM
TTLLAQVGTG QSSVLASKLG LQASDLLNKD PMNDSGLLQN AVVMQQLIDQ IAKALTGLSQ
STGGTLTPTA AAAAAAAAVA SVLVGSTGSA DLSDPTLIAN AIVTAVKNSA ASLPASVVAN
VDAIAANLAA LIAPVIAGNV ANVNDSLDSV ELSATPSETL ASLQKAGSMH AVVDSVQSSA
SNLLAAAISP AALRDTLLSD SLSGLGTAVA EGDEDTITEA AATLGSNVDS NNLGNIISRV
KYKNFLRVDS ISVNDTVVPV ASAITLRGGT ISSLKTSVTQ VGSPFGYNNS EIRAGVRYRY
NGNELNAVIQ RIVLTFNSSN KLVAAQVPAG TNFEFVLKGD TNTRLSVTST GDNLLDGSTG
QLVLPIDKLQ AKLKNSGILT AAQVDALTPK APARVDMALA IAGTSGQMVR VRAATGHGNR
TKSLPVIRIN AGDSSVVGYG KRSVVTLLP