Gene RSc1684 details

Gene Information

Locus tag: RSc1684
Symbol:
ID: 1220520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia solanacearum GMI1000
Kingdom: Bacteria
Replicon accession: NC_003295
Strand:
Start bp: 1802822
End bp: 1804060
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 66%
IMG OID: 637238074
Product: phage phi-C31 GP36-like protein
Protein accession: NP_519805
Protein GI: 17546403
COG category: [R] General function prediction only
COG ID: [COG4653] Predicted phage phi-C31 gp36 major capsid-like protein
TIGRFAM ID: [TIGR01554] phage major capsid protein, HK97 family
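
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, but both agree with the reported values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1802822
end_bp = 1804060

# Assumption: coordinates are 1-based and inclusive,
# so the gene spans end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: one codon (3 bp) per residue, with the final
# stop codon not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values match the reported Gene Length (1239 bp) and Protein Length (412 aa).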


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.885456
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCTGG CTGAACTGAA GCAGAAGCGT GCAAAGATCG CTGCCGAAAT GCGCTCGCTG 
AACGACAACA TTGGCGAGGT CGCCTGGAAC GATGAACAGC GCTCGCGCTG GGATGCCATG
CGCGCGGATC TCAAGAAGCT CGACGAGCAG ATCGAGCGCG AGGAAGAACT GCGCAGCGCG
GAGCAGCGCT ACGTCGAGAG CAACGCGGAC GACCTGGCTA GGCAGGCGCG GCAGGCTGCG
GCTGCTGCCA CCGGCGGGAC AACGGACGAC GAGCGCCGTG CCGCGGCTTT TGGGCGCTTC
CTGCGCGAAG GCCTCGGCGA GCTGTCCGCC GAAGAGCGCA GGGCGCTGCA GGAACTGCGC
GCGCAGAGTG CCGGCGTGCC GGACAAGGGC GGCTACACCG TGCCCCGCAC CTTCCTGGCC
AAGGTGGTCG AGCAACTGGT GACCTACGGC GGCATGGCCA GCGTCATGCA GAATCTGACT
ACGGACGGCG GCGAGCCGAT CGACTGGCCG GTGGCGCTGG GCGTGGCGGA GGAGGGCGAA
CTCCTCGGCG AAGGCGAGGC GACAGGCGAA GACGACATCG ATTTCGGCAT CGGAACCCTT
GGCGCTCACA AGCTCTCGTC GAAGGTGGTC CGTGTCAGCG AGGAACTGCT CAGCGACTCG
TCCGTCGACA TCGAAGCGTT CCTGGCCGGT CGCATCGCCG CGCGCATCGG CCGTGCGGAA
TCTCGCCTGG TGGTGCAGGG GACGGGCGAC GGCAAGCCGC AGCAGCCGCA GGGGCTGGCC
GCATCCGTGA CTCTCACCAA GAGCACGGCG AATGCCGCGA AGCTGACCTG GCAGGAGGTC
AACACCCTGA TTCACGCCGT GGATCCGGCT TACCGGAATG CGCCCATGTA TCGCCTGGCC
TTCAACGATC AGACGTTGAA GGTGCTTGAG GAGTTGGTCG ACGGCAACGG TCGCCCGCTG
TGGCTGCCGG GCCTGGAATC GTCCGCGCCG CCGACGATTC TCAAGCGGCA ATACGTGATC
GACCAAGCGA TCGATGATAT CGGCGCCGGC AAGAAATTCA TGTATGGCGG CGACTTCAAC
CAGTTCATTC TGCGGCGTGT GCGGTCCATG GCGATCAAGC GACTGGTCGA GCGCTACGCG
GAATACGGCC AGGTCGGCTT CCTGGCCTTC CACCGCTTCG GTTGCGTGCT GCAGGACACG
TCCGCAATTG CGGCGCTGGT CGGCAAGCCG GCTGTGTAA
 
Protein sequence
MTLAELKQKR AKIAAEMRSL NDNIGEVAWN DEQRSRWDAM RADLKKLDEQ IEREEELRSA 
EQRYVESNAD DLARQARQAA AAATGGTTDD ERRAAAFGRF LREGLGELSA EERRALQELR
AQSAGVPDKG GYTVPRTFLA KVVEQLVTYG GMASVMQNLT TDGGEPIDWP VALGVAEEGE
LLGEGEATGE DDIDFGIGTL GAHKLSSKVV RVSEELLSDS SVDIEAFLAG RIAARIGRAE
SRLVVQGTGD GKPQQPQGLA ASVTLTKSTA NAAKLTWQEV NTLIHAVDPA YRNAPMYRLA
FNDQTLKVLE ELVDGNGRPL WLPGLESSAP PTILKRQYVI DQAIDDIGAG KKFMYGGDFN
QFILRRVRSM AIKRLVERYA EYGQVGFLAF HRFGCVLQDT SAIAALVGKP AV
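
The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand and uses the codon assignments of translation table 11, which are the same as the standard code. The function name `translate` and the table-building approach are illustrative choices, not part of the record, and the sketch ignores the alternative start codons that table 11 permits (this gene starts with ATG).

```python
from itertools import product

# Codon table in TCAG order (first, second, third base), third base
# varying fastest. Table 11 uses the same codon-to-amino-acid
# assignments as the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACCCTGG CTGAACTGAA GCAGAAGCGT"))  # MTLAELKQKR
```

The printed residues correspond to the first ten residues of the protein sequence above.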