Gene Rru_A3037 details

Gene Information

Locus tag: Rru_A3037
Symbol:
ID: 3836483
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodospirillum rubrum ATCC 11170
Kingdom: Bacteria
Replicon accession: NC_007643
Strand:
Start bp: 3497481
End bp: 3499187
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 67%
IMG OID: 637827152
Product: chemotaxis sensory transducer
Protein accession: YP_428119
Protein GI: 83594367
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
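
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(length_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values copied from the record above.
length_bp = gene_length(3497481, 3499187)
print(length_bp)                           # 1707
print(expected_protein_length(length_bp))  # 568
```

Both results agree with the Gene Length and Protein Length fields in the record.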


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGTGA TAACCCGATC GATTCGGGGG AAGCTGTTCG CGGCTTTCCT GGCGGCGACC 
CTTGGTGTCC TCATCTCCAG CAGCCTGGGG ATCATCCTGG TCTCGCGCGA GGGCGATCTG
GCCCATGAGG CGATGACCGG TCTCGCCCCC CTCGCCGATG CGGCGATGGA GACCAAGATT
TCCGCCACCC GCGCCCACCT CGTCTTCGAA GAGATTATGG CCGGGGATGA GACCGAGTCG
ATCTCCGAGG TCTGGTCGCT GCTCGACGAC GCCGACTGGT ATTTGCTGGC CATGCTCAAT
GGCGGCGCGA AAGCCAAGGA AACCTTCGTG AAGACCGACA ATCCGGCCGT CCGCGTCCGG
CTTGGAGAGG CGCGCGAGGT GCTGGCCGCC TTTCGCGTCA CGGCGGAAGG CCGCTACAAG
GCTTTTGGCG ACCAGGGACG GGGGCAGACG GTGGCCGGAA CCGATCTCGA CGTCGCCTTC
GACGCCGCCT TCGAGAGCTT CGCCACCCTC ACCGATTCCA TCGAAGAGCT TGTCCATACC
CAGATTGACG AGCAGGTCGA GAAGATCGAG CAGACCAAAA CAAGCTCGCT GTGGCTGATG
TCGGCGACGG CCTTGATCGC GCTGGCCGTT TCCTTATCCC TTGCCGTGGT CATTGGCCGC
TCGGTTTCGC GGCGGATCAC CGATCTGTCG ACGACGATGG GCCAGATCAC CGCCGGCGAC
CACGCCGCCC CGGTTCCCCA TACCACCTCG ACCGACGAGA TCGGCGTCAT GGGCCGGGCG
CTGGTCACCC TGCGCGATGG CGTTGACGAA GCCGCCCGAC TGCGCGCCTC GCTGCAGCTC
AAGGCCGAGG ACGAGGCCCG CCAACGCACC CATCTCGAAG GGGCGATCAT CGGCTTTGAC
CGCTCAATCG GCGAGGTCAT GGCCTCGTTG CGCGACGTCG TCGAGGTTCT CCATGGCGCG
GTCGCCGACC TTGAGCACGA AGCCAGCACC GTGGACCAGT TGACCAGCGG CGTTTCGGCC
AAGACCGCCG AGGCCGCGTC CAACGTGGAA AGCGTGGCGG CGTCGATCGA GGAGCTTTCG
GCCTCGGTCC GCGAAATCTC GGCCCAGGCC GCCCGGTCGT CGGAAGCCGC CGGCTTGGCC
GCCCGCGAGG CCGAGGGCGC CAACAAGCTG ATCGGCGGCC TGCAAACGGC AACCGACGCC
ATCAGCGAGG TGATGGCGCT GATCACCGCC ATCGCCGCCC AAACCAATCT GCTGGCGCTC
AACGCCACGA TCGAGGCGGC GCGCGCCGGC GATGCCGGCA AGGGCTTCGC CGTGGTGGCG
GGTGAAGTGA AGACCCTCGC CAGCCAGACC CAGAAGGCGA CCGAGCAGAT CCATGGACAG
GTCGACGCCA TGCACGACGT CACCCGCCAG TCGGTGGCCG CCATCGGATC GATCGCCCGC
AGCGTCGATA CCATGGAGGC GATGGCCGCC TCGATCGCCG CCGCCGTCGA ACAGCAGGGC
GCCGCCTCGG GCGGCATCGC CCGGAATGCC GAGCAGGCCT CGGCGGCGAC CGGCGTGGTC
TCCGAACAGA TCGGAGCGGT CCGCGACAGC TCTGATGCGA CCAGCCGGGC CTCGCGCGCC
TTGAACGAGG CGACGGCGCG GCTCTCGGCC CAGATGGACC TGCTCTCGCA CTCCGTCGAA
AGCTTCCTGA CGGCCGTTCG CCGCTAG
 
Protein sequence
MSVITRSIRG KLFAAFLAAT LGVLISSSLG IILVSREGDL AHEAMTGLAP LADAAMETKI 
SATRAHLVFE EIMAGDETES ISEVWSLLDD ADWYLLAMLN GGAKAKETFV KTDNPAVRVR
LGEAREVLAA FRVTAEGRYK AFGDQGRGQT VAGTDLDVAF DAAFESFATL TDSIEELVHT
QIDEQVEKIE QTKTSSLWLM SATALIALAV SLSLAVVIGR SVSRRITDLS TTMGQITAGD
HAAPVPHTTS TDEIGVMGRA LVTLRDGVDE AARLRASLQL KAEDEARQRT HLEGAIIGFD
RSIGEVMASL RDVVEVLHGA VADLEHEAST VDQLTSGVSA KTAEAASNVE SVAASIEELS
ASVREISAQA ARSSEAAGLA AREAEGANKL IGGLQTATDA ISEVMALITA IAAQTNLLAL
NATIEAARAG DAGKGFAVVA GEVKTLASQT QKATEQIHGQ VDAMHDVTRQ SVAAIGSIAR
SVDTMEAMAA SIAAAVEQQG AASGGIARNA EQASAATGVV SEQIGAVRDS SDATSRASRA
LNEATARLSA QMDLLSHSVE SFLTAVRR
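
The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. It is an illustration only, not the database's own method: the helper names are hypothetical, and the codon table is built by hand from the amino-acid assignments that translation table 11 shares with the standard code. Table 11 also allows alternative start codons, which this sketch does not handle; that is not needed here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses
# the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last blocks of the gene sequence listed above.
head = clean_sequence("ATGAGCGTGA TAACCCGATC GATTCGGGGG")
tail = clean_sequence("AGCTTCCTGA CGGCCGTTCG CCGCTAG")
print(translate(head))  # MSVITRSIRG
print(translate(tail))  # SFLTAVRR
```

The two translated fragments match the first and last residues of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare with the GC content field.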