Gene Saro_3048 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3048 
Symbol 
ID3916660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3263390 
End bp3265195 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content68% 
IMG OID640445828 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_498317 
Protein GI87201060 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.229221 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGAAAAA TGCGGCACTT GCATGCCGGA AGATCATACC CGGAGCGGAG GCAGCCGATG 
CTGCAAGGGC AGTCGATCAC CAGGAAAGTC ATGGCCGGAT GCCTGATCCT TGGCCTCGGC
GCCATGGCGT CCACGGGCCT CGGCCTGGCC GGGACCGTGC GGCTGGAGGC GGCTATGGAG
AAGCTCAATG CCGCGACCGC CCTGCTGCGT GCGCACATGG AGGCCGACAT GGGGCACGAC
GCAATCCGGA GCGAAGTGGT TAGCATCGTC GCATCCAGGC AGACGGCGGC CATCGACGGG
CTTGCGGCGG GTCGCGAACT GGCCGACAGG CTGGTCGAGT TCGAGAAGAA CATGGAACCG
ACCGCCAAGG TCGAGGATGC GCCGGAAGTG AGCGCCGCGC GCGCGGCAGC CGACCCTGCG
TTCAGGGCCT ATGTGGCGAT CGGCCGCGAA GTTTCGGCTG CCGCAGAACG CGGCGCTGTG
CCGGGCGATG CCGAACTGCA GAGGTTCCAG CATCTGTTCA CGCAACTTGA AGCGGATATG
TCGAAGATCT CCGACGCGGT CGAGGCGCAT TCGAGCGAGA CGGTCGCAGA GGCAAGCTCT
GCCGCCGCTC AGGCGCGCGT GCTTGGCATC GGCAGCCTGT TCGTACTGCT TGGCATTCTC
GCCGCGGTGG TCCGGTTTGC CCGCCGCGAT CTCGTCGACC CGGTCATCGC CATTGCCGGA
AGGGTCCGGG CCATGTCCGA CGGGCGGCTC GACGTGGAAA TGGACGGGGC CCGACGGGCC
GACGAGATTG GCGACCTTGC GCGCTCGGTC GTGGCCCTGC GCGACAACCT CGCCCAGGCG
CGGGCCGAGA CCGCCGGGCA GGCGGAAGCG ATCGTGGCTT CGATCGGGGC TGGGCTGAGC
CAGCTTGCTT CCGGCAACGT CGGATATCGA ATTCGCGAGA CGCTCGCCGG TCCGTTCCAG
AAGCTGCGCG ACGATTTCAA CCGCGCCATG GACGAGATGG CTTCCGCCCT GTGCGCGGTG
CAGACGGCTA CCGCGACGCT CGATGCGGTC GCGCGCGATA TCGGCGGTGC GGCGGGGGAC
CTGTCCAACC GCAACGCCAA CCAGGCCGCC AGCCTGCAGG AAACCGCGGC CGCCATCGCC
AGCCTTGCCC AGCGCGTCGC TGGATCGTCC GAGGCCGTCA CGGCCGCACG GGCAGCCGTC
GGCCACGTCG GCAGCGAGGT CAGCCGGGGC GGCGGCGTCA TCGACGACGC GGAGCAGGCA
ATGGATCGGA TAGAGATCGC CTCACAGGAA ATCGGCACGA TCGTGGGCGT CATCGACGGC
ATCGCCTTCC AGACCAACCT GCTCGCCCTG AACGCCGGGG TGGAGGCCGC GCGCGCGGGT
GAATCGGGCA AGGGCTTTGC CGTTGTCGCC AGCGAGGTTC GCGCGCTTGC CCAGCGCAGC
GCCGACGCGG CGCGGGAGAT CAAGCAACTT ATCGCCAACT CCTCGTCCGA GATCGGTGAC
GGTGTTCGGC TGGTGCGCGA TGCCGGCAGC AGCTTGCGCG CGATCAGCGC GCAGATGGAC
GAGATCAACC GCGTGATGGA GGTCGTGGAG GCGGGCGCCA GCGACCAGGA CGTTTCGCTG
CGCTCCATCG ACGAGACGTC GCGCCAGATG GAACAGATAA CCCAGAGCAA CAGCGCGGTC
GCGGAACAGG TCGGCAATGC GAGCCATGCC GTCGTCTCTG CGATCGAGGA CGTGCTGCGG
CAGTTGCAGC GTTTCGAGAT CGGTGAGGCC CGGCGCCCTG CACAAATCCA GGCGCTTGCC
GCATGA
 
Protein sequence
MRKMRHLHAG RSYPERRQPM LQGQSITRKV MAGCLILGLG AMASTGLGLA GTVRLEAAME 
KLNAATALLR AHMEADMGHD AIRSEVVSIV ASRQTAAIDG LAAGRELADR LVEFEKNMEP
TAKVEDAPEV SAARAAADPA FRAYVAIGRE VSAAAERGAV PGDAELQRFQ HLFTQLEADM
SKISDAVEAH SSETVAEASS AAAQARVLGI GSLFVLLGIL AAVVRFARRD LVDPVIAIAG
RVRAMSDGRL DVEMDGARRA DEIGDLARSV VALRDNLAQA RAETAGQAEA IVASIGAGLS
QLASGNVGYR IRETLAGPFQ KLRDDFNRAM DEMASALCAV QTATATLDAV ARDIGGAAGD
LSNRNANQAA SLQETAAAIA SLAQRVAGSS EAVTAARAAV GHVGSEVSRG GGVIDDAEQA
MDRIEIASQE IGTIVGVIDG IAFQTNLLAL NAGVEAARAG ESGKGFAVVA SEVRALAQRS
ADAAREIKQL IANSSSEIGD GVRLVRDAGS SLRAISAQMD EINRVMEVVE AGASDQDVSL
RSIDETSRQM EQITQSNSAV AEQVGNASHA VVSAIEDVLR QLQRFEIGEA RRPAQIQALA
A