Gene Saro_3592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3592 
Symbol 
ID5077741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp213700 
End bp215745 
Gene Length2046 bp 
Protein Length681 aa 
Translation table11 
GC content65% 
IMG OID640481316 
Productmethyl-accepting chemotaxis sensory transducer 
Protein accessionYP_001165978 
Protein GI146275818 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.181049 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATCC AGACACCCCC GTTGGACCTT GTCGGTCAGG CCGCAGCCGC GCTGGCCGAT 
GCATCGGTCA TCCTTTCATT CGACCGCAAG ACCGGCAAGT GCGTCGGCGG TAACGACGAG
GCGTTGCGCC TGTTCGGCGG TTCGGGGCTG TCCGACCACG ATTTCGCATC GATATTCGTG
GCGGACGTGG CTGCCTGGAC CCGGATCTGC AATGGCGAGA TCGCGCGCGT CAGCGGCGGC
GTGCCGCAGC CTTCGGGCGG CTTTGCCGGA ATCTCCGGCG CGGCGCGCCT TGGCGAGGGA
CGCAGCGCGC CGGTTCTCTT CATGGGCGTG CGGACTTGCG ATGGCGATGG GGACGTAGCC
CTGCTGGTCC ACCGCAATGA CGCGCTCGGC AAGGCGCTGC CGATCTGCCA GTACTCGCCC
ACTGGCACGA TCGTGGCCAT CAACGACACA TACCTCGAGC GCACCGGCCG TGATCGCGAA
GCCGTCGTCG GCAAGCCCTT CACCGCCTTA TGGCAAGGCG CGAACGTCCC TGCGGATCCC
CAGGCCTGGT GGAGCCGCTT TTCCCGTGGT GAAAGCGAGA TTGTCCTGCG TCGCCACGAT
GCAGGTGCGG GCAAGACGGT GTGGCTGCGC GAGGCCTTCG TCCCGACGCT GGACGGCGAC
CGTCTGATTT CGGTGCTGTG CTACAGCTGC GATGCCACCG AGACGGTCGA CCAGCTCAAC
GAGCAGTCCG AGCAGATCGC CGCGATCAGC CGCAGCCACG CGATGATCGA GTTCGATCTC
GAAGGCAACA TCCTGTGGGC CAACGAACAT ATGCTCGAGA CGACCGGCTA CCGGCTCGAC
GAGATCAAGG GACGGCACCA TCGGCTGTTT TGCGAACCGG AACTGGTCTC GTCTCCTGAA
TACGCCAGCT TGTGGGAAAG GCTTGGCCGG GGCGAATATG TCAGCGGCGA GTTCAAGCGG
CTGACCAAGG CCGGCGAGGA AATCTGGATC CGCGCCAGCT ACAATCCCAT CATCGGGACT
GACGGGAAGC CGACCAAGAT CTGCAAGATC GCCGCCGACG TCACCGCCCA GAAGGTCCTG
TCCAACGAAT ACCAGGCCAA GGTCGTCGCG ATCAACCGCG CGCTCGCCGT GATCGAGTTC
GACCTCGACG GGAATGTCCT GACGGCGAAT GACAATTTCC TGTCGACCAT GGGGTATTCG
CTGCGCGAGG TCAGCGGTCA GCACCACTCC ATGTTCTGCG CGCCCGACTA TGTGCGCAGC
CGCGAATATG CGGAGTTCTG GCTGAAGCTG AACCGTGGCG AATTCCATGC GGGCCGCTTC
CACCGCGTCG GCAAGTACGA TCGCGACGTG TGGATCCAGG CAACCTACAA TCCGATCTTC
GACTTGCGCG GCAAGCCCGT GCGCGTGGTC AAGTTCGCGT TCGACATCAC CGACCAGGTG
GCGATGGAGC GCGAGATCGA GGCCCGCGCG AGCGATCTGG GCGGATTGGT GGAACGGCTT
TCCAGCTCCA TCGCGGCGAT CACCGATGCC ACGACTTCGG CCAACGGTCT TGCCCAGAGC
ACCCGCGACA ATGCGGAGAA GGGCATGGAC GCCCTGTCCA AGGCGATCGA GGCGATCGAA
CTGATCCAGA CTTCCGCCGC CGGCATTGCC GAGATCGTCG GCATCATCGG CGAAATCGCG
GGGCAGACCA ACCTGCTTGC CTTCAATGCC GAAATCGAGG CGGCGCGGGC CGGCGAACAT
GGTGTCGGGT TCAGCGTGGT GGCCGGCGAG GTAAGGCGCC TTGCCGAACG GTCCTCGACC
GCCGCGCGCG ACATCAGCCG CCTGATCGAC GAATCGATCG GCCGCATCGG CCTCGGCACG
GCGCGCTCGC ACGAAGCGTC GAACGCATTC GACGGGATCG TCGATTCGGT GCGGAGGACC
GGCAGCGCGA TCGAGGCGAT CAGCGCATCG GTCTCGGTGC AGGATCAGGT TTCGGCCGAT
GCCGTCTCGC TCATCAACAG CCTCGCCAAC GTCACCGGCG CCTCGGCCCG GTCGCAGGCG
TCCTGA
 
Protein sequence
MTIQTPPLDL VGQAAAALAD ASVILSFDRK TGKCVGGNDE ALRLFGGSGL SDHDFASIFV 
ADVAAWTRIC NGEIARVSGG VPQPSGGFAG ISGAARLGEG RSAPVLFMGV RTCDGDGDVA
LLVHRNDALG KALPICQYSP TGTIVAINDT YLERTGRDRE AVVGKPFTAL WQGANVPADP
QAWWSRFSRG ESEIVLRRHD AGAGKTVWLR EAFVPTLDGD RLISVLCYSC DATETVDQLN
EQSEQIAAIS RSHAMIEFDL EGNILWANEH MLETTGYRLD EIKGRHHRLF CEPELVSSPE
YASLWERLGR GEYVSGEFKR LTKAGEEIWI RASYNPIIGT DGKPTKICKI AADVTAQKVL
SNEYQAKVVA INRALAVIEF DLDGNVLTAN DNFLSTMGYS LREVSGQHHS MFCAPDYVRS
REYAEFWLKL NRGEFHAGRF HRVGKYDRDV WIQATYNPIF DLRGKPVRVV KFAFDITDQV
AMEREIEARA SDLGGLVERL SSSIAAITDA TTSANGLAQS TRDNAEKGMD ALSKAIEAIE
LIQTSAAGIA EIVGIIGEIA GQTNLLAFNA EIEAARAGEH GVGFSVVAGE VRRLAERSST
AARDISRLID ESIGRIGLGT ARSHEASNAF DGIVDSVRRT GSAIEAISAS VSVQDQVSAD
AVSLINSLAN VTGASARSQA S