Gene Saro_2821 details

Gene Information

Locus tag: Saro_2821
Symbol:
ID: 3916981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3044321
End bp: 3046384
Gene Length: 2064 bp
Protein Length: 687 aa
Translation table: 11
GC content: 69%
IMG OID: 640445600
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_498091
Protein GI: 87200834
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
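
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The minimal Python sketch below shows the arithmetic. It assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(3044321, 3046384)
print(length, protein_length(length))  # prints "2064 687"
```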


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.950861
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTCAAGT TCAATATCTC GGCCAAGTTC ATATCCGCAT TTGCGCTGCT TCTGGCGGTG 
ATGGCGGGCA TGGGGCTCTT TGCCGTATCG AAGATCGGTG AAGTCAACGT CATCGCGGCG
GAGCAGCGGG ACCGCTGGAT GCCCGCCGCA GCCACGCTGG GTGATATCCA CGCCTTCACG
TCGCAATACC GCCTCAAGCA GGACGAGATG CTCAACGCCA CCTCGCCGGC GGCGATGGAG
CGCAGCCAGA AGCTCATGCG CAACGCACGG GCTGCCATCG ACGACAGCCT GGCCCAGTTC
GAAAAGCTCG CTTCGACGCC CGAACAGAAG GGCGCGGTCG CCACGATCAG GGAATCATGG
GCTCGCCTGC TTGAACAGGA CCAGACCATG CAGGCCATGG CGCTTTCGGG CGACCAGGCC
GGCGCACAGG CCATGCACAA CAGCGAAGGC CTCGATTCCT TCTACGCCGT CGAGGACGCC
ATCCTCGCCG CGATCGAGGT CAACCAGAAG GCCGCGGATG CCGTCTCCGC CCAGAGCGAG
GAAATCTACG CCTCGGCGCG GACCTTCACC TTCGGCATCA TCGGCCTCGG CCTGCTCGCG
GCGCTTTCCC TGCTCGTCTT CCTCATGCGC AACATCGCCC GCCCCCTCGT CAACATGTCG
GAAGCGGTTG TCCGCCTGAC CTCCGGCGAC CACAGCATCG CCGTTCCGGG CCTGGGCCGC
ACCGACGAGC TGGGCAGCCT GGCCCGCGCG CTCGACCAGT TCCGCGATGT CTTCGCCTCC
GATCACGCCC GCGCCGAAAC AGAAAAGGCC CGCGCGCGCG AGACGCAGGT CACCATCGAC
GCCATCGGTA GCGGCCTCAC CGCCCTGGCC GAAGGCGACC TCACCTTCCG CGTGGCCGAG
AACGGCAGCG GCGCCCTTGC CAAGCTCCAT GTCGACTACA ACGCCGCCGT CGCCTCGCTC
GAACGCGTGC TCGGCAAGAT CGTCGACGGC TGCAACACCA TCAAGCTCGG CACCGACGAA
ATCGCCAGCG CCGCGACGGA CCTTGCCCTG CGCACCGAAC AGCAGGCCAC CTCGCTTGCC
GAAACCTCGC GCACGCTCAG CGAATTCACC GGCTCGGTGA AGACCACCGC CGACAACGCG
CGCCAGACGA GTTCGCGCCT GACCGTCGCG CGCAACACCG CGGACAGCGT GGGCGATACC
GCCAACCGCG CGGTCGCCGC CATGCGCTCC ATCGAAAGCA GCTCACGCGA GATGGCAGAG
ATTGTCGGCG TGATCGATGG CATCGCCTTC CAGACCAACC TGCTCGCGCT CAATGCCGGG
GTCGAGGCCG CCCGCGCGGG CGATGCCGGC AAGGGCTTCG CGGTCGTCGC CACCGAAGTG
CGCGCGCTCG CCCAGCGCTC GGCCGATGCC GCGCGCTCGA TCCGCGACCT CATCGGCAAG
AGCACCGACG AGATCAGCGG CGGCGTCGCG CTGGTGGAAT CCAGCGGCGA GGCGCTGCGT
CAGATCGTCA CCGAAGTCAG CGCCGTCTCG GCCCTGGTCG AGGAGATCGC CGAAGCGGCA
GGCCAGCAGG CCGCCGGCAT TGCCGACATC TCGGCCATGG TCGGCTCGAT GGACGCCTTC
ACCCAGCAGA ACGCGGCCAT GGTCGAGGAA AGCTCGGCCG GAACCCGCAA CCTTGCCTCG
GAAACGCTTT CGCTGGTCGA CCAGCTCGGG CGCTTCCGGC TCGCCACGGT CGGACAGGCT
GCGGAGGACC GGTTCGAAGA CGCGCCCGCC CCGGCCCGTC ATTCCGGCTT CGGCGCCTTC
GACGCGTTCG ACGCGGCCCA GGTCGACGAT TCCGACGAAG CCCCGGCCTT GAGATCGCCC
TCCTCGTACG AACCGGTGGC AATCGCCGAT GAGCGCCCCC GGCCAGCCGC CCCGGTCGCC
GCCCCGGTCG CCGCTCCGCC TCCGCCTCCG GCCCCGGCCC CGGCCCCGCC TCCCCCGCCG
GCAAAGCCAC GGGCGGCCCG CGCGGCACCT TCGCGTGGCG GCGCAGCGGT CAAGCTCGAC
GAGGACGACT GGTCGGAATT CTGA
 
Protein sequence
MFKFNISAKF ISAFALLLAV MAGMGLFAVS KIGEVNVIAA EQRDRWMPAA ATLGDIHAFT 
SQYRLKQDEM LNATSPAAME RSQKLMRNAR AAIDDSLAQF EKLASTPEQK GAVATIRESW
ARLLEQDQTM QAMALSGDQA GAQAMHNSEG LDSFYAVEDA ILAAIEVNQK AADAVSAQSE
EIYASARTFT FGIIGLGLLA ALSLLVFLMR NIARPLVNMS EAVVRLTSGD HSIAVPGLGR
TDELGSLARA LDQFRDVFAS DHARAETEKA RARETQVTID AIGSGLTALA EGDLTFRVAE
NGSGALAKLH VDYNAAVASL ERVLGKIVDG CNTIKLGTDE IASAATDLAL RTEQQATSLA
ETSRTLSEFT GSVKTTADNA RQTSSRLTVA RNTADSVGDT ANRAVAAMRS IESSSREMAE
IVGVIDGIAF QTNLLALNAG VEAARAGDAG KGFAVVATEV RALAQRSADA ARSIRDLIGK
STDEISGGVA LVESSGEALR QIVTEVSAVS ALVEEIAEAA GQQAAGIADI SAMVGSMDAF
TQQNAAMVEE SSAGTRNLAS ETLSLVDQLG RFRLATVGQA AEDRFEDAPA PARHSGFGAF
DAFDAAQVDD SDEAPALRSP SSYEPVAIAD ERPRPAAPVA APVAAPPPPP APAPAPPPPP
AKPRAARAAP SRGGAAVKLD EDDWSEF
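
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal, dependency-free Python sketch. It relies on translation table 11 assigning amino acids to codons in the same way as the standard genetic code (the tables differ in which codons may act as starts); the helper name `translate` is hypothetical, and only the first and last lines of the gene sequence are used as input.

```python
from itertools import product

# Codon-to-amino-acid map in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the listing) is ignored.
    Alternative start codons are not handled; this gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGTTCAAGT TCAATATCTC GGCCAAGTTC"))  # prints "MFKFNISAKF"
# Last 24 bases of the gene sequence, ending in the TGA stop codon
print(translate("GAGGACGACT GGTCGGAATT CTGA"))  # prints "EDDWSEF"
```

Passing the full gene sequence to the same function should reproduce the 687-residue protein listed above.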