Gene Saro_2627 details

Gene Information

Locus tag: Saro_2627
Symbol:
ID: 3917042
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2843398
End bp: 2846748
Gene Length: 3351 bp
Protein Length: 1116 aa
Translation table: 11
GC content: 66%
IMG OID: 640445386
Product: multi-sensor hybrid histidine kinase
Protein accession: YP_497897
Protein GI: 87200640
COG category: [E] Amino acid transport and metabolism
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0591] Na+/proline symporter
        [COG0642] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
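
The start, end, gene length, and protein length fields can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive, and that the last codon of the gene is a stop codon that is not counted in the protein length. Both are assumptions, although they agree with the values listed in this record. The variable names are illustrative only.

```python
# Coordinates as listed in this record (assumed 1-based, inclusive).
start_bp = 2843398
end_bp = 2846748

# Inclusive interval length.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

If the remainder were non-zero, or the derived lengths disagreed with the listed ones, that would suggest a different coordinate convention or a partial gene.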


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.594612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGCGA GTTTTGCCGC GTTTCTGGGT TTGAGCCTTG TGGCGTTGCT CTTCGGGGTT 
GCCGCCGCGA CGGAACGGTT CGGCACGTCG AGCCGCATGG GTCCGAAATT CAGGCACGGC
GCGTATACCC TTGGCCTCGG AGTCTACTGC TCCAGCTGGA CGTTCTATGG GGCGGTCGGA
ACAGCAGTCA GCGAGGGTTG GAACTACCTG CCGATCTACC TCGCACCTGT CTTTCTGCTG
CTCGCCGCGC CGCGGTTTCT CGAGCGCTTG TCGCGTGCAG TACAAGAAGA GCGAGCCTCG
ACGATCTCGG ACTTCATTGC GGCGCGCTTT GGCCACGATC CGGGGGTAGC CCGTATCATC
ACGGTTATCG CACTGTGCGG GACCGTGCCG TACATTGCGC TTCAGTTGCG GGGAATGGGT
ACGGCCATCG CGGCCCTGTC CGGCAGGGAT CTTGCGGTGC CGGTGATGGT GACGGCGGCG
GGTGGGCTGT CACTGTTTGC CATTCTCTTC GGGGCGCGGC GGTATGAGAT TTCCGGGCGG
AGCGAGGGCC TGGTGTTCTC TATCGCGCTG GAATCCGTGA TCAAGTTGGT GGCGCTGGGG
CTGGTCGGTG CTTACGCGGT CTATGTCCTT GCAAAGGCCG GGCCTGCCGA TCTCGGCCGC
GCGGGAAGCG TGCTGGGGGA GCAGTTTGGC CCGTCCCATG TCACTATAGA GACGTTTACG
GTGGCGCTCG TGTCGGCGTT TGCGGTGCTG GTGCTGCCGC GGCAGTTCTA CATGGGATTG
GCGGAGGCGC ATTCTCCGCG AGACTTACCT CGGGCCCGGT TCGGCCTGTC GCTCTACCTC
CTGGTGATGG CGGCTCTGAT CGTTCCCATC GCGCTGGCGG GGCTGGCGGC CTTGCCCCCA
GGCGTCGCGC CAGACAGCTT CGTCCTGCTC CTGCCCGCGC GCGAAGGGGC CGATGCCATT
GCGATTGCGG CGCTGCTGGG TGGGCTGAGT TCTGGCGCTG CGATGGTTAT CGTCGATGCG
ACGGCGCTGT CGACCATGGT GTCGAACGAC CTCATCTTTC CCGCAGTGGT CCGGAACGAG
GGAGCCGCCG GGGCGGGTGA TCTCGGGCGA AAGATGCTTG TTGTGCGCCG ATTGGCGATT
GTCGGCGTCG TCGGCCTCGC GCTTGCATGG GCATTGCTGG TCGATCCGGC GCGGTCGCTT
GCATCGATCG GCCTGGTCGC GTTTTCCGCG ATGGTTCAAT TCGTGCCGCA CCTGCTGCTT
GCGGTAGCGG CCCCGGGGAG GGATCCGGTG GCAGCGCGCG CCAGCCTGCT GACCGGTCTC
AGTCTTTGGC TCTACACGCT CGCCCTTCCG CCGATCATGC CAGCATGGCT GGTGTCGGCG
TTGCAGGGGA CGATCACCGA TCCCTCGCGC CTGCTGGGAA TTGGCCATGC TTCACCCTTG
GTCCACGGGG TAGGCTGGAG CCTGGCGGCA AATCTGGCAG TCCTTGCGCT CGCCATGGCG
CGCCAGAGCC GGGCGCCGCG CATGCCGCGC CTTGTTCTGG TTGACCGGGC GGTCGGCAAC
CTCGGCGATC TCGCCGGATT GGCCGCGCGC TTCATCGGAG AGGAGCGGGC GCTCGCCGCA
TTCCCCCGCG AACGGCATGC CAACCCAGTC GATCGTCAGT CGGCCAGGTT GGCCCAGGAC
CTGATCGGTG GCGTCGTGGG CGCATCGTCT GCCCGGGCGC TGGTGGCATC GGCGCTTGCG
GGCGGGCGCA TGAACCTGGA GGACGTTACC CGGTTGCTGG ACGAGGGCGG GCAGTCGCTG
AGCTTTTCCC GGCAACTGCT CGCGGCCACT TTCGAAAATC TTCAGTCCGG CGTCAGCGTC
ATCGATGGCG AACTCAATCT CGTGGCCTGG AATACACGTT ACGTCGACCT GTTCGGCTAC
CCGCCCGGCC TCGTGCGCGT CGGCGTGCCG ATAGCGACGC TCATCCGCTA CAACGTCGAG
CGCGGAGACT TCGCCGGAAC GGTGGAGGAA GAGGTCGAAA AGCGCCTGCG CCACCTGCGG
GCAAGACGTT CATATGCCTC GGAAAGGGTG CGCCGGGATG GCAGGGTCAT CAAGTCGGTC
GGCGGCCCCA TGCCTGGCGG CGGCTATCTG ACCTCGTTTA CGGACATCAC CGAGGAAGCG
GCGGTCCGGG CAGAACTTGA ACGTACACTC GATGAACTCG AACAGCGAGT GAACGACCGG
ACCAGCGAAC TGAGCGAAGC CAATCGCCGC CTTGCCGACG CCACGCGCGA GAAAACCCGT
TTCCTTGCCG CAGCCAGTCA CGACCTGCTC CAGCCGCTTC ACGCGGCTCG GCTCTTCGCC
TCGGCGCTCG ATCGCAATCT TGAAGGAAAT GCCAAGGTGC TGGCGGCGAG GGTGGACCGG
TCGATCGTGG CAGCCGAAGC CTTGTTGCGC GCACTACTCG ACATTTCCAA GCTGGATGCG
GGTGGTATTC AGCCTGACCC CGAACCGGTG CCGCTGGCGC CTCTGATCGC TGACATTGCC
GAGAACATGC GTCCACTCGC CGAGGAGAAG GGCATTACGC TGCGTATCGG GAGCCTGGTC
GGAACGGTCG ACACGGACCC GGGCCTGCTG CGCTCGGTAC TTCAGAACCT GGTGGCCAAC
GCGGTGCGCT ACACGGTTGA GGGTGGGGTG ATCATCGGGG TGCGGCGGCG CGGGGCTTTC
CTGCGCATCG ATGTCTATGA CACCGGCGTC GGCATCCCGC CGGACAAGCA GCGGGACATC
TTCAGCGAGT TCACCCGCCT CGGGTCGGTG GAAGCCGAGG GGTTGGGGCT CGGGCTGGCG
ATCGTCGAGC GGATCGCGCG ATTGATCGGC GCCCGGATCG AGGTCCGCTC GGTCGAAGGG
CGTGGCAGCC GTTTCAGCGT GCTCCTGCCG GCAGCAGCTT GCAGCACGGA CGTTCCGCCT
GATCCCAACG CCGACAAGGC GGATGTCGGT GGGCGGCCAC GCGGCGCGCT GAAGGTTCTT
GTGGTCGACA ACGAACCCGA CATCGTGGAG GCAACGGTCG CGTTGCTGGA AGGCATGGGA
CATCATGCCA TCGGTGCAGC GGGGACGGCG CAGGCACTGG AGAAAATCCA CGCGGTGGAC
GTGCTTCTGG CAGATTATCA CCTCGATGGG GGCGAGGACG GGCTGCGGTT GATCGATCAG
GCGCGCAGCC GCAATCCGAC GCTTGCGACA GCATTGATCA CCGCCGAGAG CGGCGCGGAC
TTGCGCAATC GGTTGCGAAT GCGGCGCGTG CCCCTGTTCG TGAAACCGGC AGATCCGGCT
GCGATCGAGG CCTTTCTGGC TGGCGTGTCA GGAGGCGAGA TCGAGCCCTA G
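
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any length or composition check. The following is a minimal sketch of that clean-up plus a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(text):
    """Remove all whitespace (block spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as displayed (including the trailing space).
first_line = "ATGACAGCGA GTTTTGCCGC GTTTCTGGGT TTGAGCCTTG TGGCGTTGCT CTTCGGGGTT "

seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq) * 100))
```

Running the same two functions over the full sequence text would give a value to compare with the listed gene length and GC content.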
 
Protein sequence
MTASFAAFLG LSLVALLFGV AAATERFGTS SRMGPKFRHG AYTLGLGVYC SSWTFYGAVG 
TAVSEGWNYL PIYLAPVFLL LAAPRFLERL SRAVQEERAS TISDFIAARF GHDPGVARII
TVIALCGTVP YIALQLRGMG TAIAALSGRD LAVPVMVTAA GGLSLFAILF GARRYEISGR
SEGLVFSIAL ESVIKLVALG LVGAYAVYVL AKAGPADLGR AGSVLGEQFG PSHVTIETFT
VALVSAFAVL VLPRQFYMGL AEAHSPRDLP RARFGLSLYL LVMAALIVPI ALAGLAALPP
GVAPDSFVLL LPAREGADAI AIAALLGGLS SGAAMVIVDA TALSTMVSND LIFPAVVRNE
GAAGAGDLGR KMLVVRRLAI VGVVGLALAW ALLVDPARSL ASIGLVAFSA MVQFVPHLLL
AVAAPGRDPV AARASLLTGL SLWLYTLALP PIMPAWLVSA LQGTITDPSR LLGIGHASPL
VHGVGWSLAA NLAVLALAMA RQSRAPRMPR LVLVDRAVGN LGDLAGLAAR FIGEERALAA
FPRERHANPV DRQSARLAQD LIGGVVGASS ARALVASALA GGRMNLEDVT RLLDEGGQSL
SFSRQLLAAT FENLQSGVSV IDGELNLVAW NTRYVDLFGY PPGLVRVGVP IATLIRYNVE
RGDFAGTVEE EVEKRLRHLR ARRSYASERV RRDGRVIKSV GGPMPGGGYL TSFTDITEEA
AVRAELERTL DELEQRVNDR TSELSEANRR LADATREKTR FLAAASHDLL QPLHAARLFA
SALDRNLEGN AKVLAARVDR SIVAAEALLR ALLDISKLDA GGIQPDPEPV PLAPLIADIA
ENMRPLAEEK GITLRIGSLV GTVDTDPGLL RSVLQNLVAN AVRYTVEGGV IIGVRRRGAF
LRIDVYDTGV GIPPDKQRDI FSEFTRLGSV EAEGLGLGLA IVERIARLIG ARIEVRSVEG
RGSRFSVLLP AAACSTDVPP DPNADKADVG GRPRGALKVL VVDNEPDIVE ATVALLEGMG
HHAIGAAGTA QALEKIHAVD VLLADYHLDG GEDGLRLIDQ ARSRNPTLAT ALITAESGAD
LRNRLRMRRV PLFVKPADPA AIEAFLAGVS GGEIEP
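
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds a codon table from the standard NCBI amino-acid ordering, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in the set of permitted start codons). It translates only the first 30 bases of the gene as an illustration. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G at each codon position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the block spacing removed.
prefix = "ATGACAGCGAGTTTTGCCGCGTTTCTGGGT"
print(translate(prefix))
```

The result can be compared with the first ten residues of the protein sequence listed above.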