Gene Saro_3494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3494 
Symbol 
ID5077643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp101629 
End bp103527 
Gene Length1899 bp 
Protein Length632 aa 
Translation table11 
GC content66% 
IMG OID640481218 
ProductPAS/PAC sensor hybrid histidine kinase 
Protein accessionYP_001165880 
Protein GI146275720 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCTCGCGC TTGGCCACGT CCTGCGCGAC ATTGCCGAGG TGGTGACTGC CAACTCGGGA 
CGCGAAGCAT TGCGGCACCT GCTGGATGGC GAGTTCGCTG TCATCCTGCT CGACGTGTTC
ATGCCGGACA TGGATGGTTA CGAGGTGGCG GACCTCATCC GCGAACGCAA GCAGACCGCG
CGCATCCCGA TCATATTCCT TTCGGCGGTC AACAAGGAAA CCGAGCACCT GATGCGCGGT
TATGCGATGG GCGCGGTCGA CTATGTGTTC AAGCCCGTCG ATCCGGTCGT GCTTGCCACG
AAAGTCAGCG TGTTCGTCGA ACTGTTCGAG ATGCGCAAGC GGGTGGAGGC CAAGAGCCGC
GCCGAACGCG AGCTGCGCGA GGCGGGGTTC CGTGCACAGC TCGAGCGGCT TCAGATCGAG
CACGAACTGA ATTCCACCCG AGCGCGGCAG GCGACGGTAC TCGACGCCCT GCCGCTCGCC
CTGTTCGAGG CGGTCGCAGA CAAGAACGGA ATGCTGATCC GCGAATTCGT GGCCGGCGAC
CTCGCCAAGA TCGCGGGCGT GGACGCGACC TCCATCGAGC AACGGTCGCT GTGCTGGGAG
GACCGCATCC ACCCGGAGGA TCTCCCCGCC ACGCGCCCTC CGGCAGGCTC GGACGCCGTG
TTCTCGACCG AATACCGCTG GAACTGTGCC GACGGTTCCC AACGGTACTT CTTCGAACGC
GCGGTACCCA TCGGATGCGA GACGGATGGG CTTGTTCGCT GGGCGGGCAC GCTGCTCGAC
GTGACCGACC GCCGGAAGCT GGAGGCGCAG CTTCTTCAAG CCGGCAAGAT GGATGCGCTG
GGGCGGCTGA CCGGCGGCGT CGCCCACGAT TTCAACAACG TGCTGGCAGC AGTGCTCGGC
GGCATCACCC TGCTTGAACG CAAGGCACCG CTCGACGACC TCGGCCATCG CCTTACCGAG
CAGATCCGCC TTGCCGCCGA ACGCGGCGCG GAACTGGTGC GGCGCATGAT GGCCTTTGCC
CGCAAACAGG AACTCAAGCC CGTCTACCTC GCGCCCTCCG CAGTGCGCGA GGCCGTGTCC
GGGCTGGTCG AACAAACCCT GGGCGGAACG GTGACGCTTT CCTGGGATTG CGCGGATACG
GATCTGGTCT TCCACGCCGA CCGGTCGCAG CTCGAACTGG CGCTTGTGAA CCTGGTCATC
AACGCGCGCG ACGCCATGCC CGAAGGCGGC TCGATCCACG TGGCGATCGC TCCCGCTGCC
GATGCGGATC GGCTGCGCAT AGAAGTGCGC GACGAAGGCA CCGGCATCGC ACCGGGCGTG
CTGGAACGCA TCACCGAACC GTTCTTCACC ACCAAGGGAG TGGGCAAGGG CACGGGGCTG
GGGCTGTCGA TGGTCATGGG GTTCGTCCAG CAATCGGGCG GAACGCTCGA CATCGAAAGC
GCGGAGGGGT GCGGCACCAC CGTGCGCATC CTCATGCCCG CCGCCCGGGC GCCGGACGCC
GATGAGCGGG AAGCGCCGAG CGTCGAAGGC ACCCGCGCCT ACGCAGTCAG GACCGTGCTG
GTGGTGGACG ATGACCACTC CGTCCGCACG ATAATCGCCG AACAACTCCG CGAATTCGGC
GTCATGGTGG AAGAGGCGGC AAGCGGCGCC GATGCGGTCG AACGCGTGAT ATCCGCAAAG
ACGCCCTTCG ACCTGCTCCT CACCGATTTC GCGATGCCGG GTCTCAACGG GTTGCAAACG
ATAGAGCGGC TGCGCGCGCT GGGAACGGAC ATTCCCTGCG CGCTCATGAC GGGATATGCC
GACGACCGGA TAGATACCAC CGGCGGCACG CAAACCCGGC TGCTGCGCAA GCCCATCGCC
TTCGAGGATC TCGAAGACCT CCTGATCCAT CCGACATGA
 
Protein sequence
MLALGHVLRD IAEVVTANSG REALRHLLDG EFAVILLDVF MPDMDGYEVA DLIRERKQTA 
RIPIIFLSAV NKETEHLMRG YAMGAVDYVF KPVDPVVLAT KVSVFVELFE MRKRVEAKSR
AERELREAGF RAQLERLQIE HELNSTRARQ ATVLDALPLA LFEAVADKNG MLIREFVAGD
LAKIAGVDAT SIEQRSLCWE DRIHPEDLPA TRPPAGSDAV FSTEYRWNCA DGSQRYFFER
AVPIGCETDG LVRWAGTLLD VTDRRKLEAQ LLQAGKMDAL GRLTGGVAHD FNNVLAAVLG
GITLLERKAP LDDLGHRLTE QIRLAAERGA ELVRRMMAFA RKQELKPVYL APSAVREAVS
GLVEQTLGGT VTLSWDCADT DLVFHADRSQ LELALVNLVI NARDAMPEGG SIHVAIAPAA
DADRLRIEVR DEGTGIAPGV LERITEPFFT TKGVGKGTGL GLSMVMGFVQ QSGGTLDIES
AEGCGTTVRI LMPAARAPDA DEREAPSVEG TRAYAVRTVL VVDDDHSVRT IIAEQLREFG
VMVEEAASGA DAVERVISAK TPFDLLLTDF AMPGLNGLQT IERLRALGTD IPCALMTGYA
DDRIDTTGGT QTRLLRKPIA FEDLEDLLIH PT