Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3494 |
Symbol | |
ID | 5077643 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | + |
Start bp | 101629 |
End bp | 103527 |
Gene Length | 1899 bp |
Protein Length | 632 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640481218 |
Product | PAS/PAC sensor hybrid histidine kinase |
Protein accession | YP_001165880 |
Protein GI | 146275720 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCTCGCGC TTGGCCACGT CCTGCGCGAC ATTGCCGAGG TGGTGACTGC CAACTCGGGA CGCGAAGCAT TGCGGCACCT GCTGGATGGC GAGTTCGCTG TCATCCTGCT CGACGTGTTC ATGCCGGACA TGGATGGTTA CGAGGTGGCG GACCTCATCC GCGAACGCAA GCAGACCGCG CGCATCCCGA TCATATTCCT TTCGGCGGTC AACAAGGAAA CCGAGCACCT GATGCGCGGT TATGCGATGG GCGCGGTCGA CTATGTGTTC AAGCCCGTCG ATCCGGTCGT GCTTGCCACG AAAGTCAGCG TGTTCGTCGA ACTGTTCGAG ATGCGCAAGC GGGTGGAGGC CAAGAGCCGC GCCGAACGCG AGCTGCGCGA GGCGGGGTTC CGTGCACAGC TCGAGCGGCT TCAGATCGAG CACGAACTGA ATTCCACCCG AGCGCGGCAG GCGACGGTAC TCGACGCCCT GCCGCTCGCC CTGTTCGAGG CGGTCGCAGA CAAGAACGGA ATGCTGATCC GCGAATTCGT GGCCGGCGAC CTCGCCAAGA TCGCGGGCGT GGACGCGACC TCCATCGAGC AACGGTCGCT GTGCTGGGAG GACCGCATCC ACCCGGAGGA TCTCCCCGCC ACGCGCCCTC CGGCAGGCTC GGACGCCGTG TTCTCGACCG AATACCGCTG GAACTGTGCC GACGGTTCCC AACGGTACTT CTTCGAACGC GCGGTACCCA TCGGATGCGA GACGGATGGG CTTGTTCGCT GGGCGGGCAC GCTGCTCGAC GTGACCGACC GCCGGAAGCT GGAGGCGCAG CTTCTTCAAG CCGGCAAGAT GGATGCGCTG GGGCGGCTGA CCGGCGGCGT CGCCCACGAT TTCAACAACG TGCTGGCAGC AGTGCTCGGC GGCATCACCC TGCTTGAACG CAAGGCACCG CTCGACGACC TCGGCCATCG CCTTACCGAG CAGATCCGCC TTGCCGCCGA ACGCGGCGCG GAACTGGTGC GGCGCATGAT GGCCTTTGCC CGCAAACAGG AACTCAAGCC CGTCTACCTC GCGCCCTCCG CAGTGCGCGA GGCCGTGTCC GGGCTGGTCG AACAAACCCT GGGCGGAACG GTGACGCTTT CCTGGGATTG CGCGGATACG GATCTGGTCT TCCACGCCGA CCGGTCGCAG CTCGAACTGG CGCTTGTGAA CCTGGTCATC AACGCGCGCG ACGCCATGCC CGAAGGCGGC TCGATCCACG TGGCGATCGC TCCCGCTGCC GATGCGGATC GGCTGCGCAT AGAAGTGCGC GACGAAGGCA CCGGCATCGC ACCGGGCGTG CTGGAACGCA TCACCGAACC GTTCTTCACC ACCAAGGGAG TGGGCAAGGG CACGGGGCTG GGGCTGTCGA TGGTCATGGG GTTCGTCCAG CAATCGGGCG GAACGCTCGA CATCGAAAGC GCGGAGGGGT GCGGCACCAC CGTGCGCATC CTCATGCCCG CCGCCCGGGC GCCGGACGCC GATGAGCGGG AAGCGCCGAG CGTCGAAGGC ACCCGCGCCT ACGCAGTCAG GACCGTGCTG GTGGTGGACG ATGACCACTC CGTCCGCACG ATAATCGCCG AACAACTCCG CGAATTCGGC GTCATGGTGG AAGAGGCGGC AAGCGGCGCC GATGCGGTCG AACGCGTGAT ATCCGCAAAG ACGCCCTTCG ACCTGCTCCT CACCGATTTC GCGATGCCGG GTCTCAACGG GTTGCAAACG ATAGAGCGGC TGCGCGCGCT GGGAACGGAC ATTCCCTGCG CGCTCATGAC GGGATATGCC GACGACCGGA TAGATACCAC CGGCGGCACG CAAACCCGGC TGCTGCGCAA GCCCATCGCC TTCGAGGATC TCGAAGACCT CCTGATCCAT CCGACATGA
|
Protein sequence | MLALGHVLRD IAEVVTANSG REALRHLLDG EFAVILLDVF MPDMDGYEVA DLIRERKQTA RIPIIFLSAV NKETEHLMRG YAMGAVDYVF KPVDPVVLAT KVSVFVELFE MRKRVEAKSR AERELREAGF RAQLERLQIE HELNSTRARQ ATVLDALPLA LFEAVADKNG MLIREFVAGD LAKIAGVDAT SIEQRSLCWE DRIHPEDLPA TRPPAGSDAV FSTEYRWNCA DGSQRYFFER AVPIGCETDG LVRWAGTLLD VTDRRKLEAQ LLQAGKMDAL GRLTGGVAHD FNNVLAAVLG GITLLERKAP LDDLGHRLTE QIRLAAERGA ELVRRMMAFA RKQELKPVYL APSAVREAVS GLVEQTLGGT VTLSWDCADT DLVFHADRSQ LELALVNLVI NARDAMPEGG SIHVAIAPAA DADRLRIEVR DEGTGIAPGV LERITEPFFT TKGVGKGTGL GLSMVMGFVQ QSGGTLDIES AEGCGTTVRI LMPAARAPDA DEREAPSVEG TRAYAVRTVL VVDDDHSVRT IIAEQLREFG VMVEEAASGA DAVERVISAK TPFDLLLTDF AMPGLNGLQT IERLRALGTD IPCALMTGYA DDRIDTTGGT QTRLLRKPIA FEDLEDLLIH PT
|
| |