Gene Saro_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2002 
Symbol 
ID3917322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2134947 
End bp2137295 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content69% 
IMG OID640444753 
ProductPAS/PAC sensor signal transduction histidine kinase 
Protein accessionYP_497275 
Protein GI87200018 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0262543 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCTGC TCAACCAGAC CGCGCTGGTC CTGATCGGAC TGCTGCTTGC CGCGTGGACG 
GTTGGCGCGG GCTGGCTCGT ACTCGACGCA CGGCGGCGCG CGCGGCGCGG CGAGGCGTTG
CAACGGCAGG CGCGCAGGCT TGCGCGGATG GTGGACGAAT CTCCCGCCCT GCCACTGCTG
GTCCGCGCCG ACGGGAGGAT CGAAGGACCC GCGCGCCTGG CCCTGTGGCT CGGTTTCGAG
GCGATGCCCG GCTATCTTTC AGAACTCGAC GCCGGCGACC ATGGCCTCGA CGAAGTTGAG
CTGGCCCGGC TTGGCGATGC GGTGCGACGG GCGCAGAAGA CCGGCACGCC GTTCCGCATG
GCGCTTACCC CAAGGGGCGG CTCGCGCAGT CTGTGCGCGC AGGGTCACCT GGCCGATCCG
CAGGTCTCGC CGGGCGGGGC CGCGCTCGTC TGGTTCTTCG ACGTTTCCGA AAGCGAGGAG
GAACTGGTCG CGCTGCGGGC CGAGACGCGC AAGGCCAAGT CGGATTTCGC CGGGCTTTCG
GGCCTGATCG AGGCCGCGCC CTTGCCGATG TGGTTCCGCG GCCCTGACTT GCGCCTGCGC
CTGGTGAACA GCGCCTATGT CGCTGCCGTG GGCGCCGAGA GCGCGGACCA GGTGATCGCG
CAGGGTATCG AACTGATCGA GCCGGTAGAG GGGCTGTCCG CCGCGCAGGT CGCCCGGCAA
GCCCATTCGC GCAAGGTCTC CATCGAACGG TCCCTGCTGG CGACGATCAA GGGCCAGCGC
CGCGCGGTGC GGGTGACCGA CCTTCCGCTT GGCGACGAGG GGGTGGCCGG TTACGTCGTA
GACATCGAGG AAATGGAGGA ACTGATCCGC CAGTTCCGCC GGTTCCGCGA GGCGCAGCGC
GAGATGCTGG ATACGCTTTC CGCCGGGATC GCGCAGTTCG ACGAGAAGCG GAACCTCGTT
TTCGCCAACC AGCCATTCCT GCGCATCTTC TCGGTCCCGC AGGCGTGGGT GGTGGACACG
CCGCCGTTCG ACCGCGTGCT GGACCGGATG CGCGACGCGG GCCGCCTGCC TGAAGTGCGC
GACTTTCCGG AATGGCGCCG CGAGCGGCAG GGATGGTTCC TTGCCCGCGA GCCGATCGAG
GAGCCCTGGC ACCTTTCGGA CGGTACGCAC CTGCGCGTCG TCGGCCAGCC CATGCCGGAC
GGCGGGTTGC TGATGATCTT CGAGGACCGC ACCGAGCAGC TCCAGCTTTC GGCAACTCGC
GACACGCTGC TGCGCACGCG TACGGCCACT TTCGACAACC TGTTCGAATC GCTCGCCGTA
TTCGCGCCAG ACGGCCGGTT GCAGCTCTGG AACCGCCGGT TTGCGGCTGA CTGGGGGCTC
GACGAGGAGT TCCTGGCAAC GCATCCCCGT GCCGACGTCC TGCTCAGCCG CATTGCCGAC
CAGCTCAAAC GGCCCGCGCA AGTTGGCACG GTGGCGCAGG TCATCCAGGG GGCGACGCTG
GAGCGCAAGC AGCGCAGTGG CAGGTTGGCG CTGGCTGACG GGCGGCATCT GGCGCTGGCC
GGCGTGCCAT TGCCGGATGG AAACGGCCTG CTGACCGTGC TCGACATCAC CGACAGCCAG
AAGGCCGAGG CGGCGCTGCG CGAACGGAAC GCCGCGCTGG TCGAGGCGGA CGCGGTCAAG
ACGCGCTTCA TCGCCAACAT GAGCTATGAA TTCCGGACGC CGCTGACGTC CATCGGCGGC
TTTGCGGAAC TGCTTCAGAG CGGCATCGCC GGCGATCTGA CCGAGCAGCA GAACGAATAC
GTCGAGGCGA TTCTCACGTC GGTAGAGCGC CTGGGCGAGC AGATCGAGAA CGTGCTGGAC
CTCTCGCAGA GCGAGGCCGG AACGCTGCCG CTGGCGCAGG AACCGGTGGA GATTTTCCCG
CTCCTGACCG ATGTGGTGAC CGAACGCACC GAGCGCCTTG CCGGTGCCGG CATCACGCTC
GATCTGAGGG GCGACAGGTC CGCAGGCATG GTAACGGGCG ATGCGCGGCG GCTGCGGCGC
GCTTTCGGCC AGCTCATCGA CAATGCGATT GCGGCAACGC CCGAGGGCGG GCGCATCCTG
GTCGAGGCGT CGCGCAAGAA GGGCGGAGCG CTCCAGGTCG TCGTTTCGGA CAACGGGCGC
GGGATGGAGC CGGCCGTGCT GGCGCGGGCG CTCGACGGCC TGAAGGTCAG CGCGGATGGC
AAGGCGGTCG AACGGCGGCA GGGATTGGGG CTTCCGCTGG TCCGCCAACT GGTCGAGGCG
CACGGCGGAA ACCTGGAGCT GATGTCCGAG CCGGGGCTGG GCACGAGCGC GATCGTGATG
CTGCCGTGA
 
Protein sequence
MPLLNQTALV LIGLLLAAWT VGAGWLVLDA RRRARRGEAL QRQARRLARM VDESPALPLL 
VRADGRIEGP ARLALWLGFE AMPGYLSELD AGDHGLDEVE LARLGDAVRR AQKTGTPFRM
ALTPRGGSRS LCAQGHLADP QVSPGGAALV WFFDVSESEE ELVALRAETR KAKSDFAGLS
GLIEAAPLPM WFRGPDLRLR LVNSAYVAAV GAESADQVIA QGIELIEPVE GLSAAQVARQ
AHSRKVSIER SLLATIKGQR RAVRVTDLPL GDEGVAGYVV DIEEMEELIR QFRRFREAQR
EMLDTLSAGI AQFDEKRNLV FANQPFLRIF SVPQAWVVDT PPFDRVLDRM RDAGRLPEVR
DFPEWRRERQ GWFLAREPIE EPWHLSDGTH LRVVGQPMPD GGLLMIFEDR TEQLQLSATR
DTLLRTRTAT FDNLFESLAV FAPDGRLQLW NRRFAADWGL DEEFLATHPR ADVLLSRIAD
QLKRPAQVGT VAQVIQGATL ERKQRSGRLA LADGRHLALA GVPLPDGNGL LTVLDITDSQ
KAEAALRERN AALVEADAVK TRFIANMSYE FRTPLTSIGG FAELLQSGIA GDLTEQQNEY
VEAILTSVER LGEQIENVLD LSQSEAGTLP LAQEPVEIFP LLTDVVTERT ERLAGAGITL
DLRGDRSAGM VTGDARRLRR AFGQLIDNAI AATPEGGRIL VEASRKKGGA LQVVVSDNGR
GMEPAVLARA LDGLKVSADG KAVERRQGLG LPLVRQLVEA HGGNLELMSE PGLGTSAIVM
LP