Gene Saro_1553 details


Gene Information

Locus tag: Saro_1553
Symbol:
ID: 3917228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1609227
End bp: 1610573
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 63%
IMG OID: 640444293
Product: putative GAF sensor protein
Protein accession: YP_496827
Protein GI: 87199570
COG category: [T] Signal transduction mechanisms
COG ID: [COG1956] GAF domain-containing protein
TIGRFAM ID:
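
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon. The function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1609227, 1610573)
print(length, protein_length_aa(length))
```

Under these assumptions, the coordinates in this record give 1347 bp and 448 aa, which agrees with the listed values.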


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00827304
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACTGC GCCTCGCCGA CCTGGCTCCT TGCTTCGAGG GAGTGATTCC TTCGATCATC 
GCGACCGCAG CGGCCGACGG GACGCCGAAC GTTTCCTACC TGTCCCACGT CGTCCGGGTC
GATGACGAGC ATGTCGCGCT GTCGAACCAG TTCTTCGCCA AGACCGCCGC GAACATCCGA
GCTAATCCCC ATGTTACGCT GATCCTCGTC GATTGTTTTT CCGGGGAACA GTACCTGCTC
GACATCCGGT TCGTGCGGTC ACTCGACACT GGGCCATTGT TCGAGAAGAT CTCGATTCAA
CTCAAGGCAA GCAGCGCGCA GATCGGCATG TCCGAGATCA TGCGGCTAAG GAGTGCCGAC
GTATTCAGGG TGGAAGCGAT CGAGAGGGTT CCCTGTCCGG TTGACACCGG CCCGGCACAG
GTGCCCCGCC CGCCGGTAAG CCTTCCCGCG CTTGCGGACG GCTGCCGGGC CATCGAAAAT
CTGGCGGAGG TGGAAGATAT CATCGACTGC CTGCTCGACC GCGTTGTCGG CCTGCTTGGC
TATTCGCACG CGCTCGTTCT TGTCCCCGAT CCGGGCCGCG ACAGCTTCGT CACGACGGGC
AGCACAGGCT ACGACCCCTC CGGGATCGGC TCCGAGGTCA AAGGCAGCGA GGGCATGATC
GGTACGGCGG CCGCAAGCGG ACGCACGATC AAGGTTAGCG ACATGAGCCG CGTGCGTCGC
TTTGCCGAAG CAATCGATGC CGACGCAGGG CTGTCCGAAA ACACGTCGCG CGTGATCGAC
TTTCCCGGAC TTGCCGGCGT CATGAGCCAG ATCGCCGTGC CGATGGTTAC GCGGGGCGAA
ACGATCGGCA TCCTCTTCGT CGAAAGCCCG GAGCGCATGG CGTTTCACGA CGATGACGAG
GCAGCGCTGG AATTGCTATG CGCTGCGGCT GCGCGTGCGA TCGCGGCAGG TGAAAGCATT
GCGTCAAGGG ACGATGATGC TTTGCCAGGC GCCGCGCGAT CGCTGCCTGT AGCGAATGGC
GGCGCGATCC GCGTCACGCA TCACCGCCTC GACGACAGTA TCTTCGTGGA CGGCAACTAT
ATCGTGAAGG GGATCGCGGG CGCGGTGTTG CGCCGTATCA TCGAATGGCA CCTCGTTGAC
GGCAGGAACA CGTTCTGCAA CCGCGAACTG CGGCTCGCGC TCGCCGCGCG GATGCCCGAT
ATCAAGGACA ATCTGGAAAC GCGCCTGTTG CTGCTTCGCC GCCGGCTCGA AGAGAAGCAA
GCGCCGATCC AGATCGTCAG GACAGGTCGA GGAAGGCTGA GCCTTGAGGC GAAGGGGCCG
CTGCTCCTCG CAGCGGCCCA GGATTGA
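
The gene sequence above is printed in space-separated blocks of 10 bases. The sketch below shows one way the blocks could be joined into a single string and the GC fraction recomputed for comparison with the reported GC content. The helper names are hypothetical, and the short input is only the first two blocks of the sequence, used for illustration.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and upper-casing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGAAACTGC GCCTCGCCGA")
print(len(fragment), round(gc_fraction(fragment), 2))
```

Applying the same two functions to the full sequence text would give a value to compare against the GC content field; the rounding convention the database uses is not stated in the record.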
 
Protein sequence
MKLRLADLAP CFEGVIPSII ATAAADGTPN VSYLSHVVRV DDEHVALSNQ FFAKTAANIR 
ANPHVTLILV DCFSGEQYLL DIRFVRSLDT GPLFEKISIQ LKASSAQIGM SEIMRLRSAD
VFRVEAIERV PCPVDTGPAQ VPRPPVSLPA LADGCRAIEN LAEVEDIIDC LLDRVVGLLG
YSHALVLVPD PGRDSFVTTG STGYDPSGIG SEVKGSEGMI GTAAASGRTI KVSDMSRVRR
FAEAIDADAG LSENTSRVID FPGLAGVMSQ IAVPMVTRGE TIGILFVESP ERMAFHDDDE
AALELLCAAA ARAIAAGESI ASRDDDALPG AARSLPVANG GAIRVTHHRL DDSIFVDGNY
IVKGIAGAVL RRIIEWHLVD GRNTFCNREL RLALAARMPD IKDNLETRLL LLRRRLEEKQ
APIQIVRTGR GRLSLEAKGP LLLAAAQD