Gene Saro_1887 details


Gene Information

Locus tag: Saro_1887
Symbol:
ID: 3917108
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1993798
End bp: 1994832
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 66%
IMG OID: 640444631
Product: LacI family transcription regulator
Protein accession: YP_497161
Protein GI: 87199904
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
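
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are inclusive (an assumption, though it matches the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 1993798
end_bp = 1994832
gene_length_bp = 1035
protein_length_aa = 344

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A non-spliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is the stop codon and encodes no residue.
coded_residues = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("residues match protein length:", coded_residues == protein_length_aa)
```

All three checks hold for this record: 1035 bp is 345 codons, which gives 344 residues plus one stop codon.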


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.587669
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGCGTA AGCCATCCAA CAAGCCGACG AGTTTCGACA TTGCCTACCT CGCCGGCGTG 
TCCCAACCTA CCGTCAGCCG TGCGCTCAGG GGCAGCAAGT CGGTCAGCCT CGCCACGCGC
CAGAAGATCG AGGCGATCGC GCGCCAGCTC AACTATACGG TCGACAAGAA CGCTTCGTCA
TTGCGCTCGC AGCGATCGAA CACCCTCGCC CTGCTGTTCT TCGAGGACCC GACGCCGGAC
GAATCGAACA TCAACCCGTT CTTCCTCGCC ATGCTCGGCT CGATCACCCG GCACTGCGCC
AATCGCGGCC TCGACCTGCT GATCTCGTTC CAGAAGCTCG ATGACGACTG GCACAAGCGC
TACCAGGACA GTCATCGCGC CGACGGGCTG ATCCTGCTCG GCTACGGTGA CTACACTCTC
TACGGATCCC GCCTGCGCCA GCTCATCCGC TCGGGCACGC ATTTCGTGCG CTGGGGCTCG
GTAGACGAAG GCACCATCGG GGCGACAATC GGGTCCGACA ACTTCGGCGC CGGACGCCTG
GCGGGCGAGC ACCTCCTTGC CCGGGGCCGC AAGCGCATTG CCTTCCTCGG CCAGGCGGAT
TCGCACTATC CAGAGTTCGA GCAGCGCTAC GCAGGCCTGT CCAAGGCCAT CCGCACAGCC
GGGCTGGAGC CCGATCCGGA CCTTGTCGTC GATGCGACCT CGTCCGAGGA AATCGGCTAC
AACGCCGCGC GGGAGCTGCT GTCGCGCGGC AAAACCTTCG ATGCCATCTT CGCCGCGAGC
GACCTGATCG CCATCGGCGC GATGCGCGCG CTTGCCGAAG CCGGTCGTTC CGTGCCCGCC
GATGTCGCGG TCGTCGGTTT CGACGACATC CCGGCCGCCA GCCTGACCAC GCCGCCATTG
ACCACCATCA TGCAGGATAC GCGGCTTGCC GGTGAGGCTC TGGTCGATTG CGTGCTCGGG
CAGGTCGAAG GCCGCCCACC CAGCCCGCGC ATCCTCCCCG CACGACTTGT CGTCAGGGCC
AGCAGCGGCG GCTGA
 
Protein sequence
MGRKPSNKPT SFDIAYLAGV SQPTVSRALR GSKSVSLATR QKIEAIARQL NYTVDKNASS 
LRSQRSNTLA LLFFEDPTPD ESNINPFFLA MLGSITRHCA NRGLDLLISF QKLDDDWHKR
YQDSHRADGL ILLGYGDYTL YGSRLRQLIR SGTHFVRWGS VDEGTIGATI GSDNFGAGRL
AGEHLLARGR KRIAFLGQAD SHYPEFEQRY AGLSKAIRTA GLEPDPDLVV DATSSEEIGY
NAARELLSRG KTFDAIFAAS DLIAIGAMRA LAEAGRSVPA DVAVVGFDDI PAASLTTPPL
TTIMQDTRLA GEALVDCVLG QVEGRPPSPR ILPARLVVRA SSGG