Gene Saro_1728 details


Gene Information

Locus tag: Saro_1728
Symbol:
ID: 3916303
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1820365
End bp: 1821411
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 64%
IMG OID: 640444469
Product: LacI family transcription regulator
Protein accession: YP_497002
Protein GI: 87199745
COG category: [K] Transcription
COG ID: [COG1609] Transcriptional regulators
TIGRFAM ID:
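
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is an illustration only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1820365
end_bp = 1821411
gene_length_bp = 1047
protein_length_aa = 348

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codons = span // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates agree with Gene Length
print(span % 3 == 0)                                 # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # agrees with Protein Length
```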


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.531296
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGTGAAA TGAACATCGC AAGTACGAGC CGCGGCAACG CACACGCAAC GATGGAGGAC 
GTGGCAAAAC TCGCTGGCGT CTCGCTAAAG AGCGTGTCTC GCGTAATCAA CGCCGAGCCG
CACGTATCGG CAAAGCTGAG GGCAAAGGTC GAGGCGGCGA TTGCCGAACT CAATTACGTT
CCGGATACGG CAGCGCGCTC GCTTGCCGGA TCGCGGGCTT TCATCGTCGG CCTGCTGTTC
GACAACCCCA GCCCGAACTA CACGATGAAC ATCCAGAAGG GCGTGTACGA GACCTGCCGC
GACCAGCAGC ACCACCTGCG CATCGACAAC ATCGATTCGA CGGTTCCCGC CGAAAAGTTC
GAGGCACAAC TGGCGGCGAT GGTGCGCAAC AGCCGATGCG ACGGGTTCGT GCTAACGCCT
CCCCTTACCG ACAACGTGGT GCTACTCGAT TTCCTCGACC GTAGCGGCAT CCGCTATGTA
CGCATTGCGC CGGACATTCA GCCCGACCGA TCGCCCGGGG TCTGCATCGA TGACGCGGCA
GCAGCAGCCG CCGCCGCGCG CCACCTGTGG GAACTGGGGC ACAGGCGCTT TGCCGTGGTG
CGCGGGCCCG CCAGCCACGG CGCGGCGGGA CGGCGACGCC AAGGCTTCAT CGACGAGTTG
CACAGGCTCG GCGCGGAGAA CCCCATCATC GAGGCGGAAG GCAATTTCAG CTTCGAAAGC
GGCATCGCAG CGGGTGCGAA GGTTCTGGCG GCAACCCCCC GCCCGACCGC GATCTTTGCC
GCGAACGACG ATTCGGCCGC AGGCGTCATG GTCGCCTGCT CGCAGGCCGG ACTGAAAGTG
CCGAACGACG TTTCAGTCTG CGGCTTCGAC GATAGCTGGG TGGCGAAGTC GGTCTGGCCC
TATCTGACCA CCGTCTACCA GCCCATCGAG GAGATGGGCC GGGCCGCCGC GGCGCTGTTG
CTACGCCGCG ACGAGCCCGA CAATGTCCTC CACGAACTGG ATTTCAGTCT CGTCGTCAGG
GCTTCGACGG CACCCCCGCC CCAATAG
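
The gene sequence above is printed in blocks of ten bases, so the whitespace has to be removed before any calculation such as GC content. The following is a minimal sketch of that step. The function name `gc_content` is hypothetical, and the sample input is only the first line of the sequence, so its result is not the whole-gene figure reported in the record.

```python
def gc_content(blocked_sequence: str) -> float:
    """Return the G+C fraction of a sequence printed in whitespace-separated blocks."""
    # Join the blocks into one continuous string and normalise case.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence, used only as a sample input.
first_line = "ATGGGTGAAA TGAACATCGC AAGTACGAGC CGCGGCAACG CACACGCAAC GATGGAGGAC"
print(f"{gc_content(first_line):.0%}")
```

To check the reported GC content, pass the full gene sequence (all lines joined) to the same function.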
 
Protein sequence
MGEMNIASTS RGNAHATMED VAKLAGVSLK SVSRVINAEP HVSAKLRAKV EAAIAELNYV 
PDTAARSLAG SRAFIVGLLF DNPSPNYTMN IQKGVYETCR DQQHHLRIDN IDSTVPAEKF
EAQLAAMVRN SRCDGFVLTP PLTDNVVLLD FLDRSGIRYV RIAPDIQPDR SPGVCIDDAA
AAAAAARHLW ELGHRRFAVV RGPASHGAAG RRRQGFIDEL HRLGAENPII EAEGNFSFES
GIAAGAKVLA ATPRPTAIFA ANDDSAAGVM VACSQAGLKV PNDVSVCGFD DSWVAKSVWP
YLTTVYQPIE EMGRAAAALL LRRDEPDNVL HELDFSLVVR ASTAPPPQ