Gene Saro_2051 details

Gene Information

Locus tag: Saro_2051
Symbol:
ID: 3917698
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2191236
End bp: 2193557
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 67%
IMG OID: 640444803
Product: hypothetical protein
Protein accession: YP_497324
Protein GI: 87200067
COG category:
COG ID:
TIGRFAM ID: [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
            [TIGR03607] patatin-related protein
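
The coordinates and lengths in the table above are mutually consistent. As a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, although it agrees with the listed gene length), the arithmetic can be checked as follows:

```python
# Coordinates copied from the Gene Information table.
# Assumption: both positions are 1-based and inclusive.
start_bp = 2191236
end_bp = 2193557

gene_length = end_bp - start_bp + 1   # count both endpoints
codons = gene_length // 3             # includes the terminal stop codon
protein_length = codons - 1           # the stop codon encodes no residue

print(gene_length, protein_length)    # 2322 773
```

Both values match the Gene Length and Protein Length entries of the record.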


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.475949
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCTGCATC TCATGCGGCA GAAGGAATTG CGCATCGCGC TGGTCTGCTA CGGCGGCATC 
AGCCTCGCAG TGTACATGCA CGGCGTGACG AAGGAGTTGT GGCACGCGAC ACGCGCCAGC
CGTGCCTTCC ATCGCCCGGA ATCGCCCACG TCGGTCGGCG TGGAAGCCGT CTATCTCCGC
CTGTTCGAGG AAATCGAACT CACCAGGGAC CTGCGCCTGC GCCTTCTTGC CGACATCGTC
TCCGGGGCGA GCGCCGGCGG CATAAACGGC GTATTCCTGG CCCAGGCGAT CGCGACCGGG
CAAAGTCTCG AACCATTGAC CGCGATGTGG CTCGAACGCG CCGACATCGA CGTACTTCTC
GATCCGGATG CCAAGCCGGT AAGCCGGTTT GCCAAGTTCT GGGCGCAGCC GCTGGTGTGG
TTCATTCTCA AGCGCCCCGG CAACGTGGTC ACGAGCACCG TCGCGCCCGA AACACGGGCC
GAAGTCCGGC GCAAGGTCTC CCGCCTGATC CGCGCGCGCT GGTTCGCCCC GCCGTTCAGC
GGCATCGGCT TCTCGCGGAT GATCGCCGAC GCGCTCGATG CAATGAAGGC CGCACCCGAG
GGCGACCCGC TCCTGCCGGA AGGGCACCCC ATCGACCTGC TGGTCACGGC AACCGACTTC
AACGGCCACC TCGAGGAACT GCGCCTCAAC AGCCCGGCGA CGGTGGTGGA AAGCGAGCAT
CGCCTGTCCA TCGCCTTTCG CCGCGCCACG AGCCGCAATG GCGACCGCGA TCTGGCGCCC
TGCCCCGAAC TGGTCCTCGC GGCACGCTCG ACGGCCAGTT TCCCGGGGGC CTTCCCGCCG
CTCGCGGTCG ACGAAATCGA CCGGCTCGTC TCCGAACGCG GCGCATCATG GCCAGGGCGC
GACCGATTTC TCGAGCGCAT CATGCCCAGC CACTGGAAGC GCGGGGAGAT ACAAGGCGTG
GCGCTGGTGG ATGGCGCGGT GCTGGTGAAC CGTCCTTTCG CGCAGGCGAT AGGCGTCTTG
CGCGACAGGC CGGCCCGGCG CGAGGTGGAC CGCCGCTTCG TCTATATCGA CCCAAGCCCC
GATCACATGC GCATCGTCCC CAGGGGCGGG CGCACCGTTG GTTTCTTTTC CGCGATATTC
GGTTCCCTGT CCTCGATCCC GCGCGAGCAG CCGATCCGCG ACAACCTGGA ATCCCTCGAA
CGCCAGTCCC GCCAGCGCGC CCGGCTTCGC GCCATCGTCG ATGCGCTTCG GCCCGAGATC
GAGGAAACGG TCGAGGGCCT GTTCGGTCGT ACGTTGTTCT TCGACAGCCC CAACGCCCGC
CGGCTCACCG CGTGGCGCAA CAAGGCGCAG CAGGCGGCAG CCGAGCGCGC GGGCTTCGCC
TTTCACGGCT ATGCCCAGAC CAAGCTTGCA GGGATCGTGT CGGACCTTGC CGAACTTATC
CATGAAGCCG CCCCGCGCGC CGCCCCTGCC GACAAGATCG AGACGGCGCT GTGGACGCAT
CTTGCGGAGA ACGGAATGGA CCGCCTGTCC GCGATGCGCG GCGGCGCCAC GGAACAGGCG
ATCGTGTTCT TCCGGACGCA CGACCTCGCC TTTCGCATCC GCCGCCTCAA ACTGCTCGCG
CGGCGCCTGA CGCATGACTG GGACATCGGC GAAAGCATCG ACCCCGCCGC CCGCGAAGCC
GCGCGCGATG CCGTCTACCG GGCGCAGGCG CTCTACCAGG GGCGGGAAAC GGCGGAAGGT
CTGGGCCCGG GATTTGCAGA GACCGCCAGC GTCGCAACGG TCGATCCAGG CGCGGCGCTT
GCCGCCATCG CCAGCCGCCG CGACCTCGTT GGCCTCGATA TCCAGGTCGA CGCGATGCTT
GTCGCCGCGC TCGACGCAAT GGACAAGCCG CTGCGTCGGC GCTTCCTCCA CGCCTACCTC
GGCTTTCCCT TTTACGATAT CGCGACCCTG CCCCTTCTCC AGGGCGAAGG CATGGGCGAA
TTCGATCCGG TCAAGGTCGA CCGGATCTCG CCCGACGATG CTACCTCGAT CCGCAAGGGC
GGCACCTTCG CCTGCCTGCG CGGGGTGGAG TTCTTCAACT TCGGGGCATT CTTCAGCCGC
GCCTACCGCG AGAACGATTA CCTGTGGGGC AGGCTCCATG GGGCTGAGCG GATGATCGAC
TTGATGGCAT CAACCGCCGA AACGCCCCTT CCAGATCAGC TCATCCGCAC GTTCAAGCGG
GATGCGTTCC TCGCCATACT CGACGAGGAA CAAGGCAAGT TGCTGGCCGA ACCCGGCCTG
GTCGATACGG TGCGCAAGGA AGTTCACGCA AGCTTCGCTT GA
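
The gene sequence above is printed in blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how the length and GC fraction could be recomputed from such a block; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied as printed (with spaces).
first_line = "ATGCTGCATC TCATGCGGCA GAAGGAATTG CGCATCGCGC TGGTCTGCTA CGGCGGCATC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence block to the same two functions would give the values to compare against the Gene Length and GC content fields.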
 
Protein sequence
MLHLMRQKEL RIALVCYGGI SLAVYMHGVT KELWHATRAS RAFHRPESPT SVGVEAVYLR 
LFEEIELTRD LRLRLLADIV SGASAGGING VFLAQAIATG QSLEPLTAMW LERADIDVLL
DPDAKPVSRF AKFWAQPLVW FILKRPGNVV TSTVAPETRA EVRRKVSRLI RARWFAPPFS
GIGFSRMIAD ALDAMKAAPE GDPLLPEGHP IDLLVTATDF NGHLEELRLN SPATVVESEH
RLSIAFRRAT SRNGDRDLAP CPELVLAARS TASFPGAFPP LAVDEIDRLV SERGASWPGR
DRFLERIMPS HWKRGEIQGV ALVDGAVLVN RPFAQAIGVL RDRPARREVD RRFVYIDPSP
DHMRIVPRGG RTVGFFSAIF GSLSSIPREQ PIRDNLESLE RQSRQRARLR AIVDALRPEI
EETVEGLFGR TLFFDSPNAR RLTAWRNKAQ QAAAERAGFA FHGYAQTKLA GIVSDLAELI
HEAAPRAAPA DKIETALWTH LAENGMDRLS AMRGGATEQA IVFFRTHDLA FRIRRLKLLA
RRLTHDWDIG ESIDPAAREA ARDAVYRAQA LYQGRETAEG LGPGFAETAS VATVDPGAAL
AAIASRRDLV GLDIQVDAML VAALDAMDKP LRRRFLHAYL GFPFYDIATL PLLQGEGMGE
FDPVKVDRIS PDDATSIRKG GTFACLRGVE FFNFGAFFSR AYRENDYLWG RLHGAERMID
LMASTAETPL PDQLIRTFKR DAFLAILDEE QGKLLAEPGL VDTVRKEVHA SFA
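
The protein sequence above can be related back to the gene sequence by codon translation. The sketch below is an illustration, not the database's own procedure. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; this sketch does not model alternative start codons, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(text.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied as printed.
print(translate("ATGCTGCATC TCATGCGGCA GAAGGAATTG"))  # MLHLMRQKEL
```

The output matches the first ten residues of the protein sequence; the final twelve bases of the gene, `AGCTTCGCTT GA`, translate to `SFA` followed by the stop codon.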