Gene Saro_3775 details


Gene Information

Locus tag: Saro_3775
Symbol:
ID: 5077923
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 414152
End bp: 415393
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 69%
IMG OID: 640481498
Product: MOFRL domain-containing protein
Protein accession: YP_001166160
Protein GI: 146276000
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2379] Putative glycerate kinase
TIGRFAM ID:
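
The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(414152, 415393)
print(length, protein_length_aa(length))
```

Under those assumptions the coordinates reproduce the listed Gene Length and Protein Length values.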


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.801472
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAACG ACGAGCAGGC ACGCGGCATT GTGGAGAACG TATTCCGGGC GGCGCTCGAC 
GCGGCCATGG CAGGGCCCGC CGTGCTGCGC CATCTGCCGG AAAAACCGCA AGGACGGTGC
ATCGTCGTGG GGGCGGGGAA GGCGAGTGCC GCCATGGCCG CGGCAGTTGA CGCCGCCTGG
CCCGACGTGG CGCTCACCGG CGTGATCGCG ACGCGGTACG GCCACGCAGT GGAAGCAGGG
AGGATTGCGG TCTTCGAGGC GGGTCATCCC GTGCCGGATG AAAACTCCGT TCGCGCGGCC
CGGAGAATGC TGGAAGCGGT ACGCGGGCTA GGCCCCGACG ATCTCGTGCT TGCACTTGTA
TCGGGTGGCG GATCAGCCTC GCTCGCGCTT CCGATGGACG GAATGGATCT TGCCGGGAAG
CAGGCGGTGA CGCGCGCGTT GCTCAACAGT GGCGCTCCGA TCGGCGAGAT CAACACCGTT
CGGCGCCATC TCTCCGGCAT CAAGGGCGGG CGGCTGGCAG CGGCTGCCCG TCCTGCCCGG
GTCGTGACGC TGCTCATAAG CGACGTGCCA GGCGACGATC CCGCAGCAAT CGCCTCCGGC
CCGACGCTGG CCGACAGTTC CACCCCTGCG GATGCCGTCG CCATACTGGA ACGCCACGGC
ATTCCGGTAC CCCAGGCACT GCGCAACGCG AGGCCTGCCC CTTCCCCTGC CGACAATGGC
GAATGCCACC TTGTCGCGAC ACCGTCGCGC GCGCTCGACG CAGCAGCAGC GCGAGGGCGG
GCACTTGGCT GCGATGTGGT GAACCTGGGC GATGCGCTGG AGGGAGAAGC GGCGGACCTG
GGCCGCGAAC TCGCGCGTGA CGCGCTTGAG CGCGGGCGTA GCGCTGCCGG GCCGCTATTG
CTGTTGTCGG GCGGTGAGAC GACCGTCACG ATCGGCCCTG AAGGCGCCGG CGAAGGCGGA
CGCAACTGCG AGTTCCTGCT CTCGCTCGCT GTTGCGTTCG ACGGGGCCTC CGGTGTCTTC
GCCCTTGCGG CAGACACGGA CGGGATCGAC GGGACCAGCG ATGCCGCCGG CGCCTTCGTC
ACCCCATCCA CGCTCGCTCG CGCGCAGGCG CTCGGCCTCG ACCCGGTTGC CGCGCTGGCC
CGCCACGACA GCTACACGCT CTTTGCCGCA CTGGGCGATC TCGTCGTCAC CGGCCCCACC
CATACCAACG TCAACGACTT TCGCGCCGTT CTGGTTGGCT GA
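
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any calculation. The following is a minimal sketch of how the sequence could be cleaned up and its GC fraction computed; the helper names are hypothetical, and the record does not say how its own GC content figure was derived or rounded.

```python
def join_blocks(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above; paste the full block to check the whole gene.
first_line = "ATGATGAACG ACGAGCAGGC ACGCGGCATT GTGGAGAACG TATTCCGGGC GGCGCTCGAC"
seq = join_blocks(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```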
 
Protein sequence
MMNDEQARGI VENVFRAALD AAMAGPAVLR HLPEKPQGRC IVVGAGKASA AMAAAVDAAW 
PDVALTGVIA TRYGHAVEAG RIAVFEAGHP VPDENSVRAA RRMLEAVRGL GPDDLVLALV
SGGGSASLAL PMDGMDLAGK QAVTRALLNS GAPIGEINTV RRHLSGIKGG RLAAAARPAR
VVTLLISDVP GDDPAAIASG PTLADSSTPA DAVAILERHG IPVPQALRNA RPAPSPADNG
ECHLVATPSR ALDAAAARGR ALGCDVVNLG DALEGEAADL GRELARDALE RGRSAAGPLL
LLSGGETTVT IGPEGAGEGG RNCEFLLSLA VAFDGASGVF ALAADTDGID GTSDAAGAFV
TPSTLARAQA LGLDPVAALA RHDSYTLFAA LGDLVVTGPT HTNVNDFRAV LVG
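
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table by hand rather than relying on a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in the start codons it permits. The sketch does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The name translate_cds is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence; compare with the first 20 residues of the protein.
print(translate_cds("ATGATGAACG ACGAGCAGGC ACGCGGCATT GTGGAGAACG TATTCCGGGC GGCGCTCGAC"))
```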