Gene Saro_1652 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1652 
Symbol 
ID3918761 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1727025 
End bp1728656 
Gene Length1632 bp 
Protein Length543 aa 
Translation table11 
GC content65% 
IMG OID640444393 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_496926 
Protein GI87199669 
COG category[C] Energy production and conversion 
COG ID[COG0277] FAD/FMN-containing dehydrogenases 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCAG GAATCTCGCG TCGCGACATG ATCCGCGCGG GCGCTGCCGG CGCCGCGCTC 
CTCTCCTCCC GGGCATTCGC CAGTCCGCTC GACGGCCCGC GCATCATGCC GCCGGGCCTG
GCCGCTGACC GCTTTGCCGC TGCTGTCAAG GAACTGCGGG CCGTTGTCGG CACGGACTGG
GTCTTTGCGG ATGCCGAAAG CACGCTGCCC TACGCCAGCA CCTTCACCCC CGACCCCGAC
GGTCGGCACC TGCCTTCGGG CGCGGTGGCC CCGGCTTCGG TCGAGGAAGT ACAGGCGGTG
CTGAAGGTGG CGAACAAGTA CGGGCTGCCG CTCTGGCCGG TCTCCACCGG CAAGAACATG
GGTTATGGCA ATGCAACGCC TGCGACCTCG GGCCAGATGG TGCTCGACCT CAAGCGCATG
AACCGGATCA TCGAGGTAGA CGCCGAACTC GGTACTGCGC TGGTAGAGCC GGGCGTGACC
TACCAGGACC TCCACGACTA CCTGCAGGAA CACAATCTGC CCTACTGGGT CGACGTGCCC
ACGGTCGGGC CGATCGTGTC GCCGCTCGGC AACACGCTGG AACGCGGGGT GGGCTATACC
CCTTATGGCG ACCATTTCTT CATGCAGTGC GGCATGGAAG TCGTGCTGGC CGATGGCACG
GTCGTGCGGA CCGGGATGGG CAGCGTGAAG AACTCGACCA CCTGGCAGGC GTTCAAGTGG
GGCTACGGCC CCTACATCGA CGGCCTGTTC ACCCAGTCGA ACTTCGGCGT GGTGACCAAG
CTCGGCATGT GGTTGATGCC GGCGCCGCCA GCCTACAAGC CCTTCATGGT CCGTCACATG
GAAGTGGCCG ACGTGGCGCG GATCGTCGAT GCGATCCGCC CGTTCCGCAT GAACAACCTC
ATCCCCAATT GCGTCTTGAT GATGGGCGCG GCCTACCAGC TCGCGATGTT CAAGCGCCGC
GCCGACATCT GGACCGAGCA GCGCTCCGTT CCGGATGACG TGATCCGGGC CGAGGCTATG
CGGAACGGCC TCGGCATGTG GAACACCTAT TTCGCGCTCT ACGGTACCGA TGAGATCATC
GCTGCGGTGG AACCCATCGT TCGCTCCGCC TTCGAGGCGA CCGGCGGCGA GGTACTGACC
GAGAGGGAAA TGTCCGGCAA CCCGTGGTTC GAACATCACA AGTCGCTGAT GCGTGGCGGC
ATGACGTTGG AGGAGATCGG CATCGTGCGC TGGCGCGGGC CCGGTGGCGG GATGATCTGC
TTTGCCCCGG TCGCTCCGGC CAAGGGCGTC GAGACCGCCG AGCAGACCGC GCTCGCCAAG
GAAATCCTCG GCAAGTACGA CTTCGACTAC AACGGTGCCT TCGCCATCGG CAGCCGCGAA
CTGCACCACC TGATCTTCCT GCTGTTCGAC AAGGATGATC CGGCCGAGGA ACGCAAGGCG
CAGGACTGCA TGGAAGAGAT GATCCTGCGC TTCGGCGACA AGGGCTGGGC CGCGTATCGC
ACCGCCGTCA GCACCATGGA TCTCGTAGCA GGCCAGTACG GCGAGGCGAA TAGGATGCTC
AATCGGCGCC TGAAGGCGGC GCTCGACCCA AACGGTGTCA TCGCGCCCGG AAAATCGGGG
ATCACGCTTT GA
 
Protein sequence
MTAGISRRDM IRAGAAGAAL LSSRAFASPL DGPRIMPPGL AADRFAAAVK ELRAVVGTDW 
VFADAESTLP YASTFTPDPD GRHLPSGAVA PASVEEVQAV LKVANKYGLP LWPVSTGKNM
GYGNATPATS GQMVLDLKRM NRIIEVDAEL GTALVEPGVT YQDLHDYLQE HNLPYWVDVP
TVGPIVSPLG NTLERGVGYT PYGDHFFMQC GMEVVLADGT VVRTGMGSVK NSTTWQAFKW
GYGPYIDGLF TQSNFGVVTK LGMWLMPAPP AYKPFMVRHM EVADVARIVD AIRPFRMNNL
IPNCVLMMGA AYQLAMFKRR ADIWTEQRSV PDDVIRAEAM RNGLGMWNTY FALYGTDEII
AAVEPIVRSA FEATGGEVLT EREMSGNPWF EHHKSLMRGG MTLEEIGIVR WRGPGGGMIC
FAPVAPAKGV ETAEQTALAK EILGKYDFDY NGAFAIGSRE LHHLIFLLFD KDDPAEERKA
QDCMEEMILR FGDKGWAAYR TAVSTMDLVA GQYGEANRML NRRLKAALDP NGVIAPGKSG
ITL