Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1652 |
Symbol | |
ID | 3918761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1727025 |
End bp | 1728656 |
Gene Length | 1632 bp |
Protein Length | 543 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444393 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_496926 |
Protein GI | 87199669 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCAG GAATCTCGCG TCGCGACATG ATCCGCGCGG GCGCTGCCGG CGCCGCGCTC CTCTCCTCCC GGGCATTCGC CAGTCCGCTC GACGGCCCGC GCATCATGCC GCCGGGCCTG GCCGCTGACC GCTTTGCCGC TGCTGTCAAG GAACTGCGGG CCGTTGTCGG CACGGACTGG GTCTTTGCGG ATGCCGAAAG CACGCTGCCC TACGCCAGCA CCTTCACCCC CGACCCCGAC GGTCGGCACC TGCCTTCGGG CGCGGTGGCC CCGGCTTCGG TCGAGGAAGT ACAGGCGGTG CTGAAGGTGG CGAACAAGTA CGGGCTGCCG CTCTGGCCGG TCTCCACCGG CAAGAACATG GGTTATGGCA ATGCAACGCC TGCGACCTCG GGCCAGATGG TGCTCGACCT CAAGCGCATG AACCGGATCA TCGAGGTAGA CGCCGAACTC GGTACTGCGC TGGTAGAGCC GGGCGTGACC TACCAGGACC TCCACGACTA CCTGCAGGAA CACAATCTGC CCTACTGGGT CGACGTGCCC ACGGTCGGGC CGATCGTGTC GCCGCTCGGC AACACGCTGG AACGCGGGGT GGGCTATACC CCTTATGGCG ACCATTTCTT CATGCAGTGC GGCATGGAAG TCGTGCTGGC CGATGGCACG GTCGTGCGGA CCGGGATGGG CAGCGTGAAG AACTCGACCA CCTGGCAGGC GTTCAAGTGG GGCTACGGCC CCTACATCGA CGGCCTGTTC ACCCAGTCGA ACTTCGGCGT GGTGACCAAG CTCGGCATGT GGTTGATGCC GGCGCCGCCA GCCTACAAGC CCTTCATGGT CCGTCACATG GAAGTGGCCG ACGTGGCGCG GATCGTCGAT GCGATCCGCC CGTTCCGCAT GAACAACCTC ATCCCCAATT GCGTCTTGAT GATGGGCGCG GCCTACCAGC TCGCGATGTT CAAGCGCCGC GCCGACATCT GGACCGAGCA GCGCTCCGTT CCGGATGACG TGATCCGGGC CGAGGCTATG CGGAACGGCC TCGGCATGTG GAACACCTAT TTCGCGCTCT ACGGTACCGA TGAGATCATC GCTGCGGTGG AACCCATCGT TCGCTCCGCC TTCGAGGCGA CCGGCGGCGA GGTACTGACC GAGAGGGAAA TGTCCGGCAA CCCGTGGTTC GAACATCACA AGTCGCTGAT GCGTGGCGGC ATGACGTTGG AGGAGATCGG CATCGTGCGC TGGCGCGGGC CCGGTGGCGG GATGATCTGC TTTGCCCCGG TCGCTCCGGC CAAGGGCGTC GAGACCGCCG AGCAGACCGC GCTCGCCAAG GAAATCCTCG GCAAGTACGA CTTCGACTAC AACGGTGCCT TCGCCATCGG CAGCCGCGAA CTGCACCACC TGATCTTCCT GCTGTTCGAC AAGGATGATC CGGCCGAGGA ACGCAAGGCG CAGGACTGCA TGGAAGAGAT GATCCTGCGC TTCGGCGACA AGGGCTGGGC CGCGTATCGC ACCGCCGTCA GCACCATGGA TCTCGTAGCA GGCCAGTACG GCGAGGCGAA TAGGATGCTC AATCGGCGCC TGAAGGCGGC GCTCGACCCA AACGGTGTCA TCGCGCCCGG AAAATCGGGG ATCACGCTTT GA
|
Protein sequence | MTAGISRRDM IRAGAAGAAL LSSRAFASPL DGPRIMPPGL AADRFAAAVK ELRAVVGTDW VFADAESTLP YASTFTPDPD GRHLPSGAVA PASVEEVQAV LKVANKYGLP LWPVSTGKNM GYGNATPATS GQMVLDLKRM NRIIEVDAEL GTALVEPGVT YQDLHDYLQE HNLPYWVDVP TVGPIVSPLG NTLERGVGYT PYGDHFFMQC GMEVVLADGT VVRTGMGSVK NSTTWQAFKW GYGPYIDGLF TQSNFGVVTK LGMWLMPAPP AYKPFMVRHM EVADVARIVD AIRPFRMNNL IPNCVLMMGA AYQLAMFKRR ADIWTEQRSV PDDVIRAEAM RNGLGMWNTY FALYGTDEII AAVEPIVRSA FEATGGEVLT EREMSGNPWF EHHKSLMRGG MTLEEIGIVR WRGPGGGMIC FAPVAPAKGV ETAEQTALAK EILGKYDFDY NGAFAIGSRE LHHLIFLLFD KDDPAEERKA QDCMEEMILR FGDKGWAAYR TAVSTMDLVA GQYGEANRML NRRLKAALDP NGVIAPGKSG ITL
|
| |