Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3335 |
Symbol | |
ID | 3915982 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3555912 |
End bp | 3558212 |
Gene Length | 2301 bp |
Protein Length | 766 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640446120 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_498604 |
Protein GI | 87201347 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCCTCA CGCGGAGGGG GCTGCTGGTG GGGGCAGGGC TCGGCGGAGC CCTGCTTATC GCCTTCCCGC TCATTCCGCG CCGCCACCCG GTTCCACTGC AGGCGGGCGA AGGCGAGCAC GTGGTCGACG CCTTCCTCAA GCTGGGGCGT GCCCGGGGCG GCAAGGACTG CATCCTGACC GTCGCGGTCC CGTTCTGCGA GATGGGACAG GGCATCACCA CGCTCGTCGC GCAGATCGTC GCGGACGAGG CCGGGGCCGA CTGGCGCAAG GTCGCGGTCG AGGCCGCGCC GATCAGTCCC GCCTATGCCG ACCCGGTGCT CTCGGCAAAG TGGGCGCCGT TGTGGATGCC GGCATTCGCT TCGCTGGGCA ATGACGCCGA GGGCACGCTC GCCCGCCTTC ATGCAGAGCG CGGGCCGATG ATGATTACCG CCGATGGCAC GGCGCTGGCG GCGTTCGAGA CTCCCTTGCG CGAGGCCGGG GCAGCCCTGC GCGCGATGAT GGCCCAGGCT GCCGCGGACA AGTGGGGCGT GGGCTGGGAG GAGTGCGAGA CCGGGGACAG CGCCGTGACC CACGGCAAGA AGCGCCTCTC CTTTGCCGAA CTGCTGGCGG ATGCGGTAGA GTACGACCCG CCCGACCTGC CCGTCCTGCG CGCCGAGCCC CCGCGCGAAA GGCCCGGCCA GTTTCCCGAA GGTGCCCCCG CCCGGCACCC GCGCCTCGAT CTTCCAGCCA AGGTCGACGG CAGCTTCACC TTTGCCGGCG ACGTGCGCCT GCCGGGCATG GTCCACGCCG CGATCGCCCA CGCGCCGCAA GGATCGGCGG TCCTGTCGAC CTATGACAAG CAGGCTGCGG CCTCGGTGCG CGGGCTGGTG GGCGTGGTTC ATGCCCGGCG CTGGCTCGCC GCCGTCGCCA CCAACTGGCA CGCCGCCGAC AAGGCCTTGC GCGCGATGGA GCCACGTTTT CGCGCCGATG GCCCGGTGGC CGACAACGAG AAGGTCCTCG TGGCGCTCGA CAAGGCGCTG GACAAGGGCG ACGCGGTGCG ACTCATGGCC GAGGGCGATC CCGATGCGCT GCTCGAAAAG CCGGTCCTCA GCGCCCGCTA CGATGTCGAA CCGGCCCTCC ACGCCCCGCT CGAAACCACC AGCGCCACCG CCCGCCTTCG CGATGGCAAG CTGGAGCTGT GGATCGCGAC CCAGGCTCCC GAGCGAGCGC GCCGCGCCGC CGCACGCGCG GCGGGTCTCT CGCGGCAGGA CGTGATCGTC TATCCGATGC ACGCGGGCGG CAGCTTCGAC GCTCGGCTCG ACGTGCGCAT CGCCGCCGAA GTTGCCACCA TCGCGACCAT CATCCGCAAG CCCGTGCAAC TGACGTGGTC GCGCTGGCAG GAATCGCTGG CGGGCATTCC CCGAACCCCG GTTTCGGCGC GGCTCGACGG CGCGCTCAGC CCCGACAAGT CGCGCGTGCT GGGCTGGCGC AGCCGCCTCG CACTTCCCGC CACGACAATC GAATCGGGCG CACGCCTGCT CGATGGGCAA GGCATCGGCG ATGCACTCGA CTTGCAGGAC CGGGCCGACC CGATGGCCTG TGAAGGCGCG ATGCCGCTCT ACCGCATTCC CGAAAAGGCG GTGGACCACG TCCCCGCCGC CCTGCCCCTG CCCACGGCGC GCTTTCGCGG ACAGGCGCAC GGCTACACCG CATTCTTCAC GGAAAGCTTC GTGGACGAGC TGGCCCATCT TGCGGGGCGC GAGCCACTGT CGTTTCGCGT CGGCATGCTC GATGGCCAGC CGCGCCTTGT CGCCTGCCTT TCCGGCGTCG CCAGGCTGGC GCAATGGGGC GGCGGGGTCG ATGCATCGGG ACAGGGCATT GCCTGCCATC GCATGGATCT TGCCTCGGGA GGCGGCGCGG TGCGTTCCGG CATGATCGCG GTCGTCGCCA CCGCACGGCA GGAAGCCGGC GTCGTGCGGG TCGAGCGGCT GAGCGCCTTT GTCGACATCG GCCGCATCGT GAACATGGAT ATCGCGCGCC AGCAGATCGA GGGCGGCTTG GTGTTCGGCC TCGCCCATGC GGTCGGCGGG TCGAGCGGCC ACGCTCGCGG AAGGCCGCTG GCGGGCCATC TCTCGCAACT CGGCCTGCCT CTGCTGGCCG ATTGCCCGAA GGTCGATATT GCCTTTGCCG ACAGCAACGA AGAGCCGTTC GACCCCGGCG AACTCGGAAT GGTCGCGGTG GCCCCGGCCA TCGCCAATGC GCTGTTCTCC GCCACGGGCG TGCGCTTCCG CCGCCTGCCG CTCATTTCCG AAGGACTTTG A
|
Protein sequence | MRLTRRGLLV GAGLGGALLI AFPLIPRRHP VPLQAGEGEH VVDAFLKLGR ARGGKDCILT VAVPFCEMGQ GITTLVAQIV ADEAGADWRK VAVEAAPISP AYADPVLSAK WAPLWMPAFA SLGNDAEGTL ARLHAERGPM MITADGTALA AFETPLREAG AALRAMMAQA AADKWGVGWE ECETGDSAVT HGKKRLSFAE LLADAVEYDP PDLPVLRAEP PRERPGQFPE GAPARHPRLD LPAKVDGSFT FAGDVRLPGM VHAAIAHAPQ GSAVLSTYDK QAAASVRGLV GVVHARRWLA AVATNWHAAD KALRAMEPRF RADGPVADNE KVLVALDKAL DKGDAVRLMA EGDPDALLEK PVLSARYDVE PALHAPLETT SATARLRDGK LELWIATQAP ERARRAAARA AGLSRQDVIV YPMHAGGSFD ARLDVRIAAE VATIATIIRK PVQLTWSRWQ ESLAGIPRTP VSARLDGALS PDKSRVLGWR SRLALPATTI ESGARLLDGQ GIGDALDLQD RADPMACEGA MPLYRIPEKA VDHVPAALPL PTARFRGQAH GYTAFFTESF VDELAHLAGR EPLSFRVGML DGQPRLVACL SGVARLAQWG GGVDASGQGI ACHRMDLASG GGAVRSGMIA VVATARQEAG VVRVERLSAF VDIGRIVNMD IARQQIEGGL VFGLAHAVGG SSGHARGRPL AGHLSQLGLP LLADCPKVDI AFADSNEEPF DPGELGMVAV APAIANALFS ATGVRFRRLP LISEGL
|
| |