Gene Saro_3335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3335 
Symbol 
ID3915982 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3555912 
End bp3558212 
Gene Length2301 bp 
Protein Length766 aa 
Translation table11 
GC content71% 
IMG OID640446120 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_498604 
Protein GI87201347 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCTCA CGCGGAGGGG GCTGCTGGTG GGGGCAGGGC TCGGCGGAGC CCTGCTTATC 
GCCTTCCCGC TCATTCCGCG CCGCCACCCG GTTCCACTGC AGGCGGGCGA AGGCGAGCAC
GTGGTCGACG CCTTCCTCAA GCTGGGGCGT GCCCGGGGCG GCAAGGACTG CATCCTGACC
GTCGCGGTCC CGTTCTGCGA GATGGGACAG GGCATCACCA CGCTCGTCGC GCAGATCGTC
GCGGACGAGG CCGGGGCCGA CTGGCGCAAG GTCGCGGTCG AGGCCGCGCC GATCAGTCCC
GCCTATGCCG ACCCGGTGCT CTCGGCAAAG TGGGCGCCGT TGTGGATGCC GGCATTCGCT
TCGCTGGGCA ATGACGCCGA GGGCACGCTC GCCCGCCTTC ATGCAGAGCG CGGGCCGATG
ATGATTACCG CCGATGGCAC GGCGCTGGCG GCGTTCGAGA CTCCCTTGCG CGAGGCCGGG
GCAGCCCTGC GCGCGATGAT GGCCCAGGCT GCCGCGGACA AGTGGGGCGT GGGCTGGGAG
GAGTGCGAGA CCGGGGACAG CGCCGTGACC CACGGCAAGA AGCGCCTCTC CTTTGCCGAA
CTGCTGGCGG ATGCGGTAGA GTACGACCCG CCCGACCTGC CCGTCCTGCG CGCCGAGCCC
CCGCGCGAAA GGCCCGGCCA GTTTCCCGAA GGTGCCCCCG CCCGGCACCC GCGCCTCGAT
CTTCCAGCCA AGGTCGACGG CAGCTTCACC TTTGCCGGCG ACGTGCGCCT GCCGGGCATG
GTCCACGCCG CGATCGCCCA CGCGCCGCAA GGATCGGCGG TCCTGTCGAC CTATGACAAG
CAGGCTGCGG CCTCGGTGCG CGGGCTGGTG GGCGTGGTTC ATGCCCGGCG CTGGCTCGCC
GCCGTCGCCA CCAACTGGCA CGCCGCCGAC AAGGCCTTGC GCGCGATGGA GCCACGTTTT
CGCGCCGATG GCCCGGTGGC CGACAACGAG AAGGTCCTCG TGGCGCTCGA CAAGGCGCTG
GACAAGGGCG ACGCGGTGCG ACTCATGGCC GAGGGCGATC CCGATGCGCT GCTCGAAAAG
CCGGTCCTCA GCGCCCGCTA CGATGTCGAA CCGGCCCTCC ACGCCCCGCT CGAAACCACC
AGCGCCACCG CCCGCCTTCG CGATGGCAAG CTGGAGCTGT GGATCGCGAC CCAGGCTCCC
GAGCGAGCGC GCCGCGCCGC CGCACGCGCG GCGGGTCTCT CGCGGCAGGA CGTGATCGTC
TATCCGATGC ACGCGGGCGG CAGCTTCGAC GCTCGGCTCG ACGTGCGCAT CGCCGCCGAA
GTTGCCACCA TCGCGACCAT CATCCGCAAG CCCGTGCAAC TGACGTGGTC GCGCTGGCAG
GAATCGCTGG CGGGCATTCC CCGAACCCCG GTTTCGGCGC GGCTCGACGG CGCGCTCAGC
CCCGACAAGT CGCGCGTGCT GGGCTGGCGC AGCCGCCTCG CACTTCCCGC CACGACAATC
GAATCGGGCG CACGCCTGCT CGATGGGCAA GGCATCGGCG ATGCACTCGA CTTGCAGGAC
CGGGCCGACC CGATGGCCTG TGAAGGCGCG ATGCCGCTCT ACCGCATTCC CGAAAAGGCG
GTGGACCACG TCCCCGCCGC CCTGCCCCTG CCCACGGCGC GCTTTCGCGG ACAGGCGCAC
GGCTACACCG CATTCTTCAC GGAAAGCTTC GTGGACGAGC TGGCCCATCT TGCGGGGCGC
GAGCCACTGT CGTTTCGCGT CGGCATGCTC GATGGCCAGC CGCGCCTTGT CGCCTGCCTT
TCCGGCGTCG CCAGGCTGGC GCAATGGGGC GGCGGGGTCG ATGCATCGGG ACAGGGCATT
GCCTGCCATC GCATGGATCT TGCCTCGGGA GGCGGCGCGG TGCGTTCCGG CATGATCGCG
GTCGTCGCCA CCGCACGGCA GGAAGCCGGC GTCGTGCGGG TCGAGCGGCT GAGCGCCTTT
GTCGACATCG GCCGCATCGT GAACATGGAT ATCGCGCGCC AGCAGATCGA GGGCGGCTTG
GTGTTCGGCC TCGCCCATGC GGTCGGCGGG TCGAGCGGCC ACGCTCGCGG AAGGCCGCTG
GCGGGCCATC TCTCGCAACT CGGCCTGCCT CTGCTGGCCG ATTGCCCGAA GGTCGATATT
GCCTTTGCCG ACAGCAACGA AGAGCCGTTC GACCCCGGCG AACTCGGAAT GGTCGCGGTG
GCCCCGGCCA TCGCCAATGC GCTGTTCTCC GCCACGGGCG TGCGCTTCCG CCGCCTGCCG
CTCATTTCCG AAGGACTTTG A
 
Protein sequence
MRLTRRGLLV GAGLGGALLI AFPLIPRRHP VPLQAGEGEH VVDAFLKLGR ARGGKDCILT 
VAVPFCEMGQ GITTLVAQIV ADEAGADWRK VAVEAAPISP AYADPVLSAK WAPLWMPAFA
SLGNDAEGTL ARLHAERGPM MITADGTALA AFETPLREAG AALRAMMAQA AADKWGVGWE
ECETGDSAVT HGKKRLSFAE LLADAVEYDP PDLPVLRAEP PRERPGQFPE GAPARHPRLD
LPAKVDGSFT FAGDVRLPGM VHAAIAHAPQ GSAVLSTYDK QAAASVRGLV GVVHARRWLA
AVATNWHAAD KALRAMEPRF RADGPVADNE KVLVALDKAL DKGDAVRLMA EGDPDALLEK
PVLSARYDVE PALHAPLETT SATARLRDGK LELWIATQAP ERARRAAARA AGLSRQDVIV
YPMHAGGSFD ARLDVRIAAE VATIATIIRK PVQLTWSRWQ ESLAGIPRTP VSARLDGALS
PDKSRVLGWR SRLALPATTI ESGARLLDGQ GIGDALDLQD RADPMACEGA MPLYRIPEKA
VDHVPAALPL PTARFRGQAH GYTAFFTESF VDELAHLAGR EPLSFRVGML DGQPRLVACL
SGVARLAQWG GGVDASGQGI ACHRMDLASG GGAVRSGMIA VVATARQEAG VVRVERLSAF
VDIGRIVNMD IARQQIEGGL VFGLAHAVGG SSGHARGRPL AGHLSQLGLP LLADCPKVDI
AFADSNEEPF DPGELGMVAV APAIANALFS ATGVRFRRLP LISEGL