Gene Saro_0437 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0437 
Symbol 
ID3917583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp475412 
End bp477487 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content65% 
IMG OID640443166 
Productoligopeptidase B 
Protein accessionYP_495719 
Protein GI87198462 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1770] Protease II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.365567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGCGAA TGACTCTTGC CACTCCTCCC GTCGCGGCAA GGAAGCCGCA CAGCTTCTCG 
CACCACGGCC TGACCGTTTC CGACGACTAT TTCTGGCTGC GCGATCCCGG CTATCCGGAC
GTGACCGACA AGGACGTGCT GGCCCATCTG GAAGCCGAGA ACGCCTGGTT CGAGACGCGC
ATGGCCGCGC AGAAGCCGGT GATCGACAAG CTGTTCAAGG AAATGCGCGG GAGGATCAAG
GAAGCGGACA AGTCCGTGCC GCAGAAGGAC GGCAATTTCC TCTACTGGAT CGAGTACGAG
GACGGCGCCG AATACAAGAA GTGGTGGCGC CGCCCGGTCG GTGCGCCCGA TGACGGCAGC
GCCGACGAAC TGATCCTCGA CGAAGTGGCG CTGGCCGAGG GCAAGGAATA CTTCCGCCTC
GGCGCCATCG CGGTCTCCAA CGACGGAACG CGCCTCGCCT GGTCGGTTGA CGACAACGGA
TCGGAGCGCT TCACCGCGCG GATCAAGGTC ATCGCCACCG GCGAGATCCT GCCCGACGAG
ATTCCGGGCA CGCTGTCAGG CCTGATCTGG GTGAAGAACG ACACCGGGCT CGTCTATTCG
CTCGCCAACG AGAACTGGCG CACCGACAAT GCGCGGCTGC ACTGGATCGG CCAGCCGCTG
GAAAGCGACG TGGAACTCTA TCATGAGGAC GATGAAGGGT TTCGCGTCAG TGCGGCGCTT
TCGGCCAACG AGAAGTGGCT CATCATCGCC ACCGGCGACC ACGAGACCGG CGAGGTTCGC
CTCGTGCCCG CCAACGACCC GCTGGCGCCG CCGCTGCTGG TCAAGCCCCG GCAGAGAGGC
GTCGAGTACG ACGTGGATGA GCGCGAGGGC GTGCTCTACA TCCATACCAA CGACACCCAC
GAGAATTTCC GCCTCGCCAC TGCGCCGCTC TCTGACCCGG GCAACTGGAC GACGCTGATC
GAGGGCACGC AGGACTTCTA CCTGACCGGG TTCGAGCTGT TCCGCGACTT CTACATCGTG
GAAGGTCGCG TTCGCGGCCT CGATCGCATC GAGGTGCGCT ACTACGATGA TCCGACCAGG
ATCGAGCCCA TCGAATTTCC CGAGGCGAGC TACGAAGCCT CGCTGGGCGA CAATCCCGAA
TGGGCGATGC AGGTGCTGCG GGTCGGCTAC GAATCGATGG TCAGCCCTTC GTCCGTATTC
GACTACGACG TCGCAACCCG GCACCTGACG CTGCTCAAGG TGCAGGAAAT CCCGAGTGGC
TACGACGCCT CGCTCTACGA GACGACCCGA CTGGAGATCG CGGCGCGCGA CGGCACGATG
GTCCCGGTCA GCGTGGTCTG GCGCAAGGAT CGTCAGCCGG GCGGGCCGCT GCATCTCTAT
GGCTACGGCG CCTATGGCAT CGCCATCGGC CCAGGCTTCT CGACCACCCG CCTCAGCCTG
GTCGACCGGG GCTTCGCCTA TGCCATCGCC CATATCCGCG GCGGCGACGA CCTTGGCCGG
GCATGGTACA AGGCCGGCAA GCTCGAGGCG CGGACCAACA CGTTCAACGA CTTCGTCGAC
GTGGCGAAAG GCCTGATCGA GCGCGGCTTC ACCGAGGCCG GAAAAATCAG CATCTCGGGC
GGATCGGCCG GCGGCGAGCT GATGGGCGCG GTGATCAATT CCGACCCCGA CCTGTGGGGC
GCGGTCGTGG CGCACGTTCC CTTCGTCGAT GTCCTTGCGA CCATGCTCGA CGAGGATCTC
CCGCTGACCC CGGGCGAATG GCCGGAATGG GGCAACCCGA TCGAGGACAA GGCGGCTTTC
GAACTGATCC GGTCCTACTC GCCCTACGAT CAGGTCAAGC CGCAGGCCTA TCCGCCACTC
ATGGTCACCG CCGGGCTGAA TGACCCGCGC GTGACCTACT GGGAGCCGGC CAAGTGGGTG
GCGAGGCTGC GCGAGCTGAA GACCGACGAG AACGAGCTGA TCCTCAAGAC CAACATGGGC
GCGGGCCACG GCGGCAAGTC GGGCCGGTTC GAGAGCCTGA AGGAGACGGC GGAGGAATTC
GCCTTCATCC TGTGGCAACT GGGCGTCGCG GCGTGA
 
Protein sequence
MQRMTLATPP VAARKPHSFS HHGLTVSDDY FWLRDPGYPD VTDKDVLAHL EAENAWFETR 
MAAQKPVIDK LFKEMRGRIK EADKSVPQKD GNFLYWIEYE DGAEYKKWWR RPVGAPDDGS
ADELILDEVA LAEGKEYFRL GAIAVSNDGT RLAWSVDDNG SERFTARIKV IATGEILPDE
IPGTLSGLIW VKNDTGLVYS LANENWRTDN ARLHWIGQPL ESDVELYHED DEGFRVSAAL
SANEKWLIIA TGDHETGEVR LVPANDPLAP PLLVKPRQRG VEYDVDEREG VLYIHTNDTH
ENFRLATAPL SDPGNWTTLI EGTQDFYLTG FELFRDFYIV EGRVRGLDRI EVRYYDDPTR
IEPIEFPEAS YEASLGDNPE WAMQVLRVGY ESMVSPSSVF DYDVATRHLT LLKVQEIPSG
YDASLYETTR LEIAARDGTM VPVSVVWRKD RQPGGPLHLY GYGAYGIAIG PGFSTTRLSL
VDRGFAYAIA HIRGGDDLGR AWYKAGKLEA RTNTFNDFVD VAKGLIERGF TEAGKISISG
GSAGGELMGA VINSDPDLWG AVVAHVPFVD VLATMLDEDL PLTPGEWPEW GNPIEDKAAF
ELIRSYSPYD QVKPQAYPPL MVTAGLNDPR VTYWEPAKWV ARLRELKTDE NELILKTNMG
AGHGGKSGRF ESLKETAEEF AFILWQLGVA A