Gene Saro_1989 details


Gene Information

Locus tag: Saro_1989
Symbol:
ID: 3917309
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2117429
End bp: 2119192
Gene Length: 1764 bp
Protein Length: 587 aa
Translation table: 11
GC content: 66%
IMG OID: 640444741
Product: hemolysin activation/secretion protein-like
Protein accession: YP_497263
Protein GI: 87200006
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG2831] Hemolysin activation/secretion protein
TIGRFAM ID:
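
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; neither is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(2117429, 2119192)
print(length)                     # 1764
print(protein_length_aa(length))  # 587
```

Under these assumptions both values match the Gene Length and Protein Length fields of the record.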


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0548358
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAGTGCGA AGTGCGTGGG GAAGACCCTG CTGCGAGGGG CGGCGGGGAT CGCTGGCGTT 
GTTCTCGCGG CGATCCCGCT TGCGGCGAGT GCGCAAAATC TTCCTGGGCA GCCGGGCACG
GGCATTCCGT CACGCGATGA GCTCAAGGGC CTGACGGGGC AGGCTGCTGC GGCGCCGTCG
CGCCTTTCGA TAAAGGGCGG CATCGAGCGC TCCCCCTGTC CGCTCGACGA TCCGCAGTAC
GCAGACATTC GCGTCACGCT GAACAACGTC ACCTTCGCCG GTCTCAAGGA GATCGATCCG
GCCGAGCTTG CCGAGACCTG GAAGCCGCTG GTGGGGACGT CGCAGCCGAT TGCGGTGCTG
TGCGAGATCC GCGATGCGGC AGGCACGATG CTGCGCAACA AGGGATACCT TGCCGCAGTG
CAGGTCCCGA CGCAGAAGAT CGACAACGGC GAAGTGCGGA TGGAGGTCAT CTATGCGCGC
ATCACTACCG TACGGGCACG TGGCGAGACC CGGGGCGCGG AGCGAAAGCT CGAGCAGTAT
CTCTCGGCGC TGACCGGGGA CGAGGTGTTC AACCAGCGGC GGGCGGAACG ATACCTCCTG
CTGGCGCGTG ACCTGCCGGG CTACAACGTC CAGCTCACGC TCAAGCCTGC AGGGACCGGC
GCCGGCGATC TGGTCGGCGA GATCAGCGTC CTGCGCGAGC CTTATGCCGT CGACCTTACC
TTGCAGAACC TGTCTTCAGA AGCGACCGGC CGGTTCGGCG GGCAGATTCG CGCGCAGTTC
TTTGGCCTCA CGGGCATGGG CGATGCGACG ACCCTGTCCT ATTACGCCAC TTCGGATTTC
AGCGAACAGC ACATCCTGCA GGGGGCGCAC GAGTTTCGTC CGGGCAAGGA AGGGCTGATC
GTTGGGGGCC AGCTTACCTA CGCCTGGACC AAGCCTGACG TGCCGGGGCT TCCTGCCAAT
GTCAGCATCG ACGCGCGCAC CCTCTATGCC AGCCTCTATG CCCGTTTCCC GCTGCAGCGC
TCGCTTGCGC GAAACGTCTG GCTAAGCGGC GGGCTGGACC TGGTGAACCA GGACGTGGAC
CTCATCGGAC CGCTCACTCG CGACAAGGTG CGCGTGGGGT GGGCGCGGCT TGACGTCGAT
GCCGTGGATA CCGGACATGC CGCGCCGCAA TGGCGCATCG GGTCTTCGTT CGAAGTGCGG
CAGGGGCTGG ACATCCTCGA TGCGACGCGC GGTTGCATTG GCGCCGCCTG CGCAACACAG
ACGCCGACCA GCAGGTTCGA CGGATCACCG GCCGCGACGG TGCTGCGCTG GCAGGGCGAG
TACGAACGTG CCTTCGGCAG GTTCTCGGTG CTTGTGGCGC CGCGTGCGCA ATATGCGTTC
AAGCCACTGC TCAGCTTCGA GGAATTTTCG GCCGGCAACT ATACCGTCGG CAGGGGTTAT
GATCCGGGCG ACCTGATCGG TGACAGTGCC GTTGGGACGA GCGTGGAAGT GCGCGGCCCC
CGGCTTCCGA TAGGCGAGAG CCGCGACATC CGCATCCAGC CTTTCGTGTT CGGCGATTCC
GCATGGGTCT GGAACAAGGA CGTGCCGGGA TCGGAACGGC TGACGTCATT GGGCGGAGGC
CTGCGCGGCG ATATCGGCGC CCGCTTCAGG ATCGAGGCGA CGCTGGCCGT CCCGCTGGAG
AAGGTGGCGC TTCAGGTCCG CCGTTCCGAC CCGCGATTCC TGGTGACCCT TACCACGCGG
CTGCTGCCGT GGAGGACCTT CTGA
 
Protein sequence
MSAKCVGKTL LRGAAGIAGV VLAAIPLAAS AQNLPGQPGT GIPSRDELKG LTGQAAAAPS 
RLSIKGGIER SPCPLDDPQY ADIRVTLNNV TFAGLKEIDP AELAETWKPL VGTSQPIAVL
CEIRDAAGTM LRNKGYLAAV QVPTQKIDNG EVRMEVIYAR ITTVRARGET RGAERKLEQY
LSALTGDEVF NQRRAERYLL LARDLPGYNV QLTLKPAGTG AGDLVGEISV LREPYAVDLT
LQNLSSEATG RFGGQIRAQF FGLTGMGDAT TLSYYATSDF SEQHILQGAH EFRPGKEGLI
VGGQLTYAWT KPDVPGLPAN VSIDARTLYA SLYARFPLQR SLARNVWLSG GLDLVNQDVD
LIGPLTRDKV RVGWARLDVD AVDTGHAAPQ WRIGSSFEVR QGLDILDATR GCIGAACATQ
TPTSRFDGSP AATVLRWQGE YERAFGRFSV LVAPRAQYAF KPLLSFEEFS AGNYTVGRGY
DPGDLIGDSA VGTSVEVRGP RLPIGESRDI RIQPFVFGDS AWVWNKDVPG SERLTSLGGG
LRGDIGARFR IEATLAVPLE KVALQVRRSD PRFLVTLTTR LLPWRTF
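
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be cleaned and compared against the GC content and translation table fields. The helper names are hypothetical, and the example input is only a fragment of the record, not the full gene.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in NCBI translation table 11


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence; uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustrative input only: the first two blocks and the last line of the gene
# sequence. Paste the whole "Gene sequence" block to check the full record.
fragment = clean_sequence("ATGAGTGCGA AGTGCGTGGG\nCTGCTGCCGT GGAGGACCTT CTGA")
print(fragment[:3], fragment[-3:])  # ATG TGA
```

Applied to the complete gene sequence, `len(clean_sequence(...))` would be expected to equal the Gene Length field and `gc_fraction(...)` to round to the GC content field, provided the record is internally consistent.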