Gene Saro_2383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2383 
Symbol 
ID3915728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2544608 
End bp2546482 
Gene Length1875 bp 
Protein Length624 aa 
Translation table11 
GC content67% 
IMG OID640445138 
Producthemolysin activation/secretion protein-like 
Protein accessionYP_497653 
Protein GI87200396 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2831] Hemolysin activation/secretion protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.418836 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGATC GTCTTGGTGC GCAGCCCGAT TTCGGGGCAG TTTCGGTCGT TCCGGTCAGG 
AGCGTTGGGC GTGGTGCGCA ATGGCTGCGC GGGCGCAGCG CATTGCTGCT GGTCGCGAGT
GGCGTTGCGT GGTCCGCAAT GGCTTCGGCG CAGACCGTGC CGCCGCAGGG CGTGCAGCCG
CCGACGCGCG AGGAGATCGA GCGCGGCGTT GCCGAGGGTA CGCTGAAGCG TGGCGGGCCG
GTTTCGGTCG ATACATCCGA AGTTGAACGG GCCCCGTGCC CGCTGGCTGC CCCCGACTTC
GCGGGCATCC GCCTGAAGCT GCATTCGGTG ACGTTCACCG GCATGCAGGA AATTCCCGGC
TTCGACCTTT CCGGCAGCTA TGCCGAGTTC GTCGGGACCG ACCAGCCGGT CGCCGTGATC
TGCGAGATCC GCGACCGGGC GGCGACTGCG CTACGCCAGG CGGGCTATCT TGCCGCGGTG
CAGGTGCCGC CGCAGAAGAT CGAGGGCGGG GCGGTCCGGC TCGACGTGCT GCTGGCGCAT
CTCAAGCGGG TTCAGATCAA GGGCGATGCA GGGGCTTCGG AAGGTATCCT TCTGAAGTAC
CTGAACAAGC TGACGCGCGA TCCGGTTTTC AACACGCACG AGGCAGAGCG CTATCTTCTG
CTCGCCAAGG ACGTGCCGGG TCTCGACGTG CGCATGGCCC TTCGGCCGGT CGAAGGTGCG
CCGGGCGAGG TGATCGGCGA AGTGTCCGTG CGCCGGGTGC CGGTCTATGC CGAGCTTGGC
TTGCAGAACT ACGGTTCGCG GGCGGTGGGG CGCTACAGCG GTCTGGCGCG GGTCCAGATC
AACGGCCTGA CGGGCCTTGG CGATGCGACA ACGGCAAGCT TCTTCGCGAC GACCGACCTG
GAGGAGCAGA AGGTCCTGCA GATCGGTCAC GAAATGCGCC TTGGCGGGGA AGGCTTCTCG
CTGCGGGGCG ATTTCACCTA TGGCTGGACC AATCCGACGG TGACAGGGGC CGCGTCGGAC
TTCCACTCGC GGACGCTGTC TGCATCGCTG GAAGGAAGTT ACCCGGTGGT CCGGTCGCAG
GCCTACAACA TGGCCGTGTC GCTCGGTGCC CAGATCGCGG ACCAGGATCT CGATTTCGGG
AAGATACCGC TCAACCGCGA CAGACTGCGC GTGCTCTATG CGCGGGTCGA TACCAACGGC
GTTTCCCGGA AAAGCCTGAC CGGCCGCGAC GGCTTCACAC CGTTCGAACC GCATTGGGCC
TGGGGACTTT CGCTCGAGGC GCGCCAGGGG ATCGACGTGT TCGGGGCGAC GAAGGGTTGC
CAGGGCGCGC TGGCGCCGAC CTGCACGGGT TTCGGCAAGG TCACGCCCAG CCGCATCGAG
GGCACGGCCA AGGGCTTTGT CCTGCGCGCT GCGGGCGTGC TCGACTATCG CCCGGTTCGC
GGCCTGACGT TGAGCGTCCA GCCGCGGGCG CAGTGGTCGC CGGACAAGCT TCTGTCGTAT
GAGGAGTTTT CGGGCGGCAA CTATACCATC GGCCGAGGTT ACGATCCGGG CGCGGTGATT
GGCGACAGCG GGGTTGGGGT GCGCGGCGAG GTTCGCGTCG GGTCCTTGCT GCCCAAGGTC
GCGGGCGGGA ACGCGATCCA GCCCTATGCC TTCGCCGATG CTGCATGGGT CTGGAACAAC
GACACCGCGT TCGACGGGCT CGACCCGCAG AAGGTCGTGT CGGTGGGCGG GGGCCTGCGC
GCCGCGATCC ACGATGCGTT GCGCCTCGAC GCCGGGGTGG CCGTGCCGCT GCACGATCCG
CTCGGCCTGA ATGTGAAGGG CAAGGCCCGG TTCATGCTCA ACCTTTCGTT CCAGCTCCTG
CCGTGGAGGC TGTAA
 
Protein sequence
MSDRLGAQPD FGAVSVVPVR SVGRGAQWLR GRSALLLVAS GVAWSAMASA QTVPPQGVQP 
PTREEIERGV AEGTLKRGGP VSVDTSEVER APCPLAAPDF AGIRLKLHSV TFTGMQEIPG
FDLSGSYAEF VGTDQPVAVI CEIRDRAATA LRQAGYLAAV QVPPQKIEGG AVRLDVLLAH
LKRVQIKGDA GASEGILLKY LNKLTRDPVF NTHEAERYLL LAKDVPGLDV RMALRPVEGA
PGEVIGEVSV RRVPVYAELG LQNYGSRAVG RYSGLARVQI NGLTGLGDAT TASFFATTDL
EEQKVLQIGH EMRLGGEGFS LRGDFTYGWT NPTVTGAASD FHSRTLSASL EGSYPVVRSQ
AYNMAVSLGA QIADQDLDFG KIPLNRDRLR VLYARVDTNG VSRKSLTGRD GFTPFEPHWA
WGLSLEARQG IDVFGATKGC QGALAPTCTG FGKVTPSRIE GTAKGFVLRA AGVLDYRPVR
GLTLSVQPRA QWSPDKLLSY EEFSGGNYTI GRGYDPGAVI GDSGVGVRGE VRVGSLLPKV
AGGNAIQPYA FADAAWVWNN DTAFDGLDPQ KVVSVGGGLR AAIHDALRLD AGVAVPLHDP
LGLNVKGKAR FMLNLSFQLL PWRL