Gene Saro_2147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2147 
Symbol 
ID3918812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2289255 
End bp2290436 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content67% 
IMG OID640444902 
Productsecretion protein HlyD 
Protein accessionYP_497420 
Protein GI87200163 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTGG AAACTGGAAC GGATCCCAAA CGTCTCCTGA TCGGAGCGGC AGTGGCAACC 
CTTATCGTCG GCACTGCGGG TATCATGCTC GGCCGCACCG TGCTTGCCCC CTCTCCAGCC
TCGACAGAGG CGGGGCCATC GGGCGAGGCG GAAGAAGAAG GCCACGTCGA AGGCCTGGTC
GAGATGGACG CCAAGCGTGC TGCATCGGCA GGCATTGTTA CCGAGACCGC GCAGGCTGGT
TCCCTCGGTG CCGAAATCCT CGCACAGGGC GTCGTCGCCC CGACGCCGGA TGGCGAAGCA
ATTCTCACTG CCCGCGCCGA TGGCGCGGTC GTGCGGATCG CCAAGGGTCT CGGCGACGCG
GTAGCTGCTG GCGAAACCAT TGCCTGGCTG GAGAGCCGGG ACGCAGCGGC GATTGCCGCT
GAGCGGAGTT CCGCCGCAGC GCGTGTTGCG CTCGCCCGAT CGACCTTCGA GCGCGAACGG
CGACTCTATG AGGCTAAGGT TACCGCGCGG CAAGATTTCG AAGCCGCCCG CGCCGCGCTG
GCTGAGGCGG AAGCCGAGAT GCGACGCAGC CAGTCGGCGG CGAGCGCGTC GAAAGTGTCC
GGCGATGGCC GGACCCTTGC CGTCACCAGT CTGATTGCGG GGCGGATCAC CAAGTCGGAT
GCGCGGCTCG GCGCCTACGT TTCCGCCGGC ACGGAGCTTT TCCGAGTCGC CGATCCTCGC
CGAATCCAGA TCAACGCCTC GGTGCTACCC GCCGATGCCC GCCGCGTCTC GCCTGGCGAC
CGCGCAGTCG TCGAGCTAGT CGGCGGGGAA ACCGTCGGTG CCACAGTTCG CTCGGCAACG
CCCAGCCTCG ATCCGGAAAG CAAGGTCGCG ACCCTCGTCC TCGTGCCGGA CAGCGGCGCT
CAACTCACTC CGGGCCAGGG GCTGCGTGTG CGGATCACCC CGCGTAATGC TGTTGCCACT
TCAAGCATCG GCCTGCCGGA CGAAGCGGTT CAGTCGGTCG AAGGGCGCGA TGTCGTCTTT
GTGAAGACCG CCAAGGGCTT CCAGGCCACG AACGTGACCG TGGGACAACG CAGCGCGGGC
CGTGTCGAGA TCGTTGCCGG TCTGAAGCCG GGCAGCGTGG TCGCGACGCG CGGCGCATTT
CTTCTGAAGG CCGAACTCGG CAAGGGCGAG GCGGAGCATT GA
 
Protein sequence
MDLETGTDPK RLLIGAAVAT LIVGTAGIML GRTVLAPSPA STEAGPSGEA EEEGHVEGLV 
EMDAKRAASA GIVTETAQAG SLGAEILAQG VVAPTPDGEA ILTARADGAV VRIAKGLGDA
VAAGETIAWL ESRDAAAIAA ERSSAAARVA LARSTFERER RLYEAKVTAR QDFEAARAAL
AEAEAEMRRS QSAASASKVS GDGRTLAVTS LIAGRITKSD ARLGAYVSAG TELFRVADPR
RIQINASVLP ADARRVSPGD RAVVELVGGE TVGATVRSAT PSLDPESKVA TLVLVPDSGA
QLTPGQGLRV RITPRNAVAT SSIGLPDEAV QSVEGRDVVF VKTAKGFQAT NVTVGQRSAG
RVEIVAGLKP GSVVATRGAF LLKAELGKGE AEH