Gene Saro_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1036 
Symbol 
ID3915818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1074419 
End bp1075405 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content69% 
IMG OID640443770 
ProductABC transporter related 
Protein accessionYP_496315 
Protein GI87199058 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID[TIGR01189] heme ABC exporter, ATP-binding protein CcmA 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCAGCG GCGAGGCATC GGCGGTACCT TCCGAAGCGT CGGGCGGTCC CCGACTGCGG 
GCCGAGCCGG ATCTGGTCAT CGATGTGCGC GGGCTGACCA AACGGTTCGG CGGACGGACA
GTGGTCGACG AGGTGAGCTT GCAGGTGGCG CGCGGGTCGA TCTGCGGGTT CCTCGGGCCG
AACGGGTCGG GCAAGACGAC GACGCTGCGG ATGATCTGCG GGCTGCTGAT CCCCGATGCG
GGCGAGGGCG AAGTGCTGGG GCTGGACCTG AGGCGCCAGC GGGGGGCGAT CAAGGGACGG
GTCGGGTACA TGACCCAGAA GTTCGGGCTG TTTTCCGATC TGACGATTGC CGAGAACCTC
GAGTTCTTCG CGCGGGTGCA CGGGCTGGAC CGGCGCCGGG AGCGGGTGGC CGAAGCCTTG
GAAGGGTTGG GTCTCGCCAC ACGTTCGGAT CAGCTGGCAG GCAAGCTTTC GGGCGGGTGG
AAGCAGCGAC TGGCGCTGGC GGCGGCGGTG CTGCACGAGC CGGAAATCCT GCTGCTCGAC
GAACCGACGG CGGGCGTCGA CCCGCAGGCG CGACGCGAGT TCTGGGACCA GATCCACGAT
CTTGCCCAGG GCGGGATGAC GGTGCTGGTG TCGACCCATT ACATGGACGA GGCCGAGCGC
TGCCACGAGA TTGCCTATAT CGCCTATGGC CGCATGCTGG CGCGCGGGAC GATCGGCGAG
GTCGTTGCCG GGTCCGGGCT CAAGGCGCTG ATGGGCGAAG GTCCGGGGGC GGACAGGCTT
GCCTCGCGGA TCGAGGGCCG TCCCGGCGTT GCCATGGCGG CCCCGTTCGG CACCGCGATC
CACGTCTGCG GGCCCGATAT CGCGGCGCTG CGTGCGGCGG TGGCGGAGTT CGACACGGTG
GCATGGCATG AGGCCCGGCC GAGCCTGGAG GACGTGTTCA TCCACCTGAT GCGCGGGGCC
GAGGACAATT CGGTGGTGGC GGCATGA
 
Protein sequence
MSSGEASAVP SEASGGPRLR AEPDLVIDVR GLTKRFGGRT VVDEVSLQVA RGSICGFLGP 
NGSGKTTTLR MICGLLIPDA GEGEVLGLDL RRQRGAIKGR VGYMTQKFGL FSDLTIAENL
EFFARVHGLD RRRERVAEAL EGLGLATRSD QLAGKLSGGW KQRLALAAAV LHEPEILLLD
EPTAGVDPQA RREFWDQIHD LAQGGMTVLV STHYMDEAER CHEIAYIAYG RMLARGTIGE
VVAGSGLKAL MGEGPGADRL ASRIEGRPGV AMAAPFGTAI HVCGPDIAAL RAAVAEFDTV
AWHEARPSLE DVFIHLMRGA EDNSVVAA