Gene Saro_1037 details


Gene Information

Locus tag: Saro_1037
Symbol:
ID: 3915819
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1075402
End bp: 1076550
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 66%
IMG OID: 640443771
Product: hypothetical protein
Protein accession: YP_496316
Protein GI: 87199059
COG category: [V] Defense mechanisms
COG ID: [COG0842] ABC-type multidrug transport system, permease component
TIGRFAM ID: [TIGR00025] ABC transporter efflux protein, DrrB family
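
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1075402
end_bp = 1076550
gene_length_bp = 1149
protein_length_aa = 382


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of 3.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value against the value listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions, both comparisons hold for this record: 1076550 - 1075402 + 1 = 1149 bp, and 1149 / 3 - 1 = 382 aa.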


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.209314
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTGGC GGGCCGCTCT CGCGCGGATC GCCGCTATGG TGTTCAAGGA AGTGCGCCAG 
ATGGTGCGCG ACCCCGGCAC GATCGGCATG ATGCTGATGA TGCCGATCGT GCAGCTCACG
ATCTTCGGCT TTGCGATAAA CAACGATCCC CGCCGCCTCC CAATGGCACT TGAGATCGGC
GACAGCTCGC AGTTCGCGCG GAGCATCGAC GCCGCGCTTC GCAACACCAC CTACTTCCGC
GTGACGCACG TGGTGAGCGA GCCGGGCGAG GGCGAGCGGT TGCTGAAGGA CGGGGCGGTG
CAGTTCCTCG TGACCGTGCC TGCCGACTTC GGCCGCGACC TGGTGCGCGG CGACCGGCCA
CAACTGCTGG TGACGGCGGA CGCGACGGAC CCGGCCTCGA CCGGAAATGC CATCGGCGCG
ATCCAGCAGG CGGTCGATGG AGCGCTGGCG CAGGACCTGA TCGGGCCGCT GGCCTTGCGC
GGGCAGGCCA TCGGGCCGGG CGCCTCACCA GTCGATCTGG TGATCCATCG CAGCTACAAT
CCCGAAGGCA TCACCAGCCA CAATACCGTG CCCGGCCTGC TGGCCATCGT GCTGTCGATG
ACGATGGTCA TGCTGACGGC GCTTTCGGTG ACGCGCGAGG TGGAGCAGGG CACGATGGAA
AACCTGCTGG CGACGCCGCT GCGCCCGTTC GAGGTGATGG TGGGCAAGAT CGTGCCCTAT
CTCGTGATCG GCGTGGTGCA GATGGCGGTG ATCCTGCTGG CGGCGCAGGT GGTGTTCGAT
GTGCCGTTCG AAGGATCGGT GACACTGACG CTGGGATCGA CGCTGCTGTT CATGGTCACC
AGCCTCGCGC TCGGCTTCAT GCTCTCGACA ATTGCCGCCA GCCAGCTCCA GGCGATGCAG
ATGAGCTTCT TCTACATGCT GCCGTCGATC CTGCTGTCGG GCTTTGCCTT TCCGTTCAGG
GGAATGCCGG TTTGGGCGCA GTGGCTGGCC GAACTCCTGC CGACCACCCA CTACATCAGG
CTGGTGCGCG GCATCATGCT AAAGGGCTGG ACGCTGGGCG ACGCCGCGTG GGAACTGGGC
GTGCTGGCGG TGATGCTCGG CGTGCTGGGG ACGGTGGCGG TGCGGCGGTA CCAGGACACG
GTGGCCTAG
 
Protein sequence
MKWRAALARI AAMVFKEVRQ MVRDPGTIGM MLMMPIVQLT IFGFAINNDP RRLPMALEIG 
DSSQFARSID AALRNTTYFR VTHVVSEPGE GERLLKDGAV QFLVTVPADF GRDLVRGDRP
QLLVTADATD PASTGNAIGA IQQAVDGALA QDLIGPLALR GQAIGPGASP VDLVIHRSYN
PEGITSHNTV PGLLAIVLSM TMVMLTALSV TREVEQGTME NLLATPLRPF EVMVGKIVPY
LVIGVVQMAV ILLAAQVVFD VPFEGSVTLT LGSTLLFMVT SLALGFMLST IAASQLQAMQ
MSFFYMLPSI LLSGFAFPFR GMPVWAQWLA ELLPTTHYIR LVRGIMLKGW TLGDAAWELG
VLAVMLGVLG TVAVRRYQDT VA
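
The relationship between the gene sequence and the protein sequence can be checked with a short script. This is a minimal sketch using only the Python standard library, not the method the database itself uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as starts). The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and only the first 60 bases of the gene are used as the example input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Amino acid assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and the first 20 residues of the protein.
gene_excerpt = clean(
    "ATGAAGTGGC GGGCCGCTCT CGCGCGGATC GCCGCTATGG TGTTCAAGGA AGTGCGCCAG"
)
protein_excerpt = clean("MKWRAALARI AAMVFKEVRQ")

print(translate(gene_excerpt) == protein_excerpt)
print(round(gc_fraction(gene_excerpt), 2))
```

To check the whole record, pass the complete gene sequence through `clean` and compare `translate` of the result with the cleaned protein sequence. The sketch does not handle alternative start codons such as GTG or TTG, which table 11 reads as methionine at the initiation position. That does not matter here because this gene starts with ATG.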