Gene Saro_3501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3501 
Symbol 
ID5077650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp107072 
End bp108991 
Gene Length1920 bp 
Protein Length639 aa 
Translation table11 
GC content70% 
IMG OID640481225 
Productfusaric acid resistance protein region 
Protein accessionYP_001165887 
Protein GI146275727 
COG category[S] Function unknown 
COG ID[COG1289] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGCGG CGCGCCGGTT CGCGCTGGCC TATTCAGCCA AGACAGCGGC GGCGGCGCTG 
CTGGCGCTTT GGATCAGCCT GTGGGTCGGC CTGTCGATGC CGTTCTGGGC GATGACCACG
GCCTATATCG TCAGCAGCCC CATGTCCGGC GCGACGCGCT CCAAGGCGGT CTACCGGGTG
GGTGGTACGG TGCTGGGCGC GGCGGTGGCG GTGGCGCTCG TTCCTGCGCT GGTCGACTGG
CCCGAATTGT TGAGCCTTGC AATCGCACTA TGGCTGGGCG GTTGCCTGGC CGTCGCGCTG
CTTGACCGCT CGCCGCGTGC CTATGTGGTG ATGCTGGCAG GGTATACCGC AGCGCTTGTC
GCTTTCCCGG CGGTCGACCG GCCGGATGCG GTGTTTTCCA TCGCCGTCGC GCGCGTGACC
GAGATCGCGC TCGGCATCGG ATGCAGCACG GTCGTCCACA GCCTGTTCTG GCCCCGCTCG
GTCGCAGAGG CGATGCAGCC ACGCCTGCGC GCGTGGCTTG CCGATGCGCG GCAATGGCAC
GGCGATATCG TCGGCGGCAG CGACAATGCC AGGTTGCTTA CGGACAAACG CAGGCTTGCC
GTCGACGCCA TGGACTGTGC GCTGCTGGCG ACGCATGTGC CGTTCGACAC CTCGCACTGG
CGCGAGGCGA CGGCGACCTT GCAGGCCCTG TTGCGCCGGA TGCTGCTGCT GTTGCCGGTG
CTGTCGGGCC TTGCGGATCG CAAGGCGGCG CTGGACGGTG AAGGCGACGA GGGCAGGGAT
GGGGCGACCT GGGCCATGCT CCTCCGGGAA AGCCTTGCCC AGCGCGATGG CGAGGCGCGC
ACGCTGCTCG GCGAATGCGA TGCACTGCTG GCGCATCTGG CGGACCCGGC ATCGCCGCGC
CCCGACCTGC CGGATTGGCG CGAGGGCGCG GTCAGGTTTC ACGCCGAGCC CGCGGGCGCG
ATCCTGTCCG GCGCATCTGC GCTGGTGGCA ACGCTTGCGG CCTGCGCCCT ATGGATCTTT
ACCGGGTGGG CGGACGGCGG CGTCGCGGCG GTGCTGACCG GCATCTTCTG CTGCCTGTTC
GCCGCGCAGG ATAACCCGGT GCCCGCCATC CTCTCGTTCG GCGGGGCCAT CGTGGCGGGC
ATTCCGATTG CGGCGCTGTA CCTCTTCTTC GTTCTGCCGG GCGTGGACGG GTTCGCGGCG
CTGGCGCTCC TGCTGGCAAT ACCGCTCGTC GCCATCGGCG CGTTGATGAC GCACCCCCGC
CTTGGCCTGC CGGCGATGGC GTGCCTTGTC GGCTTCTGCA GCGCGATGGC GATACAGGAG
GAATACGTCG CCGATTTCGC GCGCTTCCTC AATTCCAACC TCGCGCAGAT CGTGGCGGTG
ATCCTTGCCG CCGGGACCAC GGCATGCTTC CGGATGGCTG GCGGCGACGT TGCCATCGCG
CGGCTGAACC GGCGCATGCA GCGGGGGCTG GTGGACATTG CCCGCGCCCC TTCCGCACCC
GATCCGCTGG CGACGCTGAG CCGCGTGACC GACCAGCTCG CGCTTATCGC CCAGAGGCTG
GGCGGGGCGA CCGACGCCGC GTCGATGGGG CTTGGCGAAG TGCGCCTCGC GATGAATCTC
GTCTCGATCC AGAGGCTGCG GGCGTCGTCT TCGGGGCCGC TGCGCGCCGC GCTCGACGAT
GTGCTGGAAG AAGCGGCGCA CTGGTTCGCC GCGCCGCCCA CGGCCGAGGG ACCGTCGCGG
CGGATGCTGG ACCGGCTTGA CGGTGCGTTG CGCCTGACGC TGGCCAATCC GCCACCGCGC
CCGGGCGGGC TGGAACACCT GTTCCGCCCA GGCCCCGACC AGGGCCGCCC CGCGCTCGTC
GCCCTGCGGC GCAGCCTCTT CTCCCGGGCC GAGCCGTTTT CAGCAGGAGC CTCCGCATGA
 
Protein sequence
MTAARRFALA YSAKTAAAAL LALWISLWVG LSMPFWAMTT AYIVSSPMSG ATRSKAVYRV 
GGTVLGAAVA VALVPALVDW PELLSLAIAL WLGGCLAVAL LDRSPRAYVV MLAGYTAALV
AFPAVDRPDA VFSIAVARVT EIALGIGCST VVHSLFWPRS VAEAMQPRLR AWLADARQWH
GDIVGGSDNA RLLTDKRRLA VDAMDCALLA THVPFDTSHW REATATLQAL LRRMLLLLPV
LSGLADRKAA LDGEGDEGRD GATWAMLLRE SLAQRDGEAR TLLGECDALL AHLADPASPR
PDLPDWREGA VRFHAEPAGA ILSGASALVA TLAACALWIF TGWADGGVAA VLTGIFCCLF
AAQDNPVPAI LSFGGAIVAG IPIAALYLFF VLPGVDGFAA LALLLAIPLV AIGALMTHPR
LGLPAMACLV GFCSAMAIQE EYVADFARFL NSNLAQIVAV ILAAGTTACF RMAGGDVAIA
RLNRRMQRGL VDIARAPSAP DPLATLSRVT DQLALIAQRL GGATDAASMG LGEVRLAMNL
VSIQRLRASS SGPLRAALDD VLEEAAHWFA APPTAEGPSR RMLDRLDGAL RLTLANPPPR
PGGLEHLFRP GPDQGRPALV ALRRSLFSRA EPFSAGASA