Gene Saro_2843 details

Gene Information

Locus tag: Saro_2843
Symbol:
ID: 3915482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 3067432
End bp: 3068391
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 66%
IMG OID: 640445622
Product: hypothetical protein
Protein accession: YP_498113
Protein GI: 87200856
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component
TIGRFAM ID:
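
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 3067432
end_bp = 3068391

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 960 319
```

Under these assumptions the result agrees with the listed gene length (960 bp) and protein length (319 aa).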


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAACGC GCGCAAACCA CATCTGGGTC GGTCTCGTCA CCCTGCTGCT TCTGGCTGGC 
ACGGCCCTGC TGACCGTGTG GATCGCCCGC ATCAACCAGG GCGATCTCCA TGAATACGAC
ATCTTCTTCA AGCAGTCGGT CGATGGCCTG GCCAAGGGTT CCGAGGTCTC GTTCTCGGGC
GTGCCCTCCG GTCAGGTGAA GGATATCGAA CTGTGGGAGC GCGACCCCGA ATTCGTGCGC
GTGCGCATCG CGGTCGACAA GAAGGTGCCG ATCCTCCAGG GCACCACGGC CAGCCTTCAG
GGCAGCTTCA CCGGCGTCTC GACGATCCAG CTTTCCGGCG CGGTCAAGGG CGCGCCGCCG
ATCGACTGTC CCGATGAAAA CCGGCGCGCC GCCTGCCCGG AAGGCGTCCC CGTTATCCCG
ACCAAGCGCT CCGGCCTTGG CGAGATACTC TCGAATGCGC CGCTCCTGCT GGAACGCCTT
GCCACGCTGA CCGAGCGTCT GACGATGGTG CTGTCCGACA AGAACCAGAA GTCGATCGAG
AACATCCTTT CGAACACCGA CCGCCTCACC GGCAACCTCG CCGATGCCTC GCCCGACGTG
AAGCGTACGC TGGCAGAACT CCAGGCGACG CTGCGGCAGG CGAACTACAC GCTGGCGAGC
TTCGAGAAGC TGACCAATTC GGCCGATTCC ATGCTCAACG ACGAAGGCAA CGGCCTTGCC
CAGCAACTGC GCAAGACGCT CAAGTCGGCG CAGGGCGCCG CCGACGAGCT TCAGGGCACG
CTTTCCGAAG CGCGTCCCGC CGCGCGCCAG CTCAACGAAC GCACGCTGCC CGCCGCCGAA
GCCGCGATCC GCGATCTCCA GGCGACCACG CGGTCCCTGC GCGAAGTGAC CGACCGCCTC
AACGACCAGG GCGTCGGCGG CTTCGTCGGC GGTCCCAAGC TGCCCGACTA CAAGAACTGA
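
The GC content reported for this gene can be recomputed from the gene sequence. The following is a minimal Python sketch, assuming the sequence is pasted as plain text in the space-separated blocks of ten shown above. `gc_content` is a hypothetical helper name, and only the first line of the sequence is used here for brevity.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)

# First line of the gene sequence (60 of 960 bases).
# Paste all sequence lines to get the value for the whole gene.
first_line = "ATGGAAACGC GCGCAAACCA CATCTGGGTC GGTCTCGTCA CCCTGCTGCT TCTGGCTGGC"
print(f"{gc_content(first_line):.0%}")
```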
 
Protein sequence
METRANHIWV GLVTLLLLAG TALLTVWIAR INQGDLHEYD IFFKQSVDGL AKGSEVSFSG 
VPSGQVKDIE LWERDPEFVR VRIAVDKKVP ILQGTTASLQ GSFTGVSTIQ LSGAVKGAPP
IDCPDENRRA ACPEGVPVIP TKRSGLGEIL SNAPLLLERL ATLTERLTMV LSDKNQKSIE
NILSNTDRLT GNLADASPDV KRTLAELQAT LRQANYTLAS FEKLTNSADS MLNDEGNGLA
QQLRKTLKSA QGAADELQGT LSEARPAARQ LNERTLPAAE AAIRDLQATT RSLREVTDRL
NDQGVGGFVG GPKLPDYKN
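
The protein sequence can be reproduced by translating the gene sequence codon by codon. The following is a minimal Python sketch, assuming that the codon-to-amino-acid assignments of translation table 11 match those of the standard genetic code (alternative start codons are not modeled here). `translate` is a hypothetical helper, and only the first 30 bases of the gene are used.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence.
print(translate("ATGGAAACGC GCGCAAACCA CATCTGGGTC"))  # METRANHIWV
```

The output matches the first ten residues of the listed protein sequence.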