Gene Saro_1987 details


Gene Information

Locus tag: Saro_1987
Symbol:
ID: 3917307
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2106466
End bp: 2109585
Gene Length: 3120 bp
Protein Length: 1039 aa
Translation table: 11
GC content: 66%
IMG OID: 640444739
Product: hypothetical protein
Protein accession: YP_497261
Protein GI: 87200004
COG category: [S] Function unknown
COG ID: [COG4995] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
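
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 2106466
END_BP = 2109585


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len):
    """Residues in an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


gene_len = gene_length_bp(START_BP, END_BP)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 3120 1039
```

Both results agree with the Gene Length and Protein Length fields of this record.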


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGCCCGC GCCGGATGCT TTCCGCGAAG AACAGTCGTC TTGGGGGCCA CTTCGCGAAG 
ACGACCCTTT CGATCGCCGC ACTTGCTTGT GCCATGCCGG CAGTCGCCGG AGGAACCGGC
CAGCCGCTGT CGGTGCGCAA TTCCTTCCGC ATAGGATCGT CGGGCGTGAC CTGCACCGCG
CAGAACGCTC CTCTCGACAA GCGGCTTGGT GGCATCTTCG ACCGTGGCTA TCGGCTGAGT
TGTCGCGATG CCGCAGGCGC TGTGGGCACG CTGATCGCAG TGCGGCGCGA GGTGTTGCCT
GGGAGCGAGC CGACCGGGCT TTCCGGCGTT ACGCTTGCTT GCGGGGCGGA CGGGTTTGTC
GCGATCGATG CGGTCGGCCG TGTGAATGCC GCCAATTGCC GGGATCAGGA CGCCAACGTC
GAATACAAGC GCTATGCCGT CACCCGGCGC GGCGTGACCT ATTTCGTCGA GGGGCTTGCC
GGGTACGATC CAGCCCTGCG CCTTGCGCTG GCGAGCGTCG TGAACGACGC ACCCGTGCCG
GGTGAAATAC GTGTCGCAAC GACCGAAGTA AGCGACCCGG CCGCCTTCGC CCGGGTACAG
GCCGGTCAGC TCGACCGCCT TGGCGCCCGC GACGAAGGGT ATCTGCGCAA CAACGGAGGC
CGCTACGCGG AGTCGGCGGA GTTCTTTGAG GCTCTGGCGT CGAGGGACCG TGCCAGTGGC
GGGGCGGGCC TTGCCGAAGC CCTTGCCAAC CAAGGTCTCC AGCAGTCCAA CCTCGGAAAC
TTCGCCGCGG CCGAACGCCT GTTTGGGCAG GCCGCTGCCG CGCCGGGCGG CAGGGATGGC
GTGACGCAGC GACTCGTCCG CAACTACCGC GCGATCAACC AACTCAACCA GCGCAAGCCT
GTGGCCGCGA TCAACGCCCT GGCCGAACCC GTAGCGGCGG TGAGCGTCTC TTTCGACCGC
GACAGCCTGG TGCTGGGGCT GATCAACATG CCCTTGGCGG AGCAGATCAA CCGGGAGAGT
TCGGCGCTCA AGCGCTTGGG CGCAGTCGAC CCCGGATTGA CCGAGACCGA GCGTGCCGAA
ATTCTCGATG CGCAGGCGGA TTCCCTGCGC GCGACCGCAG CGCGGTTGCA GGGCAAGTAC
GACGTTGCGG TGACCGGCTT CGAGATCGCA GGCCGACGCC TGGATGCAGT TCGCGGCGGG
CGCGTGGCTT CCGCTGGATG GTTGCGCTCC GAAATCCAGA TTGAACTTGC TCTGTTGGCA
GAGGCACAGG GCCGCAACGC CGATGCAGCA ACCGCGTTTG ACCGCGCCAT CGCCATCATC
GATAGCGCCT TCCCACAATC GCCCGCACTG CTTTCGGCGC AGGCGCGCAA GGCTGCGTGG
CTGGGCCGCT CGGGCGACGA GGCGGGCGCG CTGGCGCTTT ATGCCCATGT CGTGGATCAA
AGCCTTGCCG TGCCGGATGC CGGTACCACC CTGCGCGATC TGCTGGGCCC CTATTTCGGC
TTGCTCGCGA AGCGGAACGG TGCCGGTAAT GCGAGCGCGA TGTTCGCCGC CTCGCAAGTC
CTCCAGCGTC CGGGTGTTGC CCAGACCCAG GCGGTTCTGG CGCGCCAGAT GTCGGAGGGC
AACGACGAGG CCGCGGCCCT GTTCCGTCTG TCGCTTGCCA GAAGCCGCGA TATCGCGCGC
ACCGAGGCAA TGGTCGAGCA GCTTTCCGCC TTGACCGGAC TGACCGAGCA GCAGGCTGCA
TCACTCAAGG CGGCACAGGA CAACCTGGCA TCGCTCAAGC GTGACCAGAC GGCGCTGGTG
AGCAAGCTTG CAGCCTATCC TCGCTACAAC GTCCTGGCGC CGAAGGGCGT GGAACTTGCC
GAACTGCAAT CCGCGCTCAA GCCGGGCGAA GCGTACTACA AGATGATGGT GGTCGGCGGA
CGGGTCTATG GACTGTTCGT CACGTCAGGG AGCGGCGAGG CGCGGACGTT CGATACCGGG
ATCGATCCGG CGACCTTGTC TCGCAATGTC CAGGCGATCC GCGATACGAT CGTCAAGGTG
GAGAACGGCC AGCAGGTCAA TTATCCTTTC GACCTCGACA AGTCGCGCGC CCTCTACAAG
ACGCTGTTCG GTGCGGTGGA AGACCTTCTG CCCCAGACCC GCCACCTGGT GTTTGAACCG
GACGGCGCGA TGCTGCAGCT TCCTCCCACA GTGCTGGTGT CGGGAGACAA GGGCATCGAG
GCCTACAAGG CCCGGATGGA AAGCCCCGAT GGCGATCCGT TCGACTTCAC CGGGGTCGAG
TGGCTTGGAC GCGGCCGGGA AGTTTCCATC GCCGTCAGTC CGCGCGGATT CCTCGACATC
CGCAAGCTGG CCGCATCGAC CGCGCCGCGC AACTACCTTG GTCTGGGGCA CAATGCGAAG
CCGGCCGCGC GCCCGGTTAC CGCAGTCGCG GATGAGTGCG ACTGGCCGCT TGCGACGTGG
CAGAATCCGA TCTCTGCCGA CGAGCTGTAC TATGCCCAGA AGAAGCTTGG TGCGGGAGGC
AGCGCGGTCA AGACCGACGC CGCGTTCAGT GACAGCGCCT TGCTGGCGGA GAGTGACCTC
GACCAGTATC GCGTTCTGCA CTTTGCCACG CATGGACTGG TCACTGCGCC GCGCGCCGAC
TGTCCTGCGC GCCCTGCGCT GGTGACCAGC TTTGGCGACA TGGGCTCCGA TGGCCTCCTG
AGCTTCCGCG AGATCTTCGA CCTGAAGCTC AACGCGGACC TCGTAATCCT CTCCGCGTGC
GACACGGCCG GTATGGCGAC TGTGGCGGCA AGCCGGGAGG CCGGCGTGAC CTCGGGCGGG
AACTACGCCC TCGATGGCCT CGTCCGTGCG TTCGTGGGGG CTGGTGCGCG TTCGGTCATT
GCCAGCCACT GGCCGGTGCC CGACGACTTC GACGCGACCA AGCGCCTCAT CGGCGGCGTC
ATCGAAGCGA AACCGGGACA GGATCTTGCC GATGCCCTGT CGGGCGCACA GACCCGGCTG
ATGGACGACC CCAACACATC GCATCCGTTC TACTGGGCAG CGTTCATAAT CCTCGGCGAC
GGGGCGAAGC CGCTGGTGTC GGGGAAGGCT GCGATGTCCG CTCCTGATCC GGTCCGGTAA
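
The gene sequence above is laid out in space-separated blocks of 10 bases. As a minimal sketch (the helper names are hypothetical), the blocks can be joined and a GC percentage computed as shown below. Only the first line of the sequence is used here, so the result will generally differ from the 66% reported for the full gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(text):
    """Percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = (
    "GTGAGCCCGC GCCGGATGCT TTCCGCGAAG "
    "AACAGTCGTC TTGGGGGCCA CTTCGCGAAG"
)
print(len(clean_sequence(first_line)))  # 60
print(round(gc_content(first_line), 1))
```

Passing the full sequence text to `gc_content` in the same way gives the figure for the whole gene.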
 
Protein sequence
MSPRRMLSAK NSRLGGHFAK TTLSIAALAC AMPAVAGGTG QPLSVRNSFR IGSSGVTCTA 
QNAPLDKRLG GIFDRGYRLS CRDAAGAVGT LIAVRREVLP GSEPTGLSGV TLACGADGFV
AIDAVGRVNA ANCRDQDANV EYKRYAVTRR GVTYFVEGLA GYDPALRLAL ASVVNDAPVP
GEIRVATTEV SDPAAFARVQ AGQLDRLGAR DEGYLRNNGG RYAESAEFFE ALASRDRASG
GAGLAEALAN QGLQQSNLGN FAAAERLFGQ AAAAPGGRDG VTQRLVRNYR AINQLNQRKP
VAAINALAEP VAAVSVSFDR DSLVLGLINM PLAEQINRES SALKRLGAVD PGLTETERAE
ILDAQADSLR ATAARLQGKY DVAVTGFEIA GRRLDAVRGG RVASAGWLRS EIQIELALLA
EAQGRNADAA TAFDRAIAII DSAFPQSPAL LSAQARKAAW LGRSGDEAGA LALYAHVVDQ
SLAVPDAGTT LRDLLGPYFG LLAKRNGAGN ASAMFAASQV LQRPGVAQTQ AVLARQMSEG
NDEAAALFRL SLARSRDIAR TEAMVEQLSA LTGLTEQQAA SLKAAQDNLA SLKRDQTALV
SKLAAYPRYN VLAPKGVELA ELQSALKPGE AYYKMMVVGG RVYGLFVTSG SGEARTFDTG
IDPATLSRNV QAIRDTIVKV ENGQQVNYPF DLDKSRALYK TLFGAVEDLL PQTRHLVFEP
DGAMLQLPPT VLVSGDKGIE AYKARMESPD GDPFDFTGVE WLGRGREVSI AVSPRGFLDI
RKLAASTAPR NYLGLGHNAK PAARPVTAVA DECDWPLATW QNPISADELY YAQKKLGAGG
SAVKTDAAFS DSALLAESDL DQYRVLHFAT HGLVTAPRAD CPARPALVTS FGDMGSDGLL
SFREIFDLKL NADLVILSAC DTAGMATVAA SREAGVTSGG NYALDGLVRA FVGAGARSVI
ASHWPVPDDF DATKRLIGGV IEAKPGQDLA DALSGAQTRL MDDPNTSHPF YWAAFIILGD
GAKPLVSGKA AMSAPDPVR
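
The protein sequence above corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that step, not the database's own pipeline: it builds the standard codon table by hand (table 11 uses the standard codon assignments) and treats an initial GTG or TTG as methionine, which is an assumption covering only the most common alternative start codons. The function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Standard codon assignments, in TCAG order for each codon position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only these alternative starts are handled in this sketch.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text):
    """Translate a CDS, reading the start codon as M and stopping at a stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = (
    "GTGAGCCCGC GCCGGATGCT TTCCGCGAAG "
    "AACAGTCGTC TTGGGGGCCA CTTCGCGAAG"
)
print(translate_cds(first_line))  # MSPRRMLSAKNSRLGGHFAK
```

The output matches the first 20 residues of the protein sequence listed in this record.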