Gene Saro_3699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3699 
Symbol 
ID5077847 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp333303 
End bp334886 
Gene Length1584 bp 
Protein Length527 aa 
Translation table11 
GC content63% 
IMG OID640481422 
Productextracellular solute-binding protein 
Protein accessionYP_001166084 
Protein GI146275924 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCACGG GCCCGATCAG TCGCCGCGTT GCGCTCGGCG CGGTCGCCGG TGCAGGCGCG 
GCCGCGCTCG GCGGCAGACT GTTGATGCGA CAGCCCCCGG GCGACCTGCC GCATCCAGCC
AGCGAAGGCG GGCGCAAATC CACGCCTCGC CCCGGCGGAC GCATCCGCGT GGCGAGCACA
TCGACATCGA CTGGCGACAC GCTCGACCCT GCCAAGGGCG CGGTGAACAC GGACTATGTC
CGCCACTTCA TGCTCTACAG TGGCCTGACC GAGTTCGACC GCGACCTGAG CGCGCGCCCG
GCCCTTGCCG AGAGCATTTC GAGCGACGAC CAGAAAGTCT GGACAATCAG CCTGCGCAAG
GGTGTGACGT TCCACGACGG GAAGTCGTTC ACAGCAGACG ACGTCGTATA TTCGCTGCAG
CGCCACAAGG ACCCCAAAGT GGGGTCCAAG ATGTCGGACA TCGCGAAGCA GTTCGCGTCT
GTCGAAGCTC TGGGCCGCAA CATGGTGAGG ATCGTCCTGA CCGGGCCTAA CGCCGATCTC
CCCATCATCC TTGCGCAATC ACACTTCCTG ATCGTGCCGG CGGACATCCA GGATTTCAGC
GCGGGCAACG GCACGGGGCC CTATCGGTTG GCCGAATTCA AGCCGGGAGT GCGCACGGTG
GTCCGCCGCA GCCCGGACTT CTGGAAACCG GGCAAACCCT ATCTCGACGA GATCGAGCTT
ATCGGCATTC AGGACGAGAT CAGCCGGGTC AACGCGCTTC TTTCGGGCGA CGTACAGTTG
ATCAACGCGG TCAATCCGCG TTCGACGCGT CGCATCCTGG CATCGCCGGA GCACGGTATC
GTCGAAACCA AATCAGGGCT CTATACCAAT CTCGTGGCCC GTCAGGACCG GTTGCCGACG
GGTAATCCCG ATTTTACAGC AGCGCTCAAG CACCTTGTCG ACCGGCCGCT CGTCAACCGC
GCCCTGTTCC GCAATTACGG TACGATCGGC AACGACCAGC CGCTGCCACC ATCGCACCAG
TACTTCCGTG CGGACCTGCC GCAGACCGCG CTCGATCTGG ACCGTGCGAA ATGGCATCTG
CAGCGCTCTG GGCTCACGGG AATTCGCTTG CCTGTCTACG CCTCGACCGC CGCCGAAGGT
TCGGTGGACA TGGCCTCGAT CCTGCAGGAA TTCGGCGCGA GGATCGGACT GGATCTTGCG
GTGAATCGGG TCCCGGCGGA CGGTTACTGG TCTACCCACT GGATGAAGCA CCCGTTGTTC
TTCGGAAACA GCAACCCGCG GCCGACCGCC GACCTCATTT TCAGTCTCTT CTACAAGTCC
GATGCCACCT GGAATGAATC AGGTTGGAAG GACCCGCGCT TCGACAGCCT GGTGATCGAG
GCGCGCGGGG AGGCGGACCA TGAGCGGCGT AAGCAGCTAT ACGGCGAGAT GCAGGGTCTG
GTCCGTGACC ATTGCGGCAG CGTGATCCCC GTCTTCATCA GCCTTCTGGA CGGACACGAC
CGGCGCCTGA AAGGGCTCTA CCCCGTGCCG CTTGGCGGTT TCATGGGATA CACGTTTGCC
GAACACGTCT GGTGGGACGC GTGA
 
Protein sequence
MSTGPISRRV ALGAVAGAGA AALGGRLLMR QPPGDLPHPA SEGGRKSTPR PGGRIRVAST 
STSTGDTLDP AKGAVNTDYV RHFMLYSGLT EFDRDLSARP ALAESISSDD QKVWTISLRK
GVTFHDGKSF TADDVVYSLQ RHKDPKVGSK MSDIAKQFAS VEALGRNMVR IVLTGPNADL
PIILAQSHFL IVPADIQDFS AGNGTGPYRL AEFKPGVRTV VRRSPDFWKP GKPYLDEIEL
IGIQDEISRV NALLSGDVQL INAVNPRSTR RILASPEHGI VETKSGLYTN LVARQDRLPT
GNPDFTAALK HLVDRPLVNR ALFRNYGTIG NDQPLPPSHQ YFRADLPQTA LDLDRAKWHL
QRSGLTGIRL PVYASTAAEG SVDMASILQE FGARIGLDLA VNRVPADGYW STHWMKHPLF
FGNSNPRPTA DLIFSLFYKS DATWNESGWK DPRFDSLVIE ARGEADHERR KQLYGEMQGL
VRDHCGSVIP VFISLLDGHD RRLKGLYPVP LGGFMGYTFA EHVWWDA