Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3699 |
Symbol | |
ID | 5077847 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009427 |
Strand | - |
Start bp | 333303 |
End bp | 334886 |
Gene Length | 1584 bp |
Protein Length | 527 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640481422 |
Product | extracellular solute-binding protein |
Protein accession | YP_001166084 |
Protein GI | 146275924 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACGG GCCCGATCAG TCGCCGCGTT GCGCTCGGCG CGGTCGCCGG TGCAGGCGCG GCCGCGCTCG GCGGCAGACT GTTGATGCGA CAGCCCCCGG GCGACCTGCC GCATCCAGCC AGCGAAGGCG GGCGCAAATC CACGCCTCGC CCCGGCGGAC GCATCCGCGT GGCGAGCACA TCGACATCGA CTGGCGACAC GCTCGACCCT GCCAAGGGCG CGGTGAACAC GGACTATGTC CGCCACTTCA TGCTCTACAG TGGCCTGACC GAGTTCGACC GCGACCTGAG CGCGCGCCCG GCCCTTGCCG AGAGCATTTC GAGCGACGAC CAGAAAGTCT GGACAATCAG CCTGCGCAAG GGTGTGACGT TCCACGACGG GAAGTCGTTC ACAGCAGACG ACGTCGTATA TTCGCTGCAG CGCCACAAGG ACCCCAAAGT GGGGTCCAAG ATGTCGGACA TCGCGAAGCA GTTCGCGTCT GTCGAAGCTC TGGGCCGCAA CATGGTGAGG ATCGTCCTGA CCGGGCCTAA CGCCGATCTC CCCATCATCC TTGCGCAATC ACACTTCCTG ATCGTGCCGG CGGACATCCA GGATTTCAGC GCGGGCAACG GCACGGGGCC CTATCGGTTG GCCGAATTCA AGCCGGGAGT GCGCACGGTG GTCCGCCGCA GCCCGGACTT CTGGAAACCG GGCAAACCCT ATCTCGACGA GATCGAGCTT ATCGGCATTC AGGACGAGAT CAGCCGGGTC AACGCGCTTC TTTCGGGCGA CGTACAGTTG ATCAACGCGG TCAATCCGCG TTCGACGCGT CGCATCCTGG CATCGCCGGA GCACGGTATC GTCGAAACCA AATCAGGGCT CTATACCAAT CTCGTGGCCC GTCAGGACCG GTTGCCGACG GGTAATCCCG ATTTTACAGC AGCGCTCAAG CACCTTGTCG ACCGGCCGCT CGTCAACCGC GCCCTGTTCC GCAATTACGG TACGATCGGC AACGACCAGC CGCTGCCACC ATCGCACCAG TACTTCCGTG CGGACCTGCC GCAGACCGCG CTCGATCTGG ACCGTGCGAA ATGGCATCTG CAGCGCTCTG GGCTCACGGG AATTCGCTTG CCTGTCTACG CCTCGACCGC CGCCGAAGGT TCGGTGGACA TGGCCTCGAT CCTGCAGGAA TTCGGCGCGA GGATCGGACT GGATCTTGCG GTGAATCGGG TCCCGGCGGA CGGTTACTGG TCTACCCACT GGATGAAGCA CCCGTTGTTC TTCGGAAACA GCAACCCGCG GCCGACCGCC GACCTCATTT TCAGTCTCTT CTACAAGTCC GATGCCACCT GGAATGAATC AGGTTGGAAG GACCCGCGCT TCGACAGCCT GGTGATCGAG GCGCGCGGGG AGGCGGACCA TGAGCGGCGT AAGCAGCTAT ACGGCGAGAT GCAGGGTCTG GTCCGTGACC ATTGCGGCAG CGTGATCCCC GTCTTCATCA GCCTTCTGGA CGGACACGAC CGGCGCCTGA AAGGGCTCTA CCCCGTGCCG CTTGGCGGTT TCATGGGATA CACGTTTGCC GAACACGTCT GGTGGGACGC GTGA
|
Protein sequence | MSTGPISRRV ALGAVAGAGA AALGGRLLMR QPPGDLPHPA SEGGRKSTPR PGGRIRVAST STSTGDTLDP AKGAVNTDYV RHFMLYSGLT EFDRDLSARP ALAESISSDD QKVWTISLRK GVTFHDGKSF TADDVVYSLQ RHKDPKVGSK MSDIAKQFAS VEALGRNMVR IVLTGPNADL PIILAQSHFL IVPADIQDFS AGNGTGPYRL AEFKPGVRTV VRRSPDFWKP GKPYLDEIEL IGIQDEISRV NALLSGDVQL INAVNPRSTR RILASPEHGI VETKSGLYTN LVARQDRLPT GNPDFTAALK HLVDRPLVNR ALFRNYGTIG NDQPLPPSHQ YFRADLPQTA LDLDRAKWHL QRSGLTGIRL PVYASTAAEG SVDMASILQE FGARIGLDLA VNRVPADGYW STHWMKHPLF FGNSNPRPTA DLIFSLFYKS DATWNESGWK DPRFDSLVIE ARGEADHERR KQLYGEMQGL VRDHCGSVIP VFISLLDGHD RRLKGLYPVP LGGFMGYTFA EHVWWDA
|
| |