Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1953 |
Symbol | |
ID | 3917268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2071344 |
End bp | 2072978 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640444700 |
Product | general substrate transporter |
Protein accession | YP_497227 |
Protein GI | 87199970 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGCGTGA ATGCATCCGT AACGAAGAAC GACGCCGCGA AGCCGAGCGC GTCGGATATC AGGCTCGTGA TCGCCGCCAG TTCGGCGGGT ACCGTGTTCG AATGGTACGA TTTCTTCATC TACGGCACGC TGGCATCAAT CATCGGCAGG ACGTTCTTTC CCTCGGACAA CGCAACGCTC CAGGTGCTGC TGGTGTGGGC GGGATTTGCC GTCGGCTTCG GCTTCCGTCC GCTCGGCGCA GTCTTGTTCG GCTATCTCGG CGACAAGCTC GGCCGCAAGT ATACCTTTCT CGTCACGGTC ACGCTCATGG GTGTCGCCAC GGCGGGCGTC GGCCTGATCC CCTCTGCCGC GACCATCGGC CTTGCCGCAC CGGCCATCGT CATCCTGCTG CGCGTGCTGC AAGGGCTGGC ACTGGGTGGG GAGTACGGCG GCGCTGCGAT CTATGTTGCA GAGCATGCAC CGGGCGGCCG TCGCGGCTAT TACACCAGCT ACATCCAGGC CAGCGTCGTG GGCGGCTTCG TGCTGAGTCT GATCGTAGTC CTGTCCAGCA AGGCGCTGAT GAGCGATGCC GTGTGGAACG ACTGGGGTTG GCGCGTGCCG TTCCTGGTCA GCCTCGCGCT TCTCGCGATT TCGTTGTGGA TGCGCATGAA GCTGTCGGAA AGTCCGGTGT TCCAGGCGAT GAAGGAGGAG GGCGAGCTTG CCGGTAATCC CTTCGTCGAA AGCTTCACCT ACCCTGGCAA CAAGCGCCGC ATTTTCATCG CGCTGTTCGG CATCGCCGCC GGGCTAACCG TGATCTGGTA CACGGCGATG TTCTCCGGCC TCAGCTTCCT CAAGTCTGCA ATGCGCATGG AGGATACTCT GGCCGAGGTC GTCGTCGGTA TCGGCGCCAC ACTCGGAATG GGCTTCTTCA TCTACTTCGG CTCTCTTTCC GACCGTATCG GCCGTAAGAA GCCCATCATC ATCGGCTATG CCGTCACGCT GCTAATGCTC TTTCCCACGT TCTGGCTGAT GGGGGCCGCC GCCAATCCGC AACTCGCCGA GGCGGCAGAG CGCAACCCGG TCGTCGTAGC CGGGCCTGAC TGCAACTACA GCCCCTTTGC CTCTGAACAA GTCAGCAATT GCGGCAAGCT CCTGGCTGAC CTGGCGGCGT CCGGTGTGTC TTATAGTCTG CGCGATGACG CTGTGTTTGG CATGACCGCA GGTGGTTCAG CGGTTGATCT TGCCAGCTAT CCGTGGACGG ACAAGGCTGC TGCGCGCGCC AAGGCGCTCC AGTCCGAGCT TTCCGCGCAT GGCTACGATT TCGCCAAGGT CCAGCCCTCG CTCGGCCGGA TTGTCGCGGT TATCGGTGCG CTGCTGGCGC TCATGGCGAT GTCCGGTGCG ACCTACGGGC CGGTGGCCGC CCTTCTTTCC GAGATGTTCC CGCCGCGCAT CCGATACAGT TCGATGTCGA TCCCGTATCA TCTCGGCACG GGCTACTTCG GGGGTTTCCT GCCGCTGATT TCCAGCTACA TCGTCGCGCG CACCGGCGAT CCCTATGCCG GGCTATGGTA CACTTGGGTG GTCGTCCTGG TCGCGCTCCT CGTTGCGGCG TGGGGTTTGC GGCCAGGCCT GCCCGCCGAC TTCACGGATG ACTGA
|
Protein sequence | MCVNASVTKN DAAKPSASDI RLVIAASSAG TVFEWYDFFI YGTLASIIGR TFFPSDNATL QVLLVWAGFA VGFGFRPLGA VLFGYLGDKL GRKYTFLVTV TLMGVATAGV GLIPSAATIG LAAPAIVILL RVLQGLALGG EYGGAAIYVA EHAPGGRRGY YTSYIQASVV GGFVLSLIVV LSSKALMSDA VWNDWGWRVP FLVSLALLAI SLWMRMKLSE SPVFQAMKEE GELAGNPFVE SFTYPGNKRR IFIALFGIAA GLTVIWYTAM FSGLSFLKSA MRMEDTLAEV VVGIGATLGM GFFIYFGSLS DRIGRKKPII IGYAVTLLML FPTFWLMGAA ANPQLAEAAE RNPVVVAGPD CNYSPFASEQ VSNCGKLLAD LAASGVSYSL RDDAVFGMTA GGSAVDLASY PWTDKAAARA KALQSELSAH GYDFAKVQPS LGRIVAVIGA LLALMAMSGA TYGPVAALLS EMFPPRIRYS SMSIPYHLGT GYFGGFLPLI SSYIVARTGD PYAGLWYTWV VVLVALLVAA WGLRPGLPAD FTDD
|
| |