Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_0234 |
Symbol | |
ID | 3916222 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 242270 |
End bp | 243529 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640442959 |
Product | protein of unknown function DUF894, DitE |
Protein accession | YP_495516 |
Protein GI | 87198259 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGCCTGCGA TACCCTCCCA GAACCCCTCC GGCGGCGCGC TCGCGCCGTT CCGCTACCCG GCGTTCCGGG CGATCTGGAC GGCCAACCTG TTCTCCAACA TCGGCTCGAT GATCCAGTCG GTGGGCGCCG CGTGGCTGAT GACCGAGCTG ACCACGTCGC ACTTGCTGGT CGCGCTGGTT CAGGCCTCGG CGACGATACC GATCCTGCTG CTCGGCATGT TCGCGGGCGC CATTGCCGAC AACTACGACC GGCGCCGGGT CATGCTCGCG GCGCAGAGCG GAATGCTTGT CGTGTCCGCC GTGCTGGCCG TGCTGTCCTA TACCGACAAC ATCGGGCCCT GGTCGCTGCT GGCGCTGACG CTGATGGTCG GGATGGGAAC CGCGCTGAAT GGTCCCGCAT GGCAGGCGTC CGTCCGGTTG CAGGTTGCCC ATGCCGATCT CCCGCAGGCG ATCTCGCTCA ACGCCATCTC CTTCAATCTT GCCCGCAGTG TGGGGCCGGC GCTTGGCGGC ATCCTGATAT CGCTGTGGGA TACGAGCCTC GCCTTCGCGC TGAATGCGGT CAGCTACATC GGCATGATCG CGGTCCTCGC CATGTGGCGG CCCGAATCGC TGCCGCCGAT GCGCGAGCCG ATGCTGGGAG CGATCGGGCG CGGCATCCGC TTCTGCGCGT CGTCATCGCC CATTCGCAAG GTGCTGCTGC GCGGCCTCGC CATGGGCCTT GGCGCAGCCG GTCTCCAGGC CCTCATGCCC AGCGTCGCGC GCGACATGCT GAAGGGCAGC GAACTGGACT ATGGCCTGAT GCTCGGCGGA TTCGGCATCG GCTCGATCGT GACGGCGCTG TGGATTTCCA GACTGCGCCG CCGTCTGGGC AGCGAGACGG TGGTGACCGC CGCCACGCTG ATCTTCGCGA GCGCGCAAGT GCTGATGGCA TCGGCGACGA ACATGCCGAT GGCGGTGGTC GCCGCGTTCA TGGGCGGCAT GGGCTGGGCC AGCGCGATGA CCAGCCTCAA CGTCGCGATG CAGCTTCGCA GTCCCGAGGA CATTCTCGGC CGCTGTCTTT CGATCTATCA GGCGGTGACC TTCGGCGGCA TGGCGCTGGG CGCATGGGCC TGGGGCACGG TCGCGGACGT GGCGGGGCTG CCGACCGCGC TGCACGCGGC CGCCCTCTGG CTTGCCGCGT CGCTTGCCTT GCACTTTTTT GCCCCTATGC CGACGCGCGA GGAAGGACGC CTCGACGTTG TGCCGGAGAG CAGACCGTGA
|
Protein sequence | MPAIPSQNPS GGALAPFRYP AFRAIWTANL FSNIGSMIQS VGAAWLMTEL TTSHLLVALV QASATIPILL LGMFAGAIAD NYDRRRVMLA AQSGMLVVSA VLAVLSYTDN IGPWSLLALT LMVGMGTALN GPAWQASVRL QVAHADLPQA ISLNAISFNL ARSVGPALGG ILISLWDTSL AFALNAVSYI GMIAVLAMWR PESLPPMREP MLGAIGRGIR FCASSSPIRK VLLRGLAMGL GAAGLQALMP SVARDMLKGS ELDYGLMLGG FGIGSIVTAL WISRLRRRLG SETVVTAATL IFASAQVLMA SATNMPMAVV AAFMGGMGWA SAMTSLNVAM QLRSPEDILG RCLSIYQAVT FGGMALGAWA WGTVADVAGL PTALHAAALW LAASLALHFF APMPTREEGR LDVVPESRP
|
| |