Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2623 |
Symbol | |
ID | 3917038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2837522 |
End bp | 2839213 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640445382 |
Product | general substrate transporter |
Protein accession | YP_497893 |
Protein GI | 87200636 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAACC TTGAGATAGC GGATGATTTT TCGGCTTTGC CGAAACATCA TCAGGCCACG CACAACGAGA AGCTGATCAT AACCGCGTCT TCGCTAGGCA CGGTGTTCGA GTGGTACGAC TTCTACCTCT ATGGTCTCCT CACCGCGATC ATCGCGGCGA AGTTCCTGAC AGGGCTCAAC CCGACGACCT CGTTCATCAT GGCGCTGCTC GTCTTCGCGG CGGGCTTCAT CGTCCGTCCT TTCGGGGCAC TCGTGTTCGG CCGCATCGGG GACATGGTGG GCCGCCGCTA TACCTTCATC GTGACGCTGC TGGTCATGGG CCTGTCGACC TTCCTTGTCG GTTGCCTGCC CACCTATGAA ACCGTCGGCG TGGCCGCGCC GATCATGCTC GTCGTCCTGC GCATGTTCCA GGGCCTGGCC CTGGGCGGGG AATACGGCGG CGCCGCAACC TACGTGGCCG AGCACGCGCC CGAAGGAAAG CGGGGGCTCT ATACGAGCTG GATCCAGATC ACCGCGACGG CCGGGCTGGC CATGGCCTTG CTGATCGTGA TCCTCGTGCG CTCGCCGGTC ACCGGCGTGG GTGAGGAGGC GTTCAAGGAC TGGGGCTGGC GCGTGCCCTA CCTGATCTCG GGCCTGTTCC TCTGCGTGGG GCTGTGGCTG CGCCTGAAGC TGCATGAATC GCCCGTCTTC CAGAAGATGA AGGACGAAGG CACATCCTCC AAGCGTCCGC TGGGCGAAGC CTTCGGCGAA TGGAAGAACC TCAAGATCGT GCTGATCGCG TTCTTCGGCG CCATCGCGGG CCAGGCCGTG GTCTGGTACA CCGGCCAGCT CTACGCGATG TACTTTCTCG AAAAGATGCT CAAGGTCGAT GGCCTTACCG CCAACACGCT GATCATCGTC GCGCTTGCCT GTGCCACGCC GTTCTTCCTG TTCTTCGGCT GGCTGTCGGA CAAGATCGGT CGCAAGAAGA TCATCCTGGC CGGTTGCGCG CTCGCTGCGC TCACCATGTT CCCGGCGTTC AAGGCGCTGA CCGAAGCGGC GAACCCCGCC CTTGCCCATG CCCAGGCCAA TGCGCCGGTA AAGGTCTTGG CCAACCCCGC CGAATGCTCG TCGCAGTTCG ACCCGGTAGG CGCGAACAAG TTCGACACCA CAAGCTGCGA CATCGTCAAG AACGCGCTCG CCAAGGCGGC GGTGAACTAC GAGAACGTCG CGGCACCCGC CGGCACCATT GCTTCGATCC GGATCGGCAG CACCACCATC GTCGCTCCCG ATCCGTCGAA GGTTTCGGGT GACGAGAAGA AGGCCGCCAT CGCCGCCTTC GCCAAGCAGG TCACCTCCGA CCTGGAGACC GTGGGCTACC CGGCCAAGGC CGACCCTGAC CAGATCGACA AGCCGGTCGT CGTGGCGATC CTGTTCTATC TCGTCCTGCT GGTGACCATG GTCTACGGTC CCATCGCGGC GATGCTGGTC GAACTGTTCC CCAGCCGCAT CCGCTACACC TCGATGAGCC TGCCGTACCA CATCGGCAAT GGCTGGCTCG GCGGCCTGCT CCCGGCCATC GGTTTTGCCA TGGTCGCGGC CAACGGGGAT ATCTACCACG GCTTCTGGTA CCCGGTGATC GTGGCCGCCG CCACTTGCGT CATAGGCCTG GTGTTCCTGC CGGAGACGTA CAAGCGGAGC ATCGACGACT AA
|
Protein sequence | MSNLEIADDF SALPKHHQAT HNEKLIITAS SLGTVFEWYD FYLYGLLTAI IAAKFLTGLN PTTSFIMALL VFAAGFIVRP FGALVFGRIG DMVGRRYTFI VTLLVMGLST FLVGCLPTYE TVGVAAPIML VVLRMFQGLA LGGEYGGAAT YVAEHAPEGK RGLYTSWIQI TATAGLAMAL LIVILVRSPV TGVGEEAFKD WGWRVPYLIS GLFLCVGLWL RLKLHESPVF QKMKDEGTSS KRPLGEAFGE WKNLKIVLIA FFGAIAGQAV VWYTGQLYAM YFLEKMLKVD GLTANTLIIV ALACATPFFL FFGWLSDKIG RKKIILAGCA LAALTMFPAF KALTEAANPA LAHAQANAPV KVLANPAECS SQFDPVGANK FDTTSCDIVK NALAKAAVNY ENVAAPAGTI ASIRIGSTTI VAPDPSKVSG DEKKAAIAAF AKQVTSDLET VGYPAKADPD QIDKPVVVAI LFYLVLLVTM VYGPIAAMLV ELFPSRIRYT SMSLPYHIGN GWLGGLLPAI GFAMVAANGD IYHGFWYPVI VAAATCVIGL VFLPETYKRS IDD
|
| |