Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1504 |
Symbol | |
ID | 3917179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1547734 |
End bp | 1549518 |
Gene Length | 1785 bp |
Protein Length | 594 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444245 |
Product | glycoside hydrolase 15-related |
Protein accession | YP_496779 |
Protein GI | 87199522 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3387] Glucoamylase and related glycosyl hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGCTT CCCTCGATCT CTGGCCAATC GGCAACTGCC AGGTTTCGGC GCTGGTCGAT ACCTCGGGCG CATTTGTCTG GGGCTGCATT CCGCGGGTCG ATGGTGATCC GTTCTTCTCG GCCCTGCTGG GCGGTGAGAT GCCGCGGGAA GGCCTTTGGG CAATCGATCT CGAAGACCGG CTGGAAACCA CCCAAAGCTA TCTGCGCAAT ACCCCGATCC TCGTCACCCG CCACCGCGAT GCCAATGGCG GTGAGATCGA GGTGCTGGAC TTCTGCCCAT ATCTCCCGCG CAATGGCCGC ACCTATCGCC CCGTGGCCTA TGCCCGGATC GTGCGCCCCA TCGCGGGCAG CCCGCGCATC CGCATGCGCC TGCGGCCTAC CTGCGGGTGG GGCAACACCT GCCGGATGAC CATCGGCGGA TCGAACCATA TCCGCTACCT GTCCGAAGCC ATGACGATGC GGCTGACCAC GTCCGCACCG GTCGGCCTCG TTGCCGAGGA ACGGGCCTTC CGCCTTGAAC AGGCGCACTA CTTCTTCCTC GGCCCGGACG AGAGCTTCTC GGGCAACCTT GCCGAAACGC TCGAACGGAT GCTCGAGGCA ACCGCTGCCG AGTGGCGCCA CTGGGTGCGG GGCCTGGCCA CGCCGGTCGA ATGGCAGGAC GTGGTGATCC GGTCGGCCAT AACGCTCAAG CTGTGCCAGC ACGAGGAAAC CGGCGCCATC GTAGCCGCGC TGACCACCTC GATCCCCGAA CATGCCGGAT CGCAGCGGAA CTGGGACTAT CGCTACTGCT GGATCCGCGA TGCCTACTAC ACCGTGCAGG CCCTCAATCG CCTCGGCGCG CTGGACGTGC TCGAAGGTTA TCTCGCCTAC TTGCGCAATG TCGTCGACAA TGCGCGCGGT GGCCATATCC AGCCGCTCTA TGGCGTGCTC GGCGAAGCGA AGCTGGACGA AGGCCTTGCC GAAAGGCTGC CCGGCTATCG CGCGATGGGG CCGGTCCGCA TCGGCAACGC AGCCTGGTCG CAGGTTCAGC ACGATGCCTA TGGCCAGATC GTCCTGTCCA ACACGCAGGC GTTCCTTGAC CAGCGCTTGC TGCGCATGTC CGGCCTCGCC GATTTTGAAG CGCTGGAAAA GGTCGGCGAA AGGGCATGGG CCCTGTTCGA CAAGCCCGAT GCCGGCCTGT GGGAACTGCG CACCCGCCAG TCGGTCCATA CATATTCGGC GGCGATGTGC TGGGCGGCCT GTGACCGGCT GGGCAACGCC GCGCACGCGA TTGGCCTTGA TGACCGCGCG GCCTTCTGGG GCGAGCGCGC AGCCGCGATC CGCGAGCGGA TAGAGCAGGC CGCGTGGTGT CCCGAGACCG AGCGGATGTC GGCCACGTTC TCGGGCGACG ATCTCGATGC AAGCGTGATC CAGTTGCTCG ACCTGCGCTT CCTCGCGCCG GACGATCCGC GCTTCGTCTC CACGCTGGCC GCCATCGAAC AGGGCCTGCG GCGCGGATCG CACATGCTGC GCTATGCCAC CGAGGACGAT TTCGGCCTTC CCGAAACCGC CTTCAACGTC TGCACCTTCT GGCTGATAGA AGCCCTGCAC CTGACCGGAC GCCGTGCCGA AGCCCGCGCG CTCTACGAAG AGATGCTCAG CCGCCGCACC CAGTCGGGCC TCCTTTCGGA GGACATCGAT CCGGCAACCG GCGAACTCTG GGGAAACTAC CCGCAGACCT ACTCACTTGT CGGCCTGATC AACTGCGCCG TCCTGCTGAG CAAACCGTGG AATACCGTAC GATGA
|
Protein sequence | MTASLDLWPI GNCQVSALVD TSGAFVWGCI PRVDGDPFFS ALLGGEMPRE GLWAIDLEDR LETTQSYLRN TPILVTRHRD ANGGEIEVLD FCPYLPRNGR TYRPVAYARI VRPIAGSPRI RMRLRPTCGW GNTCRMTIGG SNHIRYLSEA MTMRLTTSAP VGLVAEERAF RLEQAHYFFL GPDESFSGNL AETLERMLEA TAAEWRHWVR GLATPVEWQD VVIRSAITLK LCQHEETGAI VAALTTSIPE HAGSQRNWDY RYCWIRDAYY TVQALNRLGA LDVLEGYLAY LRNVVDNARG GHIQPLYGVL GEAKLDEGLA ERLPGYRAMG PVRIGNAAWS QVQHDAYGQI VLSNTQAFLD QRLLRMSGLA DFEALEKVGE RAWALFDKPD AGLWELRTRQ SVHTYSAAMC WAACDRLGNA AHAIGLDDRA AFWGERAAAI RERIEQAAWC PETERMSATF SGDDLDASVI QLLDLRFLAP DDPRFVSTLA AIEQGLRRGS HMLRYATEDD FGLPETAFNV CTFWLIEALH LTGRRAEARA LYEEMLSRRT QSGLLSEDID PATGELWGNY PQTYSLVGLI NCAVLLSKPW NTVR
|
| |