Gene Saro_1504 details

Gene Information

Locus tag: Saro_1504
Symbol:
ID: 3917179
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1547734
End bp: 1549518
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 65%
IMG OID: 640444245
Product: glycoside hydrolase 15-related
Protein accession: YP_496779
Protein GI: 87199522
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
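
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that does not appear in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1547734
end_bp = 1549518

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1785 594
```

Under those assumptions the result matches the listed Gene Length (1785 bp) and Protein Length (594 aa).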


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGCTT CCCTCGATCT CTGGCCAATC GGCAACTGCC AGGTTTCGGC GCTGGTCGAT 
ACCTCGGGCG CATTTGTCTG GGGCTGCATT CCGCGGGTCG ATGGTGATCC GTTCTTCTCG
GCCCTGCTGG GCGGTGAGAT GCCGCGGGAA GGCCTTTGGG CAATCGATCT CGAAGACCGG
CTGGAAACCA CCCAAAGCTA TCTGCGCAAT ACCCCGATCC TCGTCACCCG CCACCGCGAT
GCCAATGGCG GTGAGATCGA GGTGCTGGAC TTCTGCCCAT ATCTCCCGCG CAATGGCCGC
ACCTATCGCC CCGTGGCCTA TGCCCGGATC GTGCGCCCCA TCGCGGGCAG CCCGCGCATC
CGCATGCGCC TGCGGCCTAC CTGCGGGTGG GGCAACACCT GCCGGATGAC CATCGGCGGA
TCGAACCATA TCCGCTACCT GTCCGAAGCC ATGACGATGC GGCTGACCAC GTCCGCACCG
GTCGGCCTCG TTGCCGAGGA ACGGGCCTTC CGCCTTGAAC AGGCGCACTA CTTCTTCCTC
GGCCCGGACG AGAGCTTCTC GGGCAACCTT GCCGAAACGC TCGAACGGAT GCTCGAGGCA
ACCGCTGCCG AGTGGCGCCA CTGGGTGCGG GGCCTGGCCA CGCCGGTCGA ATGGCAGGAC
GTGGTGATCC GGTCGGCCAT AACGCTCAAG CTGTGCCAGC ACGAGGAAAC CGGCGCCATC
GTAGCCGCGC TGACCACCTC GATCCCCGAA CATGCCGGAT CGCAGCGGAA CTGGGACTAT
CGCTACTGCT GGATCCGCGA TGCCTACTAC ACCGTGCAGG CCCTCAATCG CCTCGGCGCG
CTGGACGTGC TCGAAGGTTA TCTCGCCTAC TTGCGCAATG TCGTCGACAA TGCGCGCGGT
GGCCATATCC AGCCGCTCTA TGGCGTGCTC GGCGAAGCGA AGCTGGACGA AGGCCTTGCC
GAAAGGCTGC CCGGCTATCG CGCGATGGGG CCGGTCCGCA TCGGCAACGC AGCCTGGTCG
CAGGTTCAGC ACGATGCCTA TGGCCAGATC GTCCTGTCCA ACACGCAGGC GTTCCTTGAC
CAGCGCTTGC TGCGCATGTC CGGCCTCGCC GATTTTGAAG CGCTGGAAAA GGTCGGCGAA
AGGGCATGGG CCCTGTTCGA CAAGCCCGAT GCCGGCCTGT GGGAACTGCG CACCCGCCAG
TCGGTCCATA CATATTCGGC GGCGATGTGC TGGGCGGCCT GTGACCGGCT GGGCAACGCC
GCGCACGCGA TTGGCCTTGA TGACCGCGCG GCCTTCTGGG GCGAGCGCGC AGCCGCGATC
CGCGAGCGGA TAGAGCAGGC CGCGTGGTGT CCCGAGACCG AGCGGATGTC GGCCACGTTC
TCGGGCGACG ATCTCGATGC AAGCGTGATC CAGTTGCTCG ACCTGCGCTT CCTCGCGCCG
GACGATCCGC GCTTCGTCTC CACGCTGGCC GCCATCGAAC AGGGCCTGCG GCGCGGATCG
CACATGCTGC GCTATGCCAC CGAGGACGAT TTCGGCCTTC CCGAAACCGC CTTCAACGTC
TGCACCTTCT GGCTGATAGA AGCCCTGCAC CTGACCGGAC GCCGTGCCGA AGCCCGCGCG
CTCTACGAAG AGATGCTCAG CCGCCGCACC CAGTCGGGCC TCCTTTCGGA GGACATCGAT
CCGGCAACCG GCGAACTCTG GGGAAACTAC CCGCAGACCT ACTCACTTGT CGGCCTGATC
AACTGCGCCG TCCTGCTGAG CAAACCGTGG AATACCGTAC GATGA
 
Protein sequence
MTASLDLWPI GNCQVSALVD TSGAFVWGCI PRVDGDPFFS ALLGGEMPRE GLWAIDLEDR 
LETTQSYLRN TPILVTRHRD ANGGEIEVLD FCPYLPRNGR TYRPVAYARI VRPIAGSPRI
RMRLRPTCGW GNTCRMTIGG SNHIRYLSEA MTMRLTTSAP VGLVAEERAF RLEQAHYFFL
GPDESFSGNL AETLERMLEA TAAEWRHWVR GLATPVEWQD VVIRSAITLK LCQHEETGAI
VAALTTSIPE HAGSQRNWDY RYCWIRDAYY TVQALNRLGA LDVLEGYLAY LRNVVDNARG
GHIQPLYGVL GEAKLDEGLA ERLPGYRAMG PVRIGNAAWS QVQHDAYGQI VLSNTQAFLD
QRLLRMSGLA DFEALEKVGE RAWALFDKPD AGLWELRTRQ SVHTYSAAMC WAACDRLGNA
AHAIGLDDRA AFWGERAAAI RERIEQAAWC PETERMSATF SGDDLDASVI QLLDLRFLAP
DDPRFVSTLA AIEQGLRRGS HMLRYATEDD FGLPETAFNV CTFWLIEALH LTGRRAEARA
LYEEMLSRRT QSGLLSEDID PATGELWGNY PQTYSLVGLI NCAVLLSKPW NTVR
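
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup together with a GC-percentage helper; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first and last rows of the gene sequence are included here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First and last rows of the gene sequence listing, copied as printed.
first_row = "ATGACAGCTT CCCTCGATCT CTGGCCAATC GGCAACTGCC AGGTTTCGGC GCTGGTCGAT"
last_row = "AACTGCGCCG TCCTGCTGAG CAAACCGTGG AATACCGTAC GATGA"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

print(len(first), first[:3])  # prints: 60 ATG
print(len(last), last[-3:])   # prints: 45 TGA
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value to compare against the GC content field in the record.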