Gene Saro_1890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1890 
Symbol 
ID3917111 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1999508 
End bp2001337 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content64% 
IMG OID640444634 
Productalpha amylase, catalytic region 
Protein accessionYP_497164 
Protein GI87199907 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTAGGA TATCCCTGAT TTCCGCGCTC GCTCTGGTCC TGTCCTTGGG GCAGCCGGCG 
TTGGCCGACG TGCCACTGGC CGAAGTGCGT GCGCGTCCCG CTGCAGGCGA AGTGGTCTAT
TTCCTGCTGC CCGACCGGTT CGATAACGGC GACAGGAAGA ACGACCTTGG CGGGCTCAAG
GGCGATCGGC TGAAGACCGG GTTCGACCCG TCGGCAAAGG GGTTTTATCA CGGCGGGGAC
ATCAAGGGCC TGCTCTCGCG GCTCGACTAC CTCCAGGACA TGGGCGTGAC GGCGCTGTGG
GTGGCGCCGG TGTTCCGCAA CAAGCCGGTA CAGGGCGCGC CCGGGCAGGA GAGCGCGGGG
TATCACGGAT ACTGGGTAAC CGATTTCACC CGGATCGATC CGCACTTCGG CAGCAATGCC
GATTTCAAGG CGCTGGTCGA CGCTGCCCAT GCTCGCGGCA TCAAGGTCTA CATGGACATC
ATCGCCAATC ACACGGCGGA CGTGATCCAC TATGCCGATG GCGATTACTC ATACCGCAAC
CGGGCCGACT GGCCCTATTC GCGCAAGGGT GGATTGTCCG GCAAGGCGAT CAACCAGGGG
TTTGCGGGCG ACGAGGACTC GAGCGCGGCG AACTTTGCGA AGCTGACCGA TCCCGGCGCG
GCATATCAGC CGTTGATCGA CCCGGCCGAG CGAAACGTGA AGGTCCCGGC CTGGCTGAAC
GATCCGATCT ACTACCATAA TCGCGGCAAC ACGACGTTCA CCGGCGAGGA CAGCCGGTTC
GGGGACTTTG CCGGACTGGA CGACCTGTTC ACCGAACATC CCCGCGTTCG TGCCGGGATG
GTTGAGATCT ATGCCGACTG GATCCGCAGG TTCGGCATCG ACGGCTACCG CGTCGATACG
GCCAAGCATG TCGATCCGGG CTTCTGGCAA GTGTTCACTC CGGAAATTCG CAAGGTCGCG
GCAGAGGCCG GCATACCGAA TTTCGCGGTG TTCGCTGAAA TCGCCAACGG CGGCTCGGAC
CCCGGCACGA TCGCGCGACA CACCCGCCGC GACGGCTTTC CGCAAGTGCT CGACTTCGCG
TTCCAGGAAG CGGTGCGGGA CCTTGTCGGC AAGGGTAAGG CGACGCGCGT GCTGGCAGAG
ACCTTTGACG GCGACGTACT CTATGAAGGT GGCGAGGCGG CGGCGCTCGC CATGCCAACT
TTCCTCGGCA ACCACGACAT GGGCCGTTTC TCGACCGCAG TCCGGCAGGA CCGCCCGGGC
ATTTCCGACA AGGAACTGCT CGACCGCGTC GCGCTGGCGC ACGTCATGCT GATGACCCTT
CGCGGTTCGC CGGTGATCTA CTACGGCGAC GAGCAGGGCT TTGTCGGCGA CGGCGGCGAC
CAGGACGCGC GCGAGGACAT GTTCCCGAGC CGCACGGCCA GCTACAACGA CAATCGCCTG
ATCGGGACGG CCAGCACGAC GGCTGCCGAC AATTTCGACG AGGGGCATCC CCTCTATCGG
CTGATCCGCG ATCTTGCCGT ACTGCGGAAG GCGCATCCCG CGCTGGCCCG GGGGCGGCAG
GTTACGCGAA CGTATTCGGA AGGGCCCGGA CTGTTTTCCG TCTCCCGCTT CGATCCGGAG
ACGGGTACGG AATACCTGAT AGCGTTCAAC ACATCGGACA AGCCACTTCG TGCAACCAGC
GTGATTGGCG CGTCGGCAAG CGGTCTCGAA GGGCTCTACG GCCAATGTCC GTTATCGGTG
GCGGCGCCGG GTTCGGTGCT GCTCGATCTC CCCGCATTTG GCAGCGTCGT CTGTCGTGTC
ATCCTCTCCC CGGAATCCAG CGTTCGATGA
 
Protein sequence
MRRISLISAL ALVLSLGQPA LADVPLAEVR ARPAAGEVVY FLLPDRFDNG DRKNDLGGLK 
GDRLKTGFDP SAKGFYHGGD IKGLLSRLDY LQDMGVTALW VAPVFRNKPV QGAPGQESAG
YHGYWVTDFT RIDPHFGSNA DFKALVDAAH ARGIKVYMDI IANHTADVIH YADGDYSYRN
RADWPYSRKG GLSGKAINQG FAGDEDSSAA NFAKLTDPGA AYQPLIDPAE RNVKVPAWLN
DPIYYHNRGN TTFTGEDSRF GDFAGLDDLF TEHPRVRAGM VEIYADWIRR FGIDGYRVDT
AKHVDPGFWQ VFTPEIRKVA AEAGIPNFAV FAEIANGGSD PGTIARHTRR DGFPQVLDFA
FQEAVRDLVG KGKATRVLAE TFDGDVLYEG GEAAALAMPT FLGNHDMGRF STAVRQDRPG
ISDKELLDRV ALAHVMLMTL RGSPVIYYGD EQGFVGDGGD QDAREDMFPS RTASYNDNRL
IGTASTTAAD NFDEGHPLYR LIRDLAVLRK AHPALARGRQ VTRTYSEGPG LFSVSRFDPE
TGTEYLIAFN TSDKPLRATS VIGASASGLE GLYGQCPLSV AAPGSVLLDL PAFGSVVCRV
ILSPESSVR