Gene Information |
Locus tag | Saro_1891 |
Symbol | |
ID | 3917112 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2001334 |
End bp | 2002968 |
Gene Length | 1635 bp |
Protein Length | 544 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640444635 |
Product | alpha amylase, catalytic region |
Protein accession | YP_497165 |
Protein GI | 87199908 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
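The coordinate and length fields above are mutually consistent, which the short sketch below checks. It assumes the start and end coordinates are 1-based and inclusive (an assumption, though it agrees with the stated 1635 bp) and that the final codon is a stop codon that contributes no residue. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2001334
end_bp = 2002968

# Assumption: coordinates are 1-based and inclusive, so add 1 to the difference.
gene_length_bp = end_bp - start_bp + 1

# Each codon is 3 bp; the last codon is assumed to be a stop codon,
# which does not add an amino acid to the protein.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 1635 544
```

Both results match the Gene Length and Protein Length rows of the record.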
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.687574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCAGA CCGAAACAGC CATTGCCGAC CCCAAAATGC ACACGGAGAC GCCGTGGTGG CGCGGTGCGG CGATCTATCA GATCTATCCA CGCAGCTTTT GCGATTCCAA CGGCGACGGC ATCGGGGACC TGAATGGCAT CGCCTCGCGA CTGGATCACG TCGCGCGCCT TGGCGTAGAC GCGATCTGGA TCTCGCCGTT CTTCACCTCG CCGATGAAGG ACTTCGGCTA CGACGTCGCC GACTACTGCG ACGTCGATCC GATCTTCGGG ACGCTGGCGG ATTTCGATGC CCTGGTAAAA CGCGCGCACG AGCTGGGCCT CAAGGTCACG ATCGACCAGG TCTATGCGCA TACCTCGGAC ATTCATCCGT GGTTTGCCGA AAGCCGGCAG GACCGTACCA ACGCCAGGGC GGACTGGTAT GTCTGGGCCG ATCCTAAGCC CGATGGGTCG CCCCCGTCGA ACTGGCAGTC GGTATTCGGC GGTCCGGCAT GGACGTGGGA TGCGCGGCGC TGCCAGTATT ACCTGCACAA CTTCCTATCC AGCCAGCCCC AGGTAAACGC CCACAATCCC GAAGTGCAGG AAGCGCTGCT CGGGGCGATG AAGTTCTGGC TGGACCGGGG TGTTGACGGC TTCCGCCTCG ATGCCCTGAA CTTCCTGATG CATGACCCGA CCTTGCGGGA CAATCCGCCG GCGCCGGACG ATGGCCGACG CAAGACGCGG CCATTCGACT TCCAGCTCAA GATCTACAAC CAGTCGCACC CCGACATCCT GAAGTTCATC CAGCGGGTGC GGAACCTGTG CGACGACTAT GGCGCGGTGT TCACAGTCGC GGAAGTCGGC GGAGACCTTG CCGAGACGGA GATGAAGGCG TTCACTGCGG GGGACAGGCA TCTCAACAGC GCCTACGGCT TCGACTTCCT CTACGCGGAC AGGCTGACGC CGCATTTCGT GGAAAAGGCC GTGGCCAAAT GGCCCGATGC ACCCGGCATG GGCTGGCCGA GCTGGGCCTT CGAGAACCAC GATGCGCCGC GCGCGCTTTC GCGCTGGTGC GCGCCGGACC AGCGCGAGCC GTTCGCCCGG CTGAAGGCCA TGCTCTTCGC TTCGTTGCGG GGGAACATCA TCGTCTACCA GGGCGAGGAA CTGGGACTGA CGCAGGTCGA CATTCCGTTC GAGCAGTTGC AGGACCCCGA GGCCATCGCC AATTGGCCGC TGACGCTTTC ACGCGACGGT GCACGCACGC CGATGCCCTG GTTAGTACAA TCGGGCGAGG GCGGGTTCAC GTCGGGCGCG CCCTGGTTGC CGCTGGGCGA GGAAAACCTG TCGCGGGCCG TCGACCGGCA GGAAGGCGAT CCGGCATCGC TCTTGAACCT GACCACGCGC CTGCTTCGCC TGCGCCGCGA GACGCCCGCG TTACGGATCG GTTCCTTCGA GGTGATCCAT GCAGACGAAT GCCTTCTTGC CATTCGGCGC GTTTTGGGTG AGCAATCGAT TGCAGGGCTG TTCAACCTGT CCTCCGTCCC GGTGGTCTGG CCCCACGGGC TGGTGCGGGA GGGCAAGGAA ATGGCGTCGG TCAACGGCGC GACAGTGGGG CAGTTGCCGC CGTTCGGCGC GCTCCTGATC GAAGAGAGGA TCTGA
Protein sequence | MKQTETAIAD PKMHTETPWW RGAAIYQIYP RSFCDSNGDG IGDLNGIASR LDHVARLGVD AIWISPFFTS PMKDFGYDVA DYCDVDPIFG TLADFDALVK RAHELGLKVT IDQVYAHTSD IHPWFAESRQ DRTNARADWY VWADPKPDGS PPSNWQSVFG GPAWTWDARR CQYYLHNFLS SQPQVNAHNP EVQEALLGAM KFWLDRGVDG FRLDALNFLM HDPTLRDNPP APDDGRRKTR PFDFQLKIYN QSHPDILKFI QRVRNLCDDY GAVFTVAEVG GDLAETEMKA FTAGDRHLNS AYGFDFLYAD RLTPHFVEKA VAKWPDAPGM GWPSWAFENH DAPRALSRWC APDQREPFAR LKAMLFASLR GNIIVYQGEE LGLTQVDIPF EQLQDPEAIA NWPLTLSRDG ARTPMPWLVQ SGEGGFTSGA PWLPLGEENL SRAVDRQEGD PASLLNLTTR LLRLRRETPA LRIGSFEVIH ADECLLAIRR VLGEQSIAGL FNLSSVPVVW PHGLVREGKE MASVNGATVG QLPPFGALLI EERI
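The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before use. The following is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons), and it does not handle ambiguous bases such as `N`.

```python
# Codon order used by the NCBI genetic code tables: T, C, A, G at each position.
BASES = "TCAG"
# Amino-acid assignments shared by the standard code and table 11 ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        # str.index raises ValueError for any character other than T, C, A, G.
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence in the record.
first_blocks = "ATGAAGCAGA CCGAAACAGC CATTGCCGAC"
last_blocks = "GAAGAGAGGA TCTGA"

print(translate(first_blocks))  # MKQTETAIAD
print(translate(last_blocks))   # EERI
```

The two outputs match the first and last blocks of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content row.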