Gene Information |
Locus tag | Saro_1784 |
Symbol | |
ID | 3918343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1881817 |
End bp | 1883664 |
Gene Length | 1848 bp |
Protein Length | 615 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444525 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_497058 |
Protein GI | 87199801 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
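The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon. Both assumptions are consistent with the listed values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information rows above.
START_BP = 1881817
END_BP = 1883664
GENE_LENGTH_BP = 1848
PROTEIN_LENGTH_AA = 615


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming the CDS includes its stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)

print(computed_gene_length == GENE_LENGTH_BP)        # True
print(computed_protein_length == PROTEIN_LENGTH_AA)  # True
```

Under these assumptions, 1883664 - 1881817 + 1 = 1848 bp, and 1848 / 3 - 1 = 615 aa, matching the Gene Length and Protein Length rows.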
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.272247 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGCCT ATCGTTCGCG CACGACCACC CACGGCCGCA ACATGGCAGG TGCACGCGGC CTCTGGCGCG CTACCGGCAT GAAGGACAGC GATTTCGGCA AGCCGATCAT CGCCGTCGTG AACTCGTTTA CCCAGTTCGT CCCCGGCCAC GTCCACCTCA AGGACCTGGG CCAGATGGTC GCCCGCGAAA TCGAAGCAGC GGGCGGCGTC GCGAAGGAAT TCAACACCAT CGCGGTCGAC GATGGCATCG CCATGGGCCA TGACGGCATG CTCTATTCGC TGCCCAGCCG CGATCTGATC GCCGACAGCG TCGAATACAT GGTCAACGCC CATTGCGCCG ACGCGATGGT CTGCATCTCC AACTGCGACA AGATCACCCC CGGCATGCTC ATGGCGGCAA TGCGGATCAA CATCCCGGTG GTCTTCGTGT CCGGCGGCCC GATGGAGGCC GGCAAGGTCA TCCTCAAGGG CAAGGAACAC GCGCTCGACC TCGTCGATGC GATGGTCGCC GCCGCAGACG AAAGCTTCAC CGACGAGGAA GTGACCGCCA TCGAGCGTTC GGCCTGCCCG ACCTGCGGTT CGTGCTCGGG CATGTTCACC GCCAATTCGA TGAACTGCCT GACCGAGGCG CTTGGACTTT CGCTGCCCGG CAACGGATCG ACCCTTGCCA CCCACGCCGA TCGCCAGCGC CTGTTCCTCG AAGCAGGCCG CCTCGTGGTC GACCTGTGCA AGCGCTACTA CGAGCAGGAC GACGAAAGCG TCCTGCCGCG TTCGATCGCC ACGTTCGAGG CTTTCGAGAA CGCGATGAGC CTCGACATCG CGATGGGCGG TTCGACCAAC ACCGTGCTGC ATCTGCTGGC CGCCGCGCAC GAGGCGGGCG TCAACTTCAC CATGTCCGAC ATCGACCATC TCAGCCGCAA GGTGCCGTGC CTGTCAAAGG TCGCTCCGGC AAAGTCCGAT GTGCACATGG AAGACGTCCA CCGCGCAGGT GGCATTTATG CCATCCTCGG CGAACTGGAC CGCGCGGGCC TGCTTCATAC CCATCTGCCG ACCGTCCACA GCCGCACGCT TGGCGATGCG CTGAACCAGT GGGACGTGAA GCGCACCAAC TCGCCCACGG TGCAGGAATT CTTCCGCGCC GCGCCGGGCG GCGTGCCGAC GCAGGTTGCC TTCTCGCAGG ATCGCCGCTG GAAGGAACTC GATCTGGACC GCGAAACGGG CGTGATCCGT TCGGCCGAGC ATGCGTTCTC GAAGGACGGC GGCCTTGCCG TGCTGTTCGG TAACATCGCC CGCGAGGGTT GCATCGTGAA GACCGCGGGC GTGGACGACA GCATCCTGAA GTTCACCGGT CCGGCAAAGG TCTATGAAAG CCAGGATGCC GCCGTCACCG CGATCCTCAC CGGGCAGGTG ACCTCGGGCG ACGTGGTCGT GATCCGGTAC GAAGGGCCCA AGGGCGGCCC CGGCATGCAG GAAATGCTCT ATCCGACGAG CTACCTGAAG TCGAAGGGCC TGGGTGCGGC CTGCGCGCTG CTGACCGATG GCCGTTTCTC GGGCGGCACG TCGGGCCTGT CCATCGGCCA CGTCTCGCCG GAGGCTGCCG AGGGCGGGGA GATCGGTCTG GTCGAGAACG GCGACGTGAT CGAGATCGAC ATTCCGAACC GCACCATCCA CCTTGCCGTG GCGGACGATG TCCTTGCCCA GCGCCGGGCG GAGCAGGAGG CGAAGGGCTG GAAGCCGGTC AAGGAACGCC CGCGCAAGGT GTCGACCGCG CTCAGGGCCT ATGCCGCGAT GACCACCAGC 
GCGGCGCGCG GTGCGGTGCG CGATCTTTCG CAACTCAAGA TCGACTGA
|
Protein sequence | MPAYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQMV AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRDLI ADSVEYMVNA HCADAMVCIS NCDKITPGML MAAMRINIPV VFVSGGPMEA GKVILKGKEH ALDLVDAMVA AADESFTDEE VTAIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHADRQR LFLEAGRLVV DLCKRYYEQD DESVLPRSIA TFEAFENAMS LDIAMGGSTN TVLHLLAAAH EAGVNFTMSD IDHLSRKVPC LSKVAPAKSD VHMEDVHRAG GIYAILGELD RAGLLHTHLP TVHSRTLGDA LNQWDVKRTN SPTVQEFFRA APGGVPTQVA FSQDRRWKEL DLDRETGVIR SAEHAFSKDG GLAVLFGNIA REGCIVKTAG VDDSILKFTG PAKVYESQDA AVTAILTGQV TSGDVVVIRY EGPKGGPGMQ EMLYPTSYLK SKGLGAACAL LTDGRFSGGT SGLSIGHVSP EAAEGGEIGL VENGDVIEID IPNRTIHLAV ADDVLAQRRA EQEAKGWKPV KERPRKVSTA LRAYAAMTTS AARGAVRDLS QLKID
|
| |
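The gene and protein sequences above can be related by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own method. It uses only the first three and last five blocks of the gene sequence as excerpts, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ in permitted start codons). The helper name `translate` is illustrative.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the Gene sequence row (first 30 nt and last 48 nt).
first_blocks = "ATGCCCGCCT ATCGTTCGCG CACGACCACC"
last_blocks = "GCGGCGCGCG GTGCGGTGCG CGATCTTTCG CAACTCAAGA TCGACTGA"

print(translate(first_blocks))  # MPAYRSRTTT
print(translate(last_blocks))   # AARGAVRDLSQLKID
```

The two outputs match the first ten and last fifteen residues of the Protein sequence row, and the final codon TGA is a stop, which is why the 1848 bp gene yields 615 residues. The last 48 nt can be translated on their own because the preceding 1800 nt are a multiple of three, so the excerpt starts in frame.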