Gene Saro_1784 details


Gene Information

Locus tag: Saro_1784
Symbol:
ID: 3918343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1881817
End bp: 1883664
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 11
GC content: 65%
IMG OID: 640444525
Product: dihydroxy-acid dehydratase
Protein accession: YP_497058
Protein GI: 87199801
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
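
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon (the listed gene sequence ends in TGA); the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1881817
end_bp = 1883664

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(f"gene length: {gene_length} bp, protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the 1848 bp and 615 aa reported in the record.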


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.272247
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGCCT ATCGTTCGCG CACGACCACC CACGGCCGCA ACATGGCAGG TGCACGCGGC 
CTCTGGCGCG CTACCGGCAT GAAGGACAGC GATTTCGGCA AGCCGATCAT CGCCGTCGTG
AACTCGTTTA CCCAGTTCGT CCCCGGCCAC GTCCACCTCA AGGACCTGGG CCAGATGGTC
GCCCGCGAAA TCGAAGCAGC GGGCGGCGTC GCGAAGGAAT TCAACACCAT CGCGGTCGAC
GATGGCATCG CCATGGGCCA TGACGGCATG CTCTATTCGC TGCCCAGCCG CGATCTGATC
GCCGACAGCG TCGAATACAT GGTCAACGCC CATTGCGCCG ACGCGATGGT CTGCATCTCC
AACTGCGACA AGATCACCCC CGGCATGCTC ATGGCGGCAA TGCGGATCAA CATCCCGGTG
GTCTTCGTGT CCGGCGGCCC GATGGAGGCC GGCAAGGTCA TCCTCAAGGG CAAGGAACAC
GCGCTCGACC TCGTCGATGC GATGGTCGCC GCCGCAGACG AAAGCTTCAC CGACGAGGAA
GTGACCGCCA TCGAGCGTTC GGCCTGCCCG ACCTGCGGTT CGTGCTCGGG CATGTTCACC
GCCAATTCGA TGAACTGCCT GACCGAGGCG CTTGGACTTT CGCTGCCCGG CAACGGATCG
ACCCTTGCCA CCCACGCCGA TCGCCAGCGC CTGTTCCTCG AAGCAGGCCG CCTCGTGGTC
GACCTGTGCA AGCGCTACTA CGAGCAGGAC GACGAAAGCG TCCTGCCGCG TTCGATCGCC
ACGTTCGAGG CTTTCGAGAA CGCGATGAGC CTCGACATCG CGATGGGCGG TTCGACCAAC
ACCGTGCTGC ATCTGCTGGC CGCCGCGCAC GAGGCGGGCG TCAACTTCAC CATGTCCGAC
ATCGACCATC TCAGCCGCAA GGTGCCGTGC CTGTCAAAGG TCGCTCCGGC AAAGTCCGAT
GTGCACATGG AAGACGTCCA CCGCGCAGGT GGCATTTATG CCATCCTCGG CGAACTGGAC
CGCGCGGGCC TGCTTCATAC CCATCTGCCG ACCGTCCACA GCCGCACGCT TGGCGATGCG
CTGAACCAGT GGGACGTGAA GCGCACCAAC TCGCCCACGG TGCAGGAATT CTTCCGCGCC
GCGCCGGGCG GCGTGCCGAC GCAGGTTGCC TTCTCGCAGG ATCGCCGCTG GAAGGAACTC
GATCTGGACC GCGAAACGGG CGTGATCCGT TCGGCCGAGC ATGCGTTCTC GAAGGACGGC
GGCCTTGCCG TGCTGTTCGG TAACATCGCC CGCGAGGGTT GCATCGTGAA GACCGCGGGC
GTGGACGACA GCATCCTGAA GTTCACCGGT CCGGCAAAGG TCTATGAAAG CCAGGATGCC
GCCGTCACCG CGATCCTCAC CGGGCAGGTG ACCTCGGGCG ACGTGGTCGT GATCCGGTAC
GAAGGGCCCA AGGGCGGCCC CGGCATGCAG GAAATGCTCT ATCCGACGAG CTACCTGAAG
TCGAAGGGCC TGGGTGCGGC CTGCGCGCTG CTGACCGATG GCCGTTTCTC GGGCGGCACG
TCGGGCCTGT CCATCGGCCA CGTCTCGCCG GAGGCTGCCG AGGGCGGGGA GATCGGTCTG
GTCGAGAACG GCGACGTGAT CGAGATCGAC ATTCCGAACC GCACCATCCA CCTTGCCGTG
GCGGACGATG TCCTTGCCCA GCGCCGGGCG GAGCAGGAGG CGAAGGGCTG GAAGCCGGTC
AAGGAACGCC CGCGCAAGGT GTCGACCGCG CTCAGGGCCT ATGCCGCGAT GACCACCAGC
GCGGCGCGCG GTGCGGTGCG CGATCTTTCG CAACTCAAGA TCGACTGA
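
A GC content figure such as the one reported for this gene can be recomputed directly from the nucleotide sequence. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper name, and it assumes the sequence is plain A/C/G/T text with optional whitespace between the 10-base blocks.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks used to group bases in blocks of ten.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence only; paste the full sequence to
# compare against the GC content field of the record.
first_row = "ATGCCCGCCT ATCGTTCGCG CACGACCACC CACGGCCGCA ACATGGCAGG TGCACGCGGC"
print(f"{gc_content(first_row):.1f}% GC in the first 60 bases")
```

The value for a single row will generally differ from the whole-gene figure, since GC content varies along the sequence.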
 
Protein sequence
MPAYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQMV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRDLI ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAAMRINIPV VFVSGGPMEA GKVILKGKEH ALDLVDAMVA AADESFTDEE
VTAIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHADRQR LFLEAGRLVV
DLCKRYYEQD DESVLPRSIA TFEAFENAMS LDIAMGGSTN TVLHLLAAAH EAGVNFTMSD
IDHLSRKVPC LSKVAPAKSD VHMEDVHRAG GIYAILGELD RAGLLHTHLP TVHSRTLGDA
LNQWDVKRTN SPTVQEFFRA APGGVPTQVA FSQDRRWKEL DLDRETGVIR SAEHAFSKDG
GLAVLFGNIA REGCIVKTAG VDDSILKFTG PAKVYESQDA AVTAILTGQV TSGDVVVIRY
EGPKGGPGMQ EMLYPTSYLK SKGLGAACAL LTDGRFSGGT SGLSIGHVSP EAAEGGEIGL
VENGDVIEID IPNRTIHLAV ADDVLAQRRA EQEAKGWKPV KERPRKVSTA LRAYAAMTTS
AARGAVRDLS QLKID