Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2334 |
Symbol | |
ID | 3915679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2481079 |
End bp | 2481999 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640445090 |
Product | putative exopolysaccharide biosynthesis protein |
Protein accession | YP_497605 |
Protein GI | 87200348 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG0489] ATPases involved in chromosome partitioning |
TIGRFAM ID | [TIGR03018] exopolysaccharide/PEPCTERM locus tyrosine autokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.018256 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGACC AGAGCAAGAT TCCCGTGCCG CCTTCCGGCC CTTCGCTGAT CGAGCGTGCC TCGGGCAGCT TCGATTTTCG CGCGCTGATC CGGCCGGTCG TGGCCGACCT CCCGCCTGCT CCCGCGCCAA GGATCGCTGC CGCGCCTTCG CCGGTGGCCG GCTCCGAAGC GCTGCCGCAG GTCCGTCCTT TCGGCGGTCC GCGCCACCCC ATCGACCGCG CGCACTTGCG CGAACAGTGC CTGATCGAGC CCGAGAGCGC CGTCACCGGC CTCCTCGAGG AGTTCCGCAT CGTGAAGCGC CAGCTCCTGC GCACCGCCGC CGAGACGCGC GGCAAGGGTC ACGGCGAGCG CATCCTCGTG GCCTCGGCGC ATCCGGGCGA GGGCAAGACC TTCTGCGCGG TCAACCTCGC ACTGTCGATG GCGGCGGAGA AGGACACCGA AGTCCTGCTC GTCGATGCCG ATTTCGCCAA GCCATCGGTG CTCTCGACGC TGGGCCTGCC CGGCGGCCCG GGCCTCATGG ATGCGCTGGC CGATCCGGGC ATCGCGGTCG AGGACTGCGT GATCGGCACC GACATCGCCG GGCTCTACGT GCTCCCGGCC GGCAACGTGA CCGGATCGGA CACCGAATAC CTTGCCTCGT CGCGGACGGA GGCGGTGCTG GCGCGGCTGA CGGCCAATGC GCCCAACCGC ATCGTCATCT TCGATTCGCC GCCCGTACTG GCCGCATCGC CCGCGACCGT CCTTGCCAAC CATGTCGGGC AGACGGTGAT GGTCGTGCGT GCCGACGTGA CCGGCGAGGC CGCGCTGCGT GACGCGGTGG GCCTGCTTTC GGCCTGCGAG GATATCAAGC TCCTGCTCAA CGGCACGCGC TTTTCCACCA CTGGCCGCCG TTTCGGCACC TACTACGGAT ACACGGAATG A
|
Protein sequence | MTDQSKIPVP PSGPSLIERA SGSFDFRALI RPVVADLPPA PAPRIAAAPS PVAGSEALPQ VRPFGGPRHP IDRAHLREQC LIEPESAVTG LLEEFRIVKR QLLRTAAETR GKGHGERILV ASAHPGEGKT FCAVNLALSM AAEKDTEVLL VDADFAKPSV LSTLGLPGGP GLMDALADPG IAVEDCVIGT DIAGLYVLPA GNVTGSDTEY LASSRTEAVL ARLTANAPNR IVIFDSPPVL AASPATVLAN HVGQTVMVVR ADVTGEAALR DAVGLLSACE DIKLLLNGTR FSTTGRRFGT YYGYTE
|
| |