Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2199 |
Symbol | |
ID | 3918865 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2339884 |
End bp | 2340771 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640444954 |
Product | SMF protein |
Protein accession | YP_497471 |
Protein GI | 87200214 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000000109566 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCTCG CCCTAAACGA TTCAGGGCCA GCGCTGGCGC CAATCTCGCC GCGCCGAGAG CTCGGCGCCT ATGAGGCCCT GTGGCTCGAA AAGGGGGCGA CCTTCAAAAC CCTGGCCGAT CGCTTTGCGC TCGATGCCGA AGCGCTTCCG TCCGACTTCG TGCCCGCCCA GCTTGCCGAG CAGTGCGCGG CCGAGGTGAT GGCAAAGCTC AAGAAGGCCG GTGTCCATCA GTTTGGCGTC CGCATCCATC ATGCCGGTGA CTATCCCGCA AAGCTGCGCG ACGCGCGCCA CCCAGTCGAG CTTCTCTATT ATCGCGGCGC CTGGGAGATC ACCGAAACCC GGTGCGTGGC CGTCGTCGGA AGCCGCGAGG CCTCGCCCGA CGGTATCCGT CGCGCCGAGC GGCTTGCGCG CGAACTCGTC GATCGCGATT TCACGGTCGT CTCTGGCCTT GCCAAGGGCG TCGATTCGGC TGCCCATCGC GGCGCGATCG CGCGCGGTGG ACGCACCATT TCCGTGATCG GGACGCCGCT TGGATCCTGC TACCCCAAGG AGAATGCCGA TCTGCAAGAG GAGATCGCCC GCGATCATCT GCTGATCTCG CAGGTGCCGG TTCTTCGCTA CGCCAAGCAA GCACCCCAGC ATAACCGCCT TTTCTTCCCC GAGCGCAATG TCACGATGAG CGCTCTCACC GAGGGCACGA TCATCGTCGA GGCTGGCGAT ACGTCGGGCA CGCTGACCCA GGCGCGCGCC GCGCTCCATC AGGGCCGCAA GCTCTTCATT CTCGACAATT GCTTTCAGCG GACGGACATC ACGTGGCCAG CCCGCTTCGA AGCCGAAGGT GCAGTGCGCG TGAAGACGCC CGACGACATC TGGAGCGCCC TTGGTTGA
|
Protein sequence | MRLALNDSGP ALAPISPRRE LGAYEALWLE KGATFKTLAD RFALDAEALP SDFVPAQLAE QCAAEVMAKL KKAGVHQFGV RIHHAGDYPA KLRDARHPVE LLYYRGAWEI TETRCVAVVG SREASPDGIR RAERLARELV DRDFTVVSGL AKGVDSAAHR GAIARGGRTI SVIGTPLGSC YPKENADLQE EIARDHLLIS QVPVLRYAKQ APQHNRLFFP ERNVTMSALT EGTIIVEAGD TSGTLTQARA ALHQGRKLFI LDNCFQRTDI TWPARFEAEG AVRVKTPDDI WSALG
|
| |