Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2201 |
Symbol | |
ID | 3918867 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2341420 |
End bp | 2342457 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444956 |
Product | SMF protein |
Protein accession | YP_497473 |
Protein GI | 87200216 |
COG category | [L] Replication, recombination and repair [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake |
TIGRFAM ID | [TIGR00732] DNA protecting protein DprA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.00000000301868 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATCGAC CAAGCCACGA CGGACCGCCA GACCGTATCA GCGACGTCCA TGCGGCGACC GCGCTCACTC CGCCGAGCGC GGGCACGATC GACGAGGAGC GTGCCTTTCT CGCTCTCGCG ACGGTGCGGG GCATTGGTCA GAAAACCCTG TTCGCCATGG CCGACGACAA AAGATCGTTC AGCGATGCGC TCGAATATGG GCCAGAGGCG TCCAACCCTG CAGGCAGCGA CGGCAAGGCC ATCTCAGAGC GGCACTGGTC GCGGGTTCGC GGGCATGCGC TTGAACAGGG TGACCGCCTG GCCGAGCATC TCGAAGCGCT CGGCATCGGT CTGCTCTTTC GGGGGTCGCC CGGCTTTCCC TCCGCCTTGC TCGACCTTGA ACGTCCGCCG CACTGGCTTT TCGTGCAGGG CAGCGTCGAG CGCCTTGCCG AGCCGTCCAT TGCGGTCGTC GGTACCCGCA AGCCCAGCGC CGACGGCTTC TTCCTGTCAC GCTATGTGGG CGCTTGTCTC GGCGAATGGG GTGTACCGAC CGTCAGTGGC CTCGCGGCCG GCATCGATCA GCTGGCGCAT GAACACTCGC TACGCGCTGG CGTGCCGACG ATCGCGGTGC TGGGCACCGG CATGCTCGAA GACTATCCCA AGGGCTCAGG TCGACTGCGC GATCATATTC TGGCGACCGG CGGCGCGATC GTCAGCGAGT ATCTACCAAC AGCGTCCTAC AGCGCCGAGA ATTTCGTCCA GCGCAACCGG CTCCAGGCGG CGCTCGGCCG GATCCTGATC CCAGCCGAAT GGAATCGCCG CAGCGGCACG GCCCATACGG TCCGCTTCGC GACCGCGCTT GGGCGGCCTA TTGCCTGCCT GCGCTTGCCT GAGTGGCCGG ACGAGCGCGT AGTGCTGGAG CGTGGCATGG GGCTTCCGAC CGGCGAAATC TTCACCGTAC CGCACGACCA GGGACGGTTC GACGCCTTCG TCCGGTCGGC GATCGGCAAG TCTTCACCCG CTCAGTTGGG CCAACTTTCG CTATTTGGGG ATAGCTAG
|
Protein sequence | MDRPSHDGPP DRISDVHAAT ALTPPSAGTI DEERAFLALA TVRGIGQKTL FAMADDKRSF SDALEYGPEA SNPAGSDGKA ISERHWSRVR GHALEQGDRL AEHLEALGIG LLFRGSPGFP SALLDLERPP HWLFVQGSVE RLAEPSIAVV GTRKPSADGF FLSRYVGACL GEWGVPTVSG LAAGIDQLAH EHSLRAGVPT IAVLGTGMLE DYPKGSGRLR DHILATGGAI VSEYLPTASY SAENFVQRNR LQAALGRILI PAEWNRRSGT AHTVRFATAL GRPIACLRLP EWPDERVVLE RGMGLPTGEI FTVPHDQGRF DAFVRSAIGK SSPAQLGQLS LFGDS
|
| |