Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_4009 |
Symbol | |
ID | 5077539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 176656 |
End bp | 178032 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640481114 |
Product | TraB pilus assembly family protein |
Protein accession | YP_001165776 |
Protein GI | 146275615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.702041 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTGGA AGAACCCGTT CCGCAAAGGC CCCAAGGCGC TCGACGGTCC CGCAACCCAC CCTGGAAGTG GTGACGCCTC GCCGCTCGGC GATGCAATCG TGGCCAATGA GGCGGTCAGG AAGAAGCAGA TGCTGCTGCT CGGCGCTGGC GGTCTTGCCG CGCTTGTCAT CGGCTCCTCG TGGATCTTCG CCGGCGATCA CAAGAGCAGC GACGCCAGCG TCAAGCGGCA GGTCGCCGAC GTCTCGGTCG ACGAGATGGT CAACAAGAAC ATGGCCGAGA AGGAGTGGCG TGCCCAGTCC GAGGCGCAGA TGATGTCGAT GGACACCAAC ATGCGCGCGC TCGCTGCCCG TGCGCAGCGG GCCGACCAGT TGGAAGCGCA GCTGGCCCAG GGTCAGGCAG CGCAGAGCAC GTCAGGCGGC GGGATATCTC CCGACACCGA GCGGGTGCTT TCCGCCTACC AGAACGAGAA TGAGCAGTTG AAGGCAGCGC TCGCAGCCGC GCGCCAGTCG CCGGTCATGG GTGCCGGCGC GACACCGGTC GGACCCAACG CGCTTTACGG GCGGACCAGC CCACCGAGCT ACCAGACCGC GCCCAGCACG CCTGCCGGGC AAGCCGCAAT GGCCGCGGCG GGACTCCCGG CAGGGCGCGG GAGCGAGGTC AGCCTGGTCT CGTTCAACGA GGGCGCAAGC GGAACCGGAA GCCCGGTCCC CAAGGGCAAC ACTGTCTTCA CCGACAGCGC CAATTACCTG CCGCCGAACT CGATCGCGGT CGCCAAGGTC ATAGTCGGGG TCGATGCGGC CGCCGGCGTC CAGAGTCAGA CCGATCCCCT GCCGGTCGTG CTGCGCATCA CCGGCCCTGC GCGCTCGGTC TACGACAACG GGCGGCTGCT CACGACCAAT ATCGCCGGAT GCCTGGTCAA CGGCGCGGCG CGCGGCGACC TCTCGAGCGA GAAGGTCTAC GTCAAGCTGC AGCGCATGAC CTGCCCGCAG CCCAATGGCC GTTACGCGGT CTCGGACGTC AAGGGCTTCA TCGCCTTCGG CGGCAAGACC GGGGTCAGGG GGAGGGTGGT CAGCCGGGAA GGTTCGCTGA TCGGCCAGGC CTTCCTTGCC GGTCTCGCCG GCGGCTTTGG CCGCGGCTTT GCCGCCAACA CCAATTCGAC GCTCACCGGA ACAAACGTCA ACGTCAATGG GCAGCGCCAG AAGCTCGGCA CCGGCGACAT CCTCGAGGGC GGGCTCGGCG AAGGCATCGC CACCTCGGGC GACATGGTCA GCAAGTACCT GATCGAGCGG GCCGAACAGT ACCAGCCCGT GATCGAGATG CCGACCGGGA TCGATGTCGA AATCGTGTTT CTCGAAGGCG TGTTCATCAA CGGGTGA
|
Protein sequence | MDWKNPFRKG PKALDGPATH PGSGDASPLG DAIVANEAVR KKQMLLLGAG GLAALVIGSS WIFAGDHKSS DASVKRQVAD VSVDEMVNKN MAEKEWRAQS EAQMMSMDTN MRALAARAQR ADQLEAQLAQ GQAAQSTSGG GISPDTERVL SAYQNENEQL KAALAAARQS PVMGAGATPV GPNALYGRTS PPSYQTAPST PAGQAAMAAA GLPAGRGSEV SLVSFNEGAS GTGSPVPKGN TVFTDSANYL PPNSIAVAKV IVGVDAAAGV QSQTDPLPVV LRITGPARSV YDNGRLLTTN IAGCLVNGAA RGDLSSEKVY VKLQRMTCPQ PNGRYAVSDV KGFIAFGGKT GVRGRVVSRE GSLIGQAFLA GLAGGFGRGF AANTNSTLTG TNVNVNGQRQ KLGTGDILEG GLGEGIATSG DMVSKYLIER AEQYQPVIEM PTGIDVEIVF LEGVFING
|
| |