Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2159 |
Symbol | |
ID | 3918824 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2297182 |
End bp | 2298522 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444914 |
Product | TraB pilus assembly |
Protein accession | YP_497432 |
Protein GI | 87200175 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATG CAACCCATAC GCCAGGCACT GCCCCTTTGC AGGCCGAGAG CGCAGCATCG TCCGAACTTT CCGGGCTCAA CGCCCGCACC GCGCGCCGTC AGAAGCTGCT GCTGGGGTCG CTGGGAGCAC TCGCCCTCAT CGGCGGAAGC TGGTTCATCC TTGGCGGCGA CGACAAGGCC AAGACCGGCG ATCCCAATGC GGCGCAGACG ATCGACACGG CAGGCCTGGT CAACCGCGAC CTATCCCAGC GCGAGTTCGT CGCGACTTAT GGCAACCGGC TGGATGCGGT CACGCGCGAA CAGAAGGCGC TGAAAGACGG CAGCATTCCG CGAAGCGAGA TCGAAGCGCA GCTGGCGGCT CTCAAAGCCG AGAACCAGGC CATGCGTGTC GACGGACAGG CTGCGATCGA TGCTATCTCG GCAGAGAACG CCGAGCTCAA GACCCGGCTT GCCGCGCAGC CTGCATCACC GGCACCGGCT GTACCGCCAC CGGCTTATGG GCCGCAGGCA GGCGGCTATG ACGCAAGGGC CCGGTCACCC CAAACCAGTA CCGGCGCCGC AGCAGGCGAT CCGCAGGGCG GCATGATCCC TTCGCCTGGC GAGGTGAAGC TGATGAGCTT CAGCTCCGAC AAGGCAGCGA CCAATGGCCT CCGTGTGGGC AGGCCCGATG CGCCACCGGT CGTGGTCGAA GATTCGCCTG ATTACCTGCC GCCCAACTCC TACGCGCCGG CACGCGTAAT CGTCGGCGTC GATGCCTCGG CAGGCGTCGC CAGCCAGACC GATCCGCTGC CTGTGGTGCT TCGGATTACC GGCCCTGCTC GTTCGGTCAT GCAGAACGGA AAGGTTTTGA CCACCCGGAT CCAGGGCTGC GTTGTCAACG GCGCGGCACG CGGGGATCTC AGCAGTGAGA AGGTCTACGT GAAGCTCGCC CGCATGACCT GCGATCAGCC TGGCGGCAGG GTCGCAGTGA GCGAGGTCAA AGGCTTCATC AGCTTCGCTG GCAAGTCCGG GGTCCGCGGC CGCGTCGTCA GCCGCGAAGG TAGCCTCGTC AGCCAGGCCC TGCTGGCCGG GATCGTCGGC GGCTTCGGGC GCGGCTTCTC GGCCAATGCC AACAGTGTCT TTTCCGGCGT CACGACCAAC CCTGACGGCA GTCGCTCCAA GCTCTCGGCC GGCGACATTC TCGGTGGCGG GCTTGGCCAG GGTGCCGCCG ACGCTGCCGA CACGGTCAGC AAATACCTGA TCGAGCGCGC CGAACAGTAC CAACCCGTCG TCGAGATGCC GACCGGCATC GATGTCGAGA TCGTGTTCCT CGACGGCGTC TACGTGAGGA ACTCCCAATG A
|
Protein sequence | MTDATHTPGT APLQAESAAS SELSGLNART ARRQKLLLGS LGALALIGGS WFILGGDDKA KTGDPNAAQT IDTAGLVNRD LSQREFVATY GNRLDAVTRE QKALKDGSIP RSEIEAQLAA LKAENQAMRV DGQAAIDAIS AENAELKTRL AAQPASPAPA VPPPAYGPQA GGYDARARSP QTSTGAAAGD PQGGMIPSPG EVKLMSFSSD KAATNGLRVG RPDAPPVVVE DSPDYLPPNS YAPARVIVGV DASAGVASQT DPLPVVLRIT GPARSVMQNG KVLTTRIQGC VVNGAARGDL SSEKVYVKLA RMTCDQPGGR VAVSEVKGFI SFAGKSGVRG RVVSREGSLV SQALLAGIVG GFGRGFSANA NSVFSGVTTN PDGSRSKLSA GDILGGGLGQ GAADAADTVS KYLIERAEQY QPVVEMPTGI DVEIVFLDGV YVRNSQ
|
| |