Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2120 |
Symbol | |
ID | 3918783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2258634 |
End bp | 2260151 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640444873 |
Product | TPR repeat-containing protein |
Protein accession | YP_497393 |
Protein GI | 87200136 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAATC TGCGTGGGTT GATCGTGCCG CTGGTGTTGC CGTTCGTGCT CGCCGGATGC GGGGCGAGCC CCGAGGAGCG GGCCGAGCGA GCACGCAAGG CATTCGAGAC GCATGATTTC CGTGCGGCGC AGGTTGATAT TGCGGCGGCG CTGGAGGCGA AGCCCGGCGA TGCCGCGCTG ATCGAATTGC AGGCGCGCAA TGCGCTGGCG CTGGGTGACG GCATCGCGGC CGAGGCTGCG CTTTCCCGCC TGGCGGAAGG GCAGCGACCG GCTGACTTCG CGCAACTGAT GGGCGAGGCG GCGCTGCTGC GGCAGATGCC CGACGAGGCG CTTTCCGCTA TCGGCAACGA CGCATCGCCC TCCGCACAAC GTATCCGCGC GCTGGCCATG CTCGCCAAGG GCGATCGCGC GACGGCGGAG GCGGCGTTCG CGGCGGGGGC GCAAGGGGCA AGCGATGCAC GCCTGCTGGC GGACTACGCG CGGTTCAGCC TGATGGGGGG AGACGTGGCA AAGGCGCGAG CGCTTGCCGA CCGGGCGGTC AAGGCTGCCC CGGATCTCAT CGATACGCTG CTGGCGGACG CCGAAGTATC GGTCGCGCAG GGCAAGCTGG CGCAGGCCCT GGCGACCTAT GACAGGGCGG CGAAGGACTG GCCGGGCAAT CTTGCCGCGC TGGCCGGGAA GGCGGCGGTG CTGGGCGACC TGGGACGGAC GAAAGATATG GAGGCGGTCC TCGCCTCGCT GGCCGAAGTG AAGGGCGGCG GGCAGGTCGC CTATCTCCAG GCGCGCGCGG CTGCGGCGCG GGGGGATTGG AGCACCGTGC GCAGCGTGCT TCAGGCCAAC GAGAAGGCTC TGGAAGGCAA GGACGAGGCG ACCGTGCTCT ATGCCCAGGC GCTGGTGGCG CTGAAGCAGC CGGAGCAGGC GCGCGCCCGG CTCCAGCCAC TGCTGACGCG CAATCCGCAA AGCGCGATGA TACGGCGCGA ATTGGCCAAG GCCCAGCTCG CCGCAGGCGA TGCGCGCGGC GCGGTCGAGA CGATGCGGCC GTTTGCCGAA GTGCAGACCG CCGATGCGGA AGACTTGCGC CTGCTGGCAA GGGCCGCGGC GGCTTCAGGC GACCCGGAAG CGGCGAAGCT GGCCGAGAAG GCGAAGTATC CTTCACCCCA GGCACTGGCG GCGACCTTGG CGCAGGCCGA TACGGCGATG AAGCAGGGCA ACTGGGGCAA TGCCGTTGCC GCCTACGATC GCATCCTGGC GGTGACGGAT GGTTCGAACG CGCTGGTGCT GAACAACATG GCCTATGCGC AGGGGCAATT GGGCAACAGT GCCAAGGCGC TGGACTTCGC GGAACGTGCG CTGAAAGCGG CGCCGGGCAA TGCCTCGGTC ATGGACACAC TGGGCTGGCT GCTGGTCGAG AGCGGCAAGG ACAAGGCGCG CGGGCTGAAG CTGTTGCAGG ATGCGGCGGC CAAGGCGCCG GGCAATGCAG CGATCCGCCA GCACCTCGAC AAGGCGCGGC AGGGCTAG
|
Protein sequence | MKNLRGLIVP LVLPFVLAGC GASPEERAER ARKAFETHDF RAAQVDIAAA LEAKPGDAAL IELQARNALA LGDGIAAEAA LSRLAEGQRP ADFAQLMGEA ALLRQMPDEA LSAIGNDASP SAQRIRALAM LAKGDRATAE AAFAAGAQGA SDARLLADYA RFSLMGGDVA KARALADRAV KAAPDLIDTL LADAEVSVAQ GKLAQALATY DRAAKDWPGN LAALAGKAAV LGDLGRTKDM EAVLASLAEV KGGGQVAYLQ ARAAAARGDW STVRSVLQAN EKALEGKDEA TVLYAQALVA LKQPEQARAR LQPLLTRNPQ SAMIRRELAK AQLAAGDARG AVETMRPFAE VQTADAEDLR LLARAAAASG DPEAAKLAEK AKYPSPQALA ATLAQADTAM KQGNWGNAVA AYDRILAVTD GSNALVLNNM AYAQGQLGNS AKALDFAERA LKAAPGNASV MDTLGWLLVE SGKDKARGLK LLQDAAAKAP GNAAIRQHLD KARQG
|
| |