Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2118 |
Symbol | |
ID | 3917766 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2256144 |
End bp | 2257658 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640444871 |
Product | TPR repeat-containing protein |
Protein accession | YP_497391 |
Protein GI | 87200134 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGAAACG TCAGGATACT TTCTCTCGCC CTGCTGCTGT CCGCCTGCGG AACCGATACG GCCGACCGCG TGGCCCGGGC CCGCTCGGAG ATCGCCGGGA TGGAACTGGC CGCGGCCCGC GTCGATCTGG CGGCGGCGCT TGCCGAGCGG GGCGACGATG CCGAACTGCT GCGGTTGCTG GCCTCGGTGC AACTGCGCCT CGGCGACGGG GACGGGGCGG AAGCCACGGC GGCAAGGCTG GAACGGACCG GGGCAACGGG CGCGGAACTC GCGCGCATGA GAGCTGAAGC CGCACTGCTG CGCGGCCGTG CACGGGAAGC GCTGGCGCTT CTGGGCAACG ATGCCACGAC TGGCGGCTGG CGGGTGAGGG CCGCCGCGCT TTCCGCAGTG GGCGACGGGG AGGGAGCATT CAGGGCGCTG CAATCGGGTC TTGCCGCAGG GTCCGATCCG CTACTGCTGC GCGACATGGC GCGGTTCCTG ATCGATGCGC AGGACCTTGA TGGCGCACAG CGGCAGGTGG ATGCGCTGGC CCGGATGCAG GACGATGGTT TCGATGCCCT GATGCTTTCG GCAGACATCG CGGCGCGGCG CGGGCGCTAT GCCCAGGCCC ATGCCACGCT GGAACGCGCG GCAAAGCGCT ATCCGCGCAT TCCGGACCCG TGGATCGCCC GGGCCGATGC CTATGATCGA GAGGGCAAGC TCGACGAGGC GGTGGCAATG ACGGCGCGGG CGGCGGCCCT TGCGCCGGAC GATCCGCGCG TGACCAATCT CAAGGTCGAG TTCGCCGCGA TGAAGGGCGA CTGGGAGGCG GTCCGCACGG CGCTGGCGCG GCAGGAGGCA ACGCTCGACC CGCTGTCGGC CAACGGGCTC ACCTATGCCG AGGCGATGCT GCGGCTGGGG CGGCCGGAGC AGGCGCGGGC GATGTTCCAG CGCGCCCTCA CACGGTCGCC CAACAATCCG TACTCGCGGC TCATGCTTGC GGAAGCGCAT CTGGCGACGG GCAATGCCGT TGCCGCGCTG GAAACCGTGC GTCCGCTCGC GCAAAGCCTG ACGGCAGGTC CGCGCGAACT GGAACTTGCC GAGAAGGCGG CACGGGCGGC CAACAGCGGC GAGGCCGACG CGCTGGCGGC TCGGCTCGCG GCGGTCCGGA AGTCACAGGT CTCGGCGCTG GCTGCCAAGG GTCAGGCTGC GCTGGTCGGC GAAGACTGGA ATGGCGCGAT CGAAGCCTAC GGACAACTTG CGCAGATGGG CGAGGATGCG GACGTGCTGA AGCGGCTGGC GCTGGCGCTG AGCCACGCCG GACGGGTCGA CGAGGCCATC AGGGCAGCGG ACCGTGCACG GACCCTTCGG CCCGGCGATC CGGACATGAG CTACATGGCC GGGTATGTGC GCGTCGCGGG CGGGAAGGAC AAGGCCACCG GGCTTGGTCT GCTCCGCCAC GCGACCGAGT CCGCGCCGGA CAATCTGGTC TTCAAGCGGG CGCTTGCGCG GTATTCGGCG GCTGGCGGCG CCTGA
|
Protein sequence | MRNVRILSLA LLLSACGTDT ADRVARARSE IAGMELAAAR VDLAAALAER GDDAELLRLL ASVQLRLGDG DGAEATAARL ERTGATGAEL ARMRAEAALL RGRAREALAL LGNDATTGGW RVRAAALSAV GDGEGAFRAL QSGLAAGSDP LLLRDMARFL IDAQDLDGAQ RQVDALARMQ DDGFDALMLS ADIAARRGRY AQAHATLERA AKRYPRIPDP WIARADAYDR EGKLDEAVAM TARAAALAPD DPRVTNLKVE FAAMKGDWEA VRTALARQEA TLDPLSANGL TYAEAMLRLG RPEQARAMFQ RALTRSPNNP YSRLMLAEAH LATGNAVAAL ETVRPLAQSL TAGPRELELA EKAARAANSG EADALAARLA AVRKSQVSAL AAKGQAALVG EDWNGAIEAY GQLAQMGEDA DVLKRLALAL SHAGRVDEAI RAADRARTLR PGDPDMSYMA GYVRVAGGKD KATGLGLLRH ATESAPDNLV FKRALARYSA AGGA
|
| |