Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_3134 |
Symbol | |
ID | 3918176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3344891 |
End bp | 3345964 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640445918 |
Product | OmpA/MotB |
Protein accession | YP_498403 |
Protein GI | 87201146 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins [COG3637] Opacity protein and related surface antigens |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.08024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGAAAC TGGTCCTGGG CGTTGCCTTG GCATCGAGTG CCTTGGCCTC GCCGGCACTT GCGCGGAATG ATTCCTGGTA CGTCGAAGCC GATGCCGGCG CGATGATCCT TGAAGATTCG GAGTTCAAAG TCGACGGTGT CGACGCTGCT GCCATCCTCG ACAGCCACAC CGGCTACGAT TTCGGTGGTA TTGTCGGTTA CGACTTCGGC GGCTTCCGCC TGGAAACGGA AGTCGGCTAT CGCCGCGCCT ACAACCAGGA AGTCGACATT GCCGGCACCG TCTATACCAA GCCGCAAGGT CATGCTGACG CCCTCAGCTT CATGGTCAAC GGCATGCTGG ACTTCGGTCC GGACGATGGC ATGCAGGGCT TCGTCGGTGG CGGCGTTGGC GTCGCTCGCG CGAAGTACTG GATCAACACG CCCGGCGGCG AGTTGAACGA TTCCGACACC GGCTTCGCCT GGCAGGCCAT TGCCGGTATC CGCGTCCCCG TTTCGGACAG CGTGGACGTC GGCCTGAAGT ACCGCTTCTT CAACGCCGAC AAGGTGGATC TGGTCACGGA TGGCGGCGAT TCGGCCCGCA CCCGTTTCCG TTCGCACTCG CTGCTCGGCA CGCTGACCTT CAACTTCGGT GGCGCTGCCC CGGTCGAGGA AACTCCTCCG CCGCCCCCGC CGCCCCCGCC GCCCCCGCCG CCTCCGCCCC CGCCCCCGCC GCCGGCTCCG GTCTGCAACA AGGGTCCGTA CATCGTGTTC TTCGACTGGG ATAAGTCGGA CATCACGCCG GAAGCAGCGA CCATTCTCGA CAACGCCATC ACGGCATACG GCAACTGCGC CAGCGTTCCG ATCATGCTTG CCGGCTATGC CGACCGTTCG GGCACCGTGA AGTACAACCA GGGCCTGTCG GAGCGCCGCA ACGCTTCGGT TCGCTCGTAC CTCACCACGC ACGGCGTGCC GGACGGCTCG ATCACGAGCC AGGCATTCGG CGAAAGCAAC CCGCGCGTTC CGACCGCCGA TGGCGTTCGC GAACTTCAGA ACCGTCGCGT GGAAATCACG TACGGTCCGG GTTCGGGCAT GTAA
|
Protein sequence | MRKLVLGVAL ASSALASPAL ARNDSWYVEA DAGAMILEDS EFKVDGVDAA AILDSHTGYD FGGIVGYDFG GFRLETEVGY RRAYNQEVDI AGTVYTKPQG HADALSFMVN GMLDFGPDDG MQGFVGGGVG VARAKYWINT PGGELNDSDT GFAWQAIAGI RVPVSDSVDV GLKYRFFNAD KVDLVTDGGD SARTRFRSHS LLGTLTFNFG GAAPVEETPP PPPPPPPPPP PPPPPPPPAP VCNKGPYIVF FDWDKSDITP EAATILDNAI TAYGNCASVP IMLAGYADRS GTVKYNQGLS ERRNASVRSY LTTHGVPDGS ITSQAFGESN PRVPTADGVR ELQNRRVEIT YGPGSGM
|
| |