Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1987 |
Symbol | |
ID | 3917307 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2106466 |
End bp | 2109585 |
Gene Length | 3120 bp |
Protein Length | 1039 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444739 |
Product | hypothetical protein |
Protein accession | YP_497261 |
Protein GI | 87200004 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCCGC GCCGGATGCT TTCCGCGAAG AACAGTCGTC TTGGGGGCCA CTTCGCGAAG ACGACCCTTT CGATCGCCGC ACTTGCTTGT GCCATGCCGG CAGTCGCCGG AGGAACCGGC CAGCCGCTGT CGGTGCGCAA TTCCTTCCGC ATAGGATCGT CGGGCGTGAC CTGCACCGCG CAGAACGCTC CTCTCGACAA GCGGCTTGGT GGCATCTTCG ACCGTGGCTA TCGGCTGAGT TGTCGCGATG CCGCAGGCGC TGTGGGCACG CTGATCGCAG TGCGGCGCGA GGTGTTGCCT GGGAGCGAGC CGACCGGGCT TTCCGGCGTT ACGCTTGCTT GCGGGGCGGA CGGGTTTGTC GCGATCGATG CGGTCGGCCG TGTGAATGCC GCCAATTGCC GGGATCAGGA CGCCAACGTC GAATACAAGC GCTATGCCGT CACCCGGCGC GGCGTGACCT ATTTCGTCGA GGGGCTTGCC GGGTACGATC CAGCCCTGCG CCTTGCGCTG GCGAGCGTCG TGAACGACGC ACCCGTGCCG GGTGAAATAC GTGTCGCAAC GACCGAAGTA AGCGACCCGG CCGCCTTCGC CCGGGTACAG GCCGGTCAGC TCGACCGCCT TGGCGCCCGC GACGAAGGGT ATCTGCGCAA CAACGGAGGC CGCTACGCGG AGTCGGCGGA GTTCTTTGAG GCTCTGGCGT CGAGGGACCG TGCCAGTGGC GGGGCGGGCC TTGCCGAAGC CCTTGCCAAC CAAGGTCTCC AGCAGTCCAA CCTCGGAAAC TTCGCCGCGG CCGAACGCCT GTTTGGGCAG GCCGCTGCCG CGCCGGGCGG CAGGGATGGC GTGACGCAGC GACTCGTCCG CAACTACCGC GCGATCAACC AACTCAACCA GCGCAAGCCT GTGGCCGCGA TCAACGCCCT GGCCGAACCC GTAGCGGCGG TGAGCGTCTC TTTCGACCGC GACAGCCTGG TGCTGGGGCT GATCAACATG CCCTTGGCGG AGCAGATCAA CCGGGAGAGT TCGGCGCTCA AGCGCTTGGG CGCAGTCGAC CCCGGATTGA CCGAGACCGA GCGTGCCGAA ATTCTCGATG CGCAGGCGGA TTCCCTGCGC GCGACCGCAG CGCGGTTGCA GGGCAAGTAC GACGTTGCGG TGACCGGCTT CGAGATCGCA GGCCGACGCC TGGATGCAGT TCGCGGCGGG CGCGTGGCTT CCGCTGGATG GTTGCGCTCC GAAATCCAGA TTGAACTTGC TCTGTTGGCA GAGGCACAGG GCCGCAACGC CGATGCAGCA ACCGCGTTTG ACCGCGCCAT CGCCATCATC GATAGCGCCT TCCCACAATC GCCCGCACTG CTTTCGGCGC AGGCGCGCAA GGCTGCGTGG CTGGGCCGCT CGGGCGACGA GGCGGGCGCG CTGGCGCTTT ATGCCCATGT CGTGGATCAA AGCCTTGCCG TGCCGGATGC CGGTACCACC CTGCGCGATC TGCTGGGCCC CTATTTCGGC TTGCTCGCGA AGCGGAACGG TGCCGGTAAT GCGAGCGCGA TGTTCGCCGC CTCGCAAGTC CTCCAGCGTC CGGGTGTTGC CCAGACCCAG GCGGTTCTGG CGCGCCAGAT GTCGGAGGGC AACGACGAGG CCGCGGCCCT GTTCCGTCTG TCGCTTGCCA GAAGCCGCGA TATCGCGCGC ACCGAGGCAA TGGTCGAGCA GCTTTCCGCC TTGACCGGAC TGACCGAGCA GCAGGCTGCA TCACTCAAGG CGGCACAGGA CAACCTGGCA TCGCTCAAGC GTGACCAGAC GGCGCTGGTG AGCAAGCTTG CAGCCTATCC TCGCTACAAC GTCCTGGCGC CGAAGGGCGT GGAACTTGCC GAACTGCAAT CCGCGCTCAA GCCGGGCGAA GCGTACTACA AGATGATGGT GGTCGGCGGA CGGGTCTATG GACTGTTCGT CACGTCAGGG AGCGGCGAGG CGCGGACGTT CGATACCGGG ATCGATCCGG CGACCTTGTC TCGCAATGTC CAGGCGATCC GCGATACGAT CGTCAAGGTG GAGAACGGCC AGCAGGTCAA TTATCCTTTC GACCTCGACA AGTCGCGCGC CCTCTACAAG ACGCTGTTCG GTGCGGTGGA AGACCTTCTG CCCCAGACCC GCCACCTGGT GTTTGAACCG GACGGCGCGA TGCTGCAGCT TCCTCCCACA GTGCTGGTGT CGGGAGACAA GGGCATCGAG GCCTACAAGG CCCGGATGGA AAGCCCCGAT GGCGATCCGT TCGACTTCAC CGGGGTCGAG TGGCTTGGAC GCGGCCGGGA AGTTTCCATC GCCGTCAGTC CGCGCGGATT CCTCGACATC CGCAAGCTGG CCGCATCGAC CGCGCCGCGC AACTACCTTG GTCTGGGGCA CAATGCGAAG CCGGCCGCGC GCCCGGTTAC CGCAGTCGCG GATGAGTGCG ACTGGCCGCT TGCGACGTGG CAGAATCCGA TCTCTGCCGA CGAGCTGTAC TATGCCCAGA AGAAGCTTGG TGCGGGAGGC AGCGCGGTCA AGACCGACGC CGCGTTCAGT GACAGCGCCT TGCTGGCGGA GAGTGACCTC GACCAGTATC GCGTTCTGCA CTTTGCCACG CATGGACTGG TCACTGCGCC GCGCGCCGAC TGTCCTGCGC GCCCTGCGCT GGTGACCAGC TTTGGCGACA TGGGCTCCGA TGGCCTCCTG AGCTTCCGCG AGATCTTCGA CCTGAAGCTC AACGCGGACC TCGTAATCCT CTCCGCGTGC GACACGGCCG GTATGGCGAC TGTGGCGGCA AGCCGGGAGG CCGGCGTGAC CTCGGGCGGG AACTACGCCC TCGATGGCCT CGTCCGTGCG TTCGTGGGGG CTGGTGCGCG TTCGGTCATT GCCAGCCACT GGCCGGTGCC CGACGACTTC GACGCGACCA AGCGCCTCAT CGGCGGCGTC ATCGAAGCGA AACCGGGACA GGATCTTGCC GATGCCCTGT CGGGCGCACA GACCCGGCTG ATGGACGACC CCAACACATC GCATCCGTTC TACTGGGCAG CGTTCATAAT CCTCGGCGAC GGGGCGAAGC CGCTGGTGTC GGGGAAGGCT GCGATGTCCG CTCCTGATCC GGTCCGGTAA
|
Protein sequence | MSPRRMLSAK NSRLGGHFAK TTLSIAALAC AMPAVAGGTG QPLSVRNSFR IGSSGVTCTA QNAPLDKRLG GIFDRGYRLS CRDAAGAVGT LIAVRREVLP GSEPTGLSGV TLACGADGFV AIDAVGRVNA ANCRDQDANV EYKRYAVTRR GVTYFVEGLA GYDPALRLAL ASVVNDAPVP GEIRVATTEV SDPAAFARVQ AGQLDRLGAR DEGYLRNNGG RYAESAEFFE ALASRDRASG GAGLAEALAN QGLQQSNLGN FAAAERLFGQ AAAAPGGRDG VTQRLVRNYR AINQLNQRKP VAAINALAEP VAAVSVSFDR DSLVLGLINM PLAEQINRES SALKRLGAVD PGLTETERAE ILDAQADSLR ATAARLQGKY DVAVTGFEIA GRRLDAVRGG RVASAGWLRS EIQIELALLA EAQGRNADAA TAFDRAIAII DSAFPQSPAL LSAQARKAAW LGRSGDEAGA LALYAHVVDQ SLAVPDAGTT LRDLLGPYFG LLAKRNGAGN ASAMFAASQV LQRPGVAQTQ AVLARQMSEG NDEAAALFRL SLARSRDIAR TEAMVEQLSA LTGLTEQQAA SLKAAQDNLA SLKRDQTALV SKLAAYPRYN VLAPKGVELA ELQSALKPGE AYYKMMVVGG RVYGLFVTSG SGEARTFDTG IDPATLSRNV QAIRDTIVKV ENGQQVNYPF DLDKSRALYK TLFGAVEDLL PQTRHLVFEP DGAMLQLPPT VLVSGDKGIE AYKARMESPD GDPFDFTGVE WLGRGREVSI AVSPRGFLDI RKLAASTAPR NYLGLGHNAK PAARPVTAVA DECDWPLATW QNPISADELY YAQKKLGAGG SAVKTDAAFS DSALLAESDL DQYRVLHFAT HGLVTAPRAD CPARPALVTS FGDMGSDGLL SFREIFDLKL NADLVILSAC DTAGMATVAA SREAGVTSGG NYALDGLVRA FVGAGARSVI ASHWPVPDDF DATKRLIGGV IEAKPGQDLA DALSGAQTRL MDDPNTSHPF YWAAFIILGD GAKPLVSGKA AMSAPDPVR
|
| |