Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2387 |
Symbol | |
ID | 3915732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2550202 |
End bp | 2553357 |
Gene Length | 3156 bp |
Protein Length | 1051 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640445142 |
Product | hypothetical protein |
Protein accession | YP_497657 |
Protein GI | 87200400 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGAACC GTGGAGCAAG AAACTGGGCT GCGCTGGCGG CAGGAGTTGC CGCCCTTGCG CTGGCATCGT CCGCATCGGC GGCGCCAGAC CAATCGCTGA TGTTGCGCGA CAGTTTCCCG CTGGGGTCGG GCGGCAGCAA GGCGCTGTGC CAGGTCCAGT CGCGCGTGGT CGATCCAGCG AACCGGAGTC CACTGGACCG GACCTGGGCG GTGGTCTGCC GCGATTCGGC GCTACCTGTC GGCTACGTGT TTGCGCTGCG GCAGGGTGAA GGTGACGTGA TGGCCCGCCT GGCCGAGCGG CGCCGCGCGC AGGTCGATTG CGCCGCCGCG ACGAGTGCGT CGGTCGCGGG TCGTATGCGT CAGGATTGCC GGTGGAAGGA CCCCGAGCTC GCCTATGAGG TCGTCAACGT GAATCGCGGC AAGACGAGCT ATGTGGTGGA GGGCTTCTCT GCCTATCACG GGGCGCTCGA CCTTGCCTTG CGTTCGATCT TCGAGGACCG GCCGATCGAC GAGCCGATCG AGGTGGCGAC GACATCGGTC AACGACACCG AAGCCTATGC ACGCATCCAG GCGCTGACGC TCGACCCGTC GACCGCGCTG GCCGAAGGGT ACCGGCGCAA CAATTCGGGC GACTATGCCG AGGCCGCCGC GTACTTCGAG ACGCTGGACC AGCGGCAGGC GTCGACCGGA GACGTGCCGA TCGACCGCGT CGAATTTCTT GTGAACGCCG GACTTCAGCG TTCGAACCTC GGTCAGTTCG CGGAGGCGGA TCGCCTGTTT GCCGAGGCCG ACGCGATGCC CGCGGGCAGC GGCGTGGTGG AGCGGCTGCG GCGCAACTAC GAGGCGATCC ACCTTCTCAA CCAGGAGAAG TACGCCCAGG CGCTCGAGCG GATCGACGCG CCGCTGCGGC AGACAAGCGC GATGGGCGCG GAGACGTTGC GCGACACGAT GGAACTGACC CCGTCGGTGG CGCGGCGCAT CAATGCGTCG GACGTGACCG CCAATGCCAT GGGCATGGTC GACGACTTGC GCCTGACCGA CGACGAGCGC GCGACGATCC TCGATGCGCA GACGCTCCAG CTCAAGGGCA CGGCGCTGCG CCTGCAGGGG CGGCGGACCG AGGCCAAGGC CGCGCTCGAG CAGGCCCAAT CGCGCGCGCT GGCGGTGCGT AACGGGCGTG TCGTCTCGAT CGTGCGGATG CGGGCGCAAC TGCTGACGGA GCTTGCGACG CTGGCCGAGG ATGAAGGTCG CGTTGGCGAT GCGGAAGGGC TCTTGCGCTC GGCCGTGGAC ATCGTGGGCG TCCAGTATCC CGATACGCGC AGCCTTGCGG CGGCGCAGGC GCGGCTCGCG GCGTTCCTTG TGCGGCACGA CAAGGCGGAT CAGGCCGAGC CGCTGTACCG CGAAGTCGTG GCGCGTTCGG TCGAGCGGGA GAACGGTCTT TCGGGCCTCT CGCGCCAGAT GGCCCCATAC TTCGACCTGC TGGCCGGGAA GATGGGCACC GATGCGGGGG CGAGCGATGC CTTCTTCGTC GCCTCGCAAG TGCTGGTACG GCCGGGCGTG GCGGAAACGC AGGCCGTTCT CTCGCGCGAG CTTTCGGGCG GAAGCGACGA GGCGGCCAGG CTTTTCCGGC AATCCAACAG CCTGACGCGC GGGATCGAGC GGGCGCGCAT GACCTATGCC GCGCTGCAGA GGCTGGACGA TGCAGCCGCG CGCACGGACG AGATCGCGGA AGCGGCAAAG CGGGTACAGG AGCTTGAAGC GCAGGAGCAG GCGACGGTTA TCCAGCTTGC CAACTATGCC CGCTACCGCG TTGTTTCGCA GCGGGTCATC GGCCAGAAGG AATTGCAGGA CGGCCTTGGT GCGGGCGAGG CCTATGCCAA GGTGGCGGTG CTCGGTCCGG ATGTCTTCGT GTTCTTTGCG AACAAGGACA AGGCCGTGGG CTACCGCGCG CCGATCACTT CGGGCGAGCT GGAGAAATCT GTCCAGTCGA TCAGGGATTC GATCTCGCGC TACGACGGCA AGCAGTACGT TACGAGCGAA TTCGCGGCCG CCGAGGCCTA TGGCGTCTTC AAGGCACTGT TCGGGCCGGT CGATGCGGAA CTCATGGCCG CGCACCACCT GATCTTCGAG CCGGACGGGG CGATGCTCAA GCTGCCGGTC AACGTGCTTG TAGCCGACGA CGCGTCGGTC AAGGCCTACC TCAAGCAGGC GGAGAGCCCA TCGGGCGACC CGTTCGACCT GCGGGGTATG AACTGGCTGG GCAAGGACAA GCTGGTCTCG ACCGCCGTTT CCGCCCGCTC GTTCATGGAT GCGCGCAAGC AGCCGGGGTC GAAGGCGCGT GAAGCCTATC TCGGTCTTGG CCGGAATGCG GCCATTTCGG CGAGCAGCAC GCTTGGCGCG GCACGGGTGC GCGGCGCGGA CGGAGACATG ACGGCAGACT GCAACTGGCC GCTCGCGACC TGGAATCGGC CCATTTCCGA GGCCGAGCTG CTGAAGGCGC GGATGCTGCT GGGATCGCAG GGGACCGACA TCATGGTCGG CCCGGCCTTT TCCGACACGG CGATCAAGTC GCGCCCCGAC CTCGACAACT TTCGCATCCT GCACTTTGCC ACGCATGGCC TCGTGACGCC GCCGCAGCCG ACCTGTCCGG CTCGGCCGGC GCTGGTCACG TCCTTCGGTC CCAAGGGATC GGACGGACTG CTCAGCTTTT CGGAGATTTT CGACCTCAGG CTAGACGCCG ACATGGTGAT CCTTTCGGCC TGCGACACGG CGGGTCAGGC AGACGTTGCC GCCACGCGCG CGGCGGGCAT CGTAACCGGC GGCGGATCGG CTCTGGAGGG GCTGGTGCGC GCGTTCATCG GTGCGGGGAG CCGTTCTGTC CTCGCCAGCC ACTGGCCAGC CCCGGACGAT TTCGATGCCA CCGCGCGCCT TATCAACGGC CTGTTCGAAG CGCCTCCGGG TACCTCGTCG GGCGAGGCCC TGCTCAAGGC GCAGCAGGCG CTGATGGCGG ACGCGGACAC TTCGCATCCC TATTACTGGG CCGGATTTGC GGTTATCGGT GACGGTGCGC GGCCGCTTGT TTCGCGGGAC GCGCGCGTGG CAATGAAGGC GACCGGCATG GCCGCTGGAC CGGCCAAGGT TTCGGGGGGG AACTGA
|
Protein sequence | MRNRGARNWA ALAAGVAALA LASSASAAPD QSLMLRDSFP LGSGGSKALC QVQSRVVDPA NRSPLDRTWA VVCRDSALPV GYVFALRQGE GDVMARLAER RRAQVDCAAA TSASVAGRMR QDCRWKDPEL AYEVVNVNRG KTSYVVEGFS AYHGALDLAL RSIFEDRPID EPIEVATTSV NDTEAYARIQ ALTLDPSTAL AEGYRRNNSG DYAEAAAYFE TLDQRQASTG DVPIDRVEFL VNAGLQRSNL GQFAEADRLF AEADAMPAGS GVVERLRRNY EAIHLLNQEK YAQALERIDA PLRQTSAMGA ETLRDTMELT PSVARRINAS DVTANAMGMV DDLRLTDDER ATILDAQTLQ LKGTALRLQG RRTEAKAALE QAQSRALAVR NGRVVSIVRM RAQLLTELAT LAEDEGRVGD AEGLLRSAVD IVGVQYPDTR SLAAAQARLA AFLVRHDKAD QAEPLYREVV ARSVERENGL SGLSRQMAPY FDLLAGKMGT DAGASDAFFV ASQVLVRPGV AETQAVLSRE LSGGSDEAAR LFRQSNSLTR GIERARMTYA ALQRLDDAAA RTDEIAEAAK RVQELEAQEQ ATVIQLANYA RYRVVSQRVI GQKELQDGLG AGEAYAKVAV LGPDVFVFFA NKDKAVGYRA PITSGELEKS VQSIRDSISR YDGKQYVTSE FAAAEAYGVF KALFGPVDAE LMAAHHLIFE PDGAMLKLPV NVLVADDASV KAYLKQAESP SGDPFDLRGM NWLGKDKLVS TAVSARSFMD ARKQPGSKAR EAYLGLGRNA AISASSTLGA ARVRGADGDM TADCNWPLAT WNRPISEAEL LKARMLLGSQ GTDIMVGPAF SDTAIKSRPD LDNFRILHFA THGLVTPPQP TCPARPALVT SFGPKGSDGL LSFSEIFDLR LDADMVILSA CDTAGQADVA ATRAAGIVTG GGSALEGLVR AFIGAGSRSV LASHWPAPDD FDATARLING LFEAPPGTSS GEALLKAQQA LMADADTSHP YYWAGFAVIG DGARPLVSRD ARVAMKATGM AAGPAKVSGG N
|
| |