Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2020 |
Symbol | |
ID | 3917341 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2152601 |
End bp | 2154541 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 640444772 |
Product | hypothetical protein |
Protein accession | YP_497293 |
Protein GI | 87200036 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGGCT TCTTCCGCTC GTTCCTCAAG TCCCGCATCG GCGTAGCAAT CGCCCTGCTT TTCCTCGGCC TGATCGCGCT CGCCTTTGCC AGCGCCGATG TCACCGGCAG CGGCTTCGGC GGCGTCGCCG GCGGGGACCG TGCCGCCAAG GTCGGCTCCG GCCGCATCGG CACGGGCGAA CTCGGCAAGG CACTGACCAG CGCGTTCGAG CAGGACCGCC AGAAGCAGCC CGGCCTGACC ATGAAGCAGT TCCTCGACGG CGGCGGCCTC GAAGGCGTCC TGTCGGGCAT GATCGACAGG CTGGCGCTTG CGGAATGGGG CAAGAAGCAC GGCCTCGCGG TCAGCGACCG CCTGGTCGAC AGCGAGATCG TCAAGATCGG CGCGTTCCAG GGCCCTGACG GCAAGTTCAG CCAGAAGGCC TATGAGCAAC TCCTCGCCCA GCGCGGCCTT ACGGACAAGG AAGTGCGCAA CGACCTTGCA CAGGGCCTGA TGGCCCGCCA GCTTCTGCTG CCCGCCGCGT TCGGCGCGCA GATGCCGGCC GAAGCGGTCC TGCGCTATGC CTCGCTGCTT ACCGAAAAGC GCGTCGGCAC GATCATTTCG GTCCCCTCGC TGGCCTTCGC GCCCGCCGGC GGCCCCGATG ACAACGCCCT CGCCGCGTTC TACAACGCCA ACAAGGGCCG CTACATGCAG CCCGAACGCC GCACGATCCG CTATGCGCTC GTCGACGAGG CAACGCTCAA GAACGTGCCC GCTCCGACCG ACGCGGAAAT CGCCAACCGC TACAAGCTCA ACACCGCCGT CTACGCGCCC AGCGAACAGC GCTCGGTCAC GCAGGTGATC GTGCCGACCG AAGCCGCGGC GCGCGCGCTC GCCGCCGAAG TCGGCAAGGG CGGCGCGCTC GACACGGCCG CCCGATCCAA GGGCCTCGTC GCCAGCAAGC TCGCCGACCA GACCCGCGAC ACCCTTGCAA ACCAGACGTC GAAGGCCGTA GCCGATGCCG CGTTCGCCGC CACGGCGGGC ACGCTCGCCA CGCCGGCGAA GTCGGGCCTG GGCTGGCATG TGCTCAAGGT CGATGCGGTA AAGCGCAATC CCGGCAAGAC GCTCGATCAG GCGCGCGCCG AAATCGTCAC GGCCCTGACG CTGGAAAAGC GCCGCGCGGC GCTTTCCGAC CTCGCCGCGC AAGTCGAGCA GGAAATCGAC AGCGGCACCG GCCTTGCCGA CATCGCGAAG AACCTTGGCC TCACCGTCCA GACGACGCAG CCGCTGCTGG CCAACGGCAC TGTCTTCGGC AAGGCGGCGG AAAAGGCGCC CGCCGATATC GCGCCGCTCG TCCAGGCCGC CTTTGCGATG GAACGGGAAG GCGAAGCCCA GCTTGCCGAA ATCAAGCCGG GCGAAAAGTT CGCCATCTAC GACGTCGGAC AACTCACCGC CGCGACCCCG GCTCCACTTG CCGCCATCAA GGACGCCGTC GCCCGCGACT GGGCGCTCCA GCAGGGCTCG GCCAAGGCGA AGGCCGCCGC CGACAGGATC CTCGCAGCGC TCGACAAGGG CACCCCGCTT GCCGAAGCCG TAAAGCTCGC CGGTGTCGCC ATTCCCGCGC CGCAGCCCAT CGACATGGGC CGCCAGCAGA TCGGCGCGAT GCAGGGCCAG GTCCCGCCGC CGCTCGCGCT GCTCTTCGCC ATGGCCGAAG GCAGCAACAA GCGCCTCGAG GGCCCGAACA AGGCCGGATG GTACGTCGTC TCGCTCAAGG ACATCGTGCC CGGCGCAGTG AAGCGCGAAG ACCAGATCTT CGCCGGCGCC TCGCGCGAGC TGGGCGCCGT CACCGGCAAC GAATATGCTG AATCCCTGCG CCGCGCGATC GGCAAGGACC TTGGCATCGA ACGCAACGAA TCCGCGATCA AGGCCGTCCG CAACCAGCTG ACCGGCGCGA ACAACCAGTA A
|
Protein sequence | MLGFFRSFLK SRIGVAIALL FLGLIALAFA SADVTGSGFG GVAGGDRAAK VGSGRIGTGE LGKALTSAFE QDRQKQPGLT MKQFLDGGGL EGVLSGMIDR LALAEWGKKH GLAVSDRLVD SEIVKIGAFQ GPDGKFSQKA YEQLLAQRGL TDKEVRNDLA QGLMARQLLL PAAFGAQMPA EAVLRYASLL TEKRVGTIIS VPSLAFAPAG GPDDNALAAF YNANKGRYMQ PERRTIRYAL VDEATLKNVP APTDAEIANR YKLNTAVYAP SEQRSVTQVI VPTEAAARAL AAEVGKGGAL DTAARSKGLV ASKLADQTRD TLANQTSKAV ADAAFAATAG TLATPAKSGL GWHVLKVDAV KRNPGKTLDQ ARAEIVTALT LEKRRAALSD LAAQVEQEID SGTGLADIAK NLGLTVQTTQ PLLANGTVFG KAAEKAPADI APLVQAAFAM EREGEAQLAE IKPGEKFAIY DVGQLTAATP APLAAIKDAV ARDWALQQGS AKAKAAADRI LAALDKGTPL AEAVKLAGVA IPAPQPIDMG RQQIGAMQGQ VPPPLALLFA MAEGSNKRLE GPNKAGWYVV SLKDIVPGAV KREDQIFAGA SRELGAVTGN EYAESLRRAI GKDLGIERNE SAIKAVRNQL TGANNQ
|
| |