Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2500 |
Symbol | |
ID | 3916821 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2700925 |
End bp | 2702313 |
Gene Length | 1389 bp |
Protein Length | 462 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640445257 |
Product | hypothetical protein |
Protein accession | YP_497770 |
Protein GI | 87200513 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2342] Predicted extracellular endo alpha-1,4 polygalactosaminidase or related polysaccharide hydrolase |
TIGRFAM ID | [TIGR01370] possible cysteinyl-tRNA synthetase, Methanococcus type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.967827 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGCGGA CGGGAATGTA TGTTCTGCAG GGAATTGTTC CTTCGGAAAT TGCTGTAGCT CCGTTTGATG TCAAGGTAAT AGACATCTAC AACGATGACG GCGTAGCTTT CACGCCCACG CAGGTTGCGC AGATGGGCGG CGGACCGGGC AGTGCATTAC TTCTCGGATA CTTCAGCATT GGTGAAGCAG AGGTCTATCG AGACTACTTC AACACCATTC CGAAATCGGC CCTCGGACCT GAAAATCCAC AGTGGGCCGG TAACTATCAG GTCGCCTACT GGACGGCCGA ATGGCGCACC GTGGCAACCG CCTACATCGA CCGGATCATC GCTGCCGGCT ATGATGGCGT CTACTTCGAC GTCGTCGACG AATACCAGCA GAAGTGGGCG CAGACCTATT GCCCGGGCGG CGCGGCAGGT GCCGAACAAG CCATGGCGGA TCTGGTCGCC TACCTCGCCG ACTATGCCCA TGCCAAGAAC CCGGCGTTCA AGATCTGGGC AAACAACGCC GAGGAACTGC TGACCAACCA GACATATTTC AGCCACCTCG ACGGCATGTT CAAGGAGAAC CTGTTCTATA CGGACAGCGG TTCGAAGCAG CCTTCGAGCG AGACGCAGTA CAGCATGAGC CTCCTCCAGA TGATGCTGGC GGCGGGCAAG GATGTCATCG CCATCGAGTA CGTTTCGGAC TCGGCAAAGA TCGCCGACGT GGAAACGCAG GCCGCGCACT ACAACGTCGG CTACTACACC GCGGACATCA ATCTCGACGG CATCAGCTAT ACCGGCGTGC TGCCCGGCCA GTACATCCAC GAAGACTGGA GCGGTCTGAC GACGACCACG ACGACGACCA GCACCAGCAC CACCTCTACG TCGACAACCA CCTCCACCAC GACAACGTCG ACCACCACGC TGGTGACCGA TCTCACCCTG ACCGGGACGA GCAGTTCCGA CAACCTGGCT GGCAAGTCCG GCAACGACAA GCTCTACGGC AAAGCCGGTG CCGACGTGCT GTCAGGCAAC GGCGGCAACG ACTGGCTCGA AGGCGGCAAC GGCAACGACA AGCTTGCCGG CGGCGCGGGA GCGGACTCCT TCGTGTTCCG CGCCTATGGC AACAAGCACA GGGACGCGAT CGCCGACTAC CAGGCCGGCA CGGACCATAT CGTGCTCGAC CACGCGGTGT TCACTGCGGC GGGCGCCATC GGCACCTTGA GCGATGCCGG ATTCTGGATC GGATCGGCAG CCCACGATGC CAGCGACCGG ATCATCTACA ACGCATCCAC TGGATTGATC AGCTACGATG CGGACGGCAC CGGCAGACTC TCGGCCGTCG CCATCGCCAC CGTCGCGCCC GGCACGCTGC TGACGCATGC CGACTTCCTG ATCATCTGA
|
Protein sequence | MARTGMYVLQ GIVPSEIAVA PFDVKVIDIY NDDGVAFTPT QVAQMGGGPG SALLLGYFSI GEAEVYRDYF NTIPKSALGP ENPQWAGNYQ VAYWTAEWRT VATAYIDRII AAGYDGVYFD VVDEYQQKWA QTYCPGGAAG AEQAMADLVA YLADYAHAKN PAFKIWANNA EELLTNQTYF SHLDGMFKEN LFYTDSGSKQ PSSETQYSMS LLQMMLAAGK DVIAIEYVSD SAKIADVETQ AAHYNVGYYT ADINLDGISY TGVLPGQYIH EDWSGLTTTT TTTSTSTTST STTTSTTTTS TTTLVTDLTL TGTSSSDNLA GKSGNDKLYG KAGADVLSGN GGNDWLEGGN GNDKLAGGAG ADSFVFRAYG NKHRDAIADY QAGTDHIVLD HAVFTAAGAI GTLSDAGFWI GSAAHDASDR IIYNASTGLI SYDADGTGRL SAVAIATVAP GTLLTHADFL II
|
| |