Gene Information |
Locus tag | Saro_3174 |
Symbol | |
ID | 3918216 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 3389088 |
End bp | 3390464 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640445958 |
Product | hypothetical protein |
Protein accession | YP_498443 |
Protein GI | 87201186 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
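The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 3389088
END_BP = 3390464
GENE_LENGTH_BP = 1377
PROTEIN_LENGTH_AA = 458


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both comparisons hold for the values listed in this record.
print(computed_gene_length == GENE_LENGTH_BP)
print(computed_protein_length == PROTEIN_LENGTH_AA)
```

Under these assumptions, 3390464 - 3389088 + 1 gives 1377 bp, and 1377 / 3 - 1 gives 458 aa, matching the listed Gene Length and Protein Length.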
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGTGC TCTTCGCCTT TTCGCTTAAA TCTGTTAGGG GCCTGTGCGT GACACCTTTT CCATGGTCTG ACGTCTTCGT GATCCTGGGC CTCGTCCTGC TCAATGGCCT GTTTTCCATG TCGGAACTGG CGATCGTTTC GGCACGCCCG GCGCGCCTGA AGGTTGCCGC GGAGGAAGGC AGCAAGGGCG CGAAGGTTGC GCTGGCGCTC GCAGCCGACC CCGGAAAATT TCTTTCGACC GTACAGATCG GGATCACCCT CGTCGGCATC ATCGCAGGCG CCTATTCAGG GTCCAGCCTC GGCGGGCCGA TGGCGGAGCG GCTCGCCGCA TGGGGTTTTC CGGCCCGTTA CGCGGACGAT GCCGGGTTCG TCATCGTCAT CGCCTTCACC ACGTACCTGA GCCTCGTCGT CGGCGAACTC GTACCCAAGC AGCTCGCGCT GCGTGCGGCG GAACCGATCG CCAAGATCGC GGCGCCCGCC ATGGCGCTCA TGTCGAAGGT GACGGCCCCC TTCGTCTGGC TGCTCGACAA CTCGTCCAGC CTGCTCATCC GCCTGCTCGG CCTCAAGCAG GGCACGGACC AGGAAGTGAC CGCCGAAGAA CTCCACATGA TCTTCGCCGA GGCGACCCGC TCCGGCGTGA TCGAGGAGGA GGAGCGGGCG CTGATGACGG GCATCATGCG CCTTGCGGAA CGCCCGGTGC GCGAAGTGAT GACGCCGCGA ACCGAACTGC ACTGGATCGA GCGCAAGGCC CCCGAGGCCG AACTGCGCAG CGCGATCGAG GACAGCCCGC ACTCGCTGCT GCTGGTGGCC GACGGGTCGG TCGACAAGAT CGTCGGCGTG GTCAAGGTGC GCGACGTGCT GTCCACGCTG TTGCGGGGAC GCAAGGTCCA GCTCGGACGC CTGATGAAGA AGCCGGCCAT CGTTCCGGAC CAGCTCGACA CGATGGACGC GCTCGGCATG ATCCAGCAGG CCGAGGTCGC GATTGCGCTG GTCCACGACG AGTACGGCCA TCTCGAAGGC ATCGTCACCC CGGCCGACCT GCTGTCCGCC ATCGCGGGCA ATTTCGTCGG CCACGCGGAC GCGGGCGACG AACCCATGGT GGTCGAGCGC GAGGACGGTT CACTGCTGAT TTCGGGCGCC CTGCCCGCCG ACGCCCTTTC CGACCGGCTG GGCCTCGACC TGCCCGACGA CCGTGAGTTC GCGACGACGG CGGGCTACTG CCTTTCGGTG CTCAAGCGAC TGCCGAACGA GGGCGAGCAT TTTCACGACC AGGGCTGGCG CTTCGAAGTG GTCGACATGG ACGGGCGCAA GATCGACAAG CTGCTGGTCT GCCGCAGCAA GGCAATGCCC ATCGCCGCGC CGGAAGCCGA CGGCTGA
|
Protein sequence | MRVLFAFSLK SVRGLCVTPF PWSDVFVILG LVLLNGLFSM SELAIVSARP ARLKVAAEEG SKGAKVALAL AADPGKFLST VQIGITLVGI IAGAYSGSSL GGPMAERLAA WGFPARYADD AGFVIVIAFT TYLSLVVGEL VPKQLALRAA EPIAKIAAPA MALMSKVTAP FVWLLDNSSS LLIRLLGLKQ GTDQEVTAEE LHMIFAEATR SGVIEEEERA LMTGIMRLAE RPVREVMTPR TELHWIERKA PEAELRSAIE DSPHSLLLVA DGSVDKIVGV VKVRDVLSTL LRGRKVQLGR LMKKPAIVPD QLDTMDALGM IQQAEVAIAL VHDEYGHLEG IVTPADLLSA IAGNFVGHAD AGDEPMVVER EDGSLLISGA LPADALSDRL GLDLPDDREF ATTAGYCLSV LKRLPNEGEH FHDQGWRFEV VDMDGRKIDK LLVCRSKAMP IAAPEADG
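The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of how the gene sequence might be cleaned, translated, and checked for GC content. It uses only the first three blocks of the gene sequence as input, and the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is written out by hand with the amino-acid assignments of translation table 11, which match the standard code; the alternative start codons of table 11 are not handled here, which does not matter for this gene because it starts with ATG.

```python
from itertools import product

# Amino-acid assignments for codons in T, C, A, G order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
prefix = "ATGCGGGTGC TCTTCGCCTT TTCGCTTAAA"
dna = clean(prefix)
peptide = translate(dna)
gc = gc_fraction(dna)
print(peptide)
print(gc)
```

Applied to these 30 bases, the translation reproduces the first ten residues of the listed protein sequence (MRVLFAFSLK). The GC fraction of this short prefix is not expected to match the 66% reported for the whole gene; running `gc_fraction` on the full cleaned sequence would be the way to check that figure.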