Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_4007 |
Symbol | |
ID | 5077537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_009426 |
Strand | - |
Start bp | 174601 |
End bp | 175752 |
Gene Length | 1152 bp |
Protein Length | 383 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640481112 |
Product | hypothetical protein |
Protein accession | YP_001165774 |
Protein GI | 146275613 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.503267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCGCGC TCAGCTCCAA GCATCTTGCG AGCCTCACGC GCGCAGTCGC GCGGCGCTCT TGTGGGCGAG TCTCGCTGCC GCGGGTGGAG CTCAGCCGCG CCGAAGCAAT CCAGCTGTGG CAGGATCTCG TCCTTGCCAC TGCGGTTCCG CGAGGCGTGC TGCTCACGCC TCGCGAACAG GCATTGCTGG GCAAGCGCGC GTGGGCGCGC TCGGCGGGCC ATGTCGCGCT TGGTGAGGCT CCGCTGTCGC GCCAATCGGG CCTCGCCATC TGGCGGCTCG CGATCCGGCT TGGGGATCTT GCGGCGCGGC GATCGCCAAA CCCCGTCACC CGGGCTGCAA GCATCGCGGC CATCGGACTC GCGGCTACCC AGCTCGCCGC CTGCACCTCG CTGTTCGGCG GCAACATCAA GGGCAGCTTT GCCTGCAGCG CACCAGGCGG GACCTGCGCG CCTTCGACGG TGATCGACGA CCAGGCGCTG TCGGTGATCC AGAATGCCCG GCCGATGACC CCGGCAGGGC CGTACATCCG CCAGCCCGCC GCCGCCAAGC CGGTTACTGC GTCGTACACG CCGTCGGGTT CGGGCCGCAT CACCTCGGCC GGCGGCGGCA TGGTCCACCG CGAGCGCCGC GTGCTCAAGG TGGTGTTCCC CTCTTTCGTC GATGGTGGTG GCAATCTCCA CGAACCGCGG ATCATCCACG CAGTGGTCGA CAACGGCGCC TGGATGGAGC TGTCCTCGGG CGAGCCCAAT ATCGGCGAGC AGGTCGAGGG CAGGGCAGTT AGCCTCGCGA GCGCGGCATT CGTGCCGGCA CCGGTACCCG TCGCCCCGAT CGCCCAGGAC GCTGGGTCAC CCACTCCCAA AGCAATAGAT GCCGCGCCTT CCGGGCCGCC GCGCCCCGAG GCCGTTGCGG CGGCGCGCGC CAAGGGTGCC GCGCTCCAGT CGGGCAATGC GATCGAGGCG ATTCAGGCTC AGGTGCAGGC GCGTCTCGCG CCCACTGCGA AAGCGCACGT CCCGGCAGCA GCGGTACGTG CGGCGCCCAG TAGTGCGGTG CCGACCCCAG CTCCGGCTTC CGCCTCGGCG GTCAGCGCCG CTCCGGCGGC GACCAAGCCC GCCAATGGCC CCGCCGCCTT TCCGGCCAAG GTCGAGGAGT AA
|
Protein sequence | MTALSSKHLA SLTRAVARRS CGRVSLPRVE LSRAEAIQLW QDLVLATAVP RGVLLTPREQ ALLGKRAWAR SAGHVALGEA PLSRQSGLAI WRLAIRLGDL AARRSPNPVT RAASIAAIGL AATQLAACTS LFGGNIKGSF ACSAPGGTCA PSTVIDDQAL SVIQNARPMT PAGPYIRQPA AAKPVTASYT PSGSGRITSA GGGMVHRERR VLKVVFPSFV DGGGNLHEPR IIHAVVDNGA WMELSSGEPN IGEQVEGRAV SLASAAFVPA PVPVAPIAQD AGSPTPKAID AAPSGPPRPE AVAAARAKGA ALQSGNAIEA IQAQVQARLA PTAKAHVPAA AVRAAPSSAV PTPAPASASA VSAAPAATKP ANGPAAFPAK VEE
|
| |