Gene Information |
Locus tag | Saro_1066 |
Symbol | |
ID | 3916362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 1108934 |
End bp | 1110097 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640443801 |
Product | hypothetical protein |
Protein accession | YP_496345 |
Protein GI | 87199088 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5653] Protein involved in cellulose biosynthesis (CelD) |
TIGRFAM ID | |
|
|
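The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive (an assumption that matches the reported Gene Length) and that the unspliced CDS ends in a single stop codon that is not counted in the Protein Length. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1108934, 1110097)
print(length)                     # 1164
print(protein_length_aa(length))  # 387
```

The strand field does not affect this arithmetic: a gene on the minus strand spans the same number of bases between its start and end coordinates.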
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCATGCCC CGTCCTTAAC CCTGATGCCC GATCTGTCGC GCCCGGCGGC CGAACTGATC ACCAGCATGG AGCCCTGGGA CCAGTGGGCA GGCGCGCAAG CGATAAGCCT TTGGGCGGAG CTTGCCGATG CGGCGACGAC GCCGAACCCC TTCTTCGAAC ACTGGTATCT CCTTCCCGCA CTGGAAGCCT TCGATGCCGA CCGTGCGGCA CGCATCCTCG CGATAAGGAC CGGCGGCAGG CTGATCGGGC TCATGCCGAT CGTCATGCAA CGCAGCTATC AACGCTGGCC CCTGCGACAC CTTGCCTCGT GGCAGCACGC CAATGCCTTC CTTGGTACAC CGCTGGTGCG CGCCGGATGC GAGACACTGT TCTGGCAAGG ATTGCTGGAT TGGGCCGACA AGGCAGCGCC CGACGATGGC GCGCTGTTCC TGCACCTGCC GGCAATGCCC CTCGAGCAGC CGCTGACCCA GTGCCTGATG GACCTGTGCT TCGAAAATGG CCGTCCCGCG GGACTGGTGA TGCGCGAACA GAGGGCCTTG CTGCACTCTC CGCTGAGCCC GGAAGCCTAT CTGGAACGTG CGCTGCGAGG CAAGAAGCGC AAGGAACTTC GCCGCCAACA TGCCAGGCTT GCCGAACAGG GAGCGCTGGC GTTCGAGCGG CGCGAGGATG CGCAAGGGAT CGACGCGTGG ATCGAGACCT TTCTGGCGCT TGAAGCGGCT GGCTGGAAAG GGCGGTCGGC AAGCGCCATG GCCTTTGCCC CCGAAACCGC ATCGCTCTTC CGCCAGGCAC TGATCCAGGC GGCGGCGCTC GGTAAGCTGG AACGGCTTTC GCTGACGCTG GACGGGCGAC CTGTCGCGAT GCTGGCGAAC TTCATCACGC CGCCGGGCAG CTTCTCCTAC AAGACCGCAT TCGACGAGAC GCTCGCCCGC TTCTCGCCCG GCGTCCTCCT GCAACTGGAA AACCTCGCAT TGCTGCGCCG GGACGACGTG ACCTGGTGCG ATAGCTGCGC CGCGCCCGAT CACCCGATGA TCGACAGCAT CTGGACCGAA CGGCGCCCCA TCGGGCGCCT GTCGGTCGGC ATCGGTGGCA AGATCCGGCG TGCGATCTTC AAAACGACGC TTGCCCTCGA ACTGCAACGC AACCCGACTG GAATCGGTGC ATGA
|
Protein sequence | MHAPSLTLMP DLSRPAAELI TSMEPWDQWA GAQAISLWAE LADAATTPNP FFEHWYLLPA LEAFDADRAA RILAIRTGGR LIGLMPIVMQ RSYQRWPLRH LASWQHANAF LGTPLVRAGC ETLFWQGLLD WADKAAPDDG ALFLHLPAMP LEQPLTQCLM DLCFENGRPA GLVMREQRAL LHSPLSPEAY LERALRGKKR KELRRQHARL AEQGALAFER REDAQGIDAW IETFLALEAA GWKGRSASAM AFAPETASLF RQALIQAAAL GKLERLSLTL DGRPVAMLAN FITPPGSFSY KTAFDETLAR FSPGVLLQLE NLALLRRDDV TWCDSCAAPD HPMIDSIWTE RRPIGRLSVG IGGKIRRAIF KTTLALELQR NPTGIGA
|
| |
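The gene sequence begins with TTG while the protein sequence begins with M. Under translation table 11, TTG is one of the accepted alternative start codons and is read as methionine when it initiates translation, although it encodes leucine elsewhere. The sketch below illustrates this on the first 30 bases of the gene sequence. The codon table is deliberately partial (only the codons needed here), the start-codon set lists only three commonly cited initiators, and the function name is hypothetical.

```python
# Partial codon table: only the codons that occur in the first 30 bases.
CODON_TO_AA = {
    "TTG": "L", "CAT": "H", "GCC": "A", "CCG": "P", "TCC": "S",
    "TTA": "L", "ACC": "T", "CTG": "L", "ATG": "M", "CCC": "P",
}

# Assumption: a subset of the initiators allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_prefix(dna: str) -> str:
    """Translate a coding-strand fragment, reading a start codon in first position as M."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")
        else:
            residues.append(CODON_TO_AA[codon])
    return "".join(residues)


# First three 10-base groups of the Gene sequence field.
print(translate_prefix("TTGCATGCCC CGTCCTTAAC CCTGATGCCC"))  # MHAPSLTLMP
```

The result matches the first ten residues of the Protein sequence field. A full translation would need the complete 64-codon table rather than this excerpt.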