Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_3970 |
Symbol | |
ID | 8727728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 4767151 |
End bp | 4768431 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | |
Product | UDP-glucuronosyl/UDP-glucosyltransferase |
Protein accession | YP_003388759 |
Protein GI | 284038829 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00770108 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGGCC ACCTGAATCC ACTGACCGGA CTGGCTGTTC ACCTTCAACA ACTTGGCCAC GATGTGCGCT GGTATACCGG TCCAACCTAC GCCGACAAAA TCAAATCGCT GGGTATCCCG TACTATCCTT ATCAGCAGGC GAAGGAAATC AACCAGCTCA ACATGGATAC GGCGCTGCCC GAACGCCAGC ATATCAAAGG AACCATAGCC CGGCTGCGGT TCGACCTCAA CAACCTTTTT CTACTTCGGG CACCGGAGTT CGTGATTGAT TTGAAGGCCA TTTACAATGA GTTTCCGTAC GATCTGCTCG TATGCGATAT GATCTTCACT GGAGCACCTT TCATCCAGAA ACTGCTGAAT GTGCCGGTGG CTGCGGTGGG TGTGGTGCCT TTGTCCGAGA CAGGGCGGGA CGTACCACCG GGTGGTCTGG GCATGGTGCC CGCCAACGGA TTGTTCGGGA AACTGAAGCA GGATTTTATT CGCTACCTGA CCGTCAATCA CCTGCTCAAA CCCTGCACCG ATCTGTTCAA TCACCTGCTG GAAGAACATG GTCTGCCCAC GACGACCGAT TTTATGTTCG ATACTTTCAT CCGGCAACCC GATCTTTTTT TGCAGAGCGG TACGCCCGCT TTCGAATATC CGCGTCAGAC GATGAGCCCG AACATTCGGT TCGTTGGCCC AATGTTGCCT CATAACAAAG GGGGGCGGCA TCCGTTCCGG CAGGTGGAGT TGGCGAAGCA GTACAAAAAG GTGGTACTGG TAACGCAGGG AACCGTTGAG CGCGATCCCG CCAAGATCAT CGTTCCTACG CTTGAGGCTT TTAAAGATGA TCCTAAAACG CTGGTAGTTG TCACAACGGG GGGCTCACAG ACCGCCGAGC TGCGAGCGCG TTACCCGCAA ACGAATTTTA TCATTGAAGA CTTTATTGAT TTCAATTCGG TCATGCCGCA TGTGCATGTA TACGTGACCA ATGCGGGTTA TGGTGGGGTA ATGCTGGCCT TACAGCATGG ATTGCCGATG GTGGCCGCCG GGGTGTATGA AGGCAAAAAC GACATTGCTG CCCGCATCGG GTACTTTAAA GTGGGCGTAA ACCTGAAAAC GGAAACGCCA ACAGCCGCCC AGATTCGAAA AAGCGTGGCC CAGGTGCTGG CCGACCGCAA TTACAAACGA AACGTGCAGC GTATAGGTGT CGACTTCATG CAGTACGACG CAAACACGGT CTGCACAACG TACATCAACG AACTGCTGGG AAAGTTCGAA CCTGAAGCGG AACTCGTGTA G
|
Protein sequence | MDGHLNPLTG LAVHLQQLGH DVRWYTGPTY ADKIKSLGIP YYPYQQAKEI NQLNMDTALP ERQHIKGTIA RLRFDLNNLF LLRAPEFVID LKAIYNEFPY DLLVCDMIFT GAPFIQKLLN VPVAAVGVVP LSETGRDVPP GGLGMVPANG LFGKLKQDFI RYLTVNHLLK PCTDLFNHLL EEHGLPTTTD FMFDTFIRQP DLFLQSGTPA FEYPRQTMSP NIRFVGPMLP HNKGGRHPFR QVELAKQYKK VVLVTQGTVE RDPAKIIVPT LEAFKDDPKT LVVVTTGGSQ TAELRARYPQ TNFIIEDFID FNSVMPHVHV YVTNAGYGGV MLALQHGLPM VAAGVYEGKN DIAARIGYFK VGVNLKTETP TAAQIRKSVA QVLADRNYKR NVQRIGVDFM QYDANTVCTT YINELLGKFE PEAELV
|
| |