Gene Slin_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3970 
Symbol 
ID8727728 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4767151 
End bp4768431 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content53% 
IMG OID 
ProductUDP-glucuronosyl/UDP-glucosyltransferase 
Protein accessionYP_003388759 
Protein GI284038829 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00770108 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCC ACCTGAATCC ACTGACCGGA CTGGCTGTTC ACCTTCAACA ACTTGGCCAC 
GATGTGCGCT GGTATACCGG TCCAACCTAC GCCGACAAAA TCAAATCGCT GGGTATCCCG
TACTATCCTT ATCAGCAGGC GAAGGAAATC AACCAGCTCA ACATGGATAC GGCGCTGCCC
GAACGCCAGC ATATCAAAGG AACCATAGCC CGGCTGCGGT TCGACCTCAA CAACCTTTTT
CTACTTCGGG CACCGGAGTT CGTGATTGAT TTGAAGGCCA TTTACAATGA GTTTCCGTAC
GATCTGCTCG TATGCGATAT GATCTTCACT GGAGCACCTT TCATCCAGAA ACTGCTGAAT
GTGCCGGTGG CTGCGGTGGG TGTGGTGCCT TTGTCCGAGA CAGGGCGGGA CGTACCACCG
GGTGGTCTGG GCATGGTGCC CGCCAACGGA TTGTTCGGGA AACTGAAGCA GGATTTTATT
CGCTACCTGA CCGTCAATCA CCTGCTCAAA CCCTGCACCG ATCTGTTCAA TCACCTGCTG
GAAGAACATG GTCTGCCCAC GACGACCGAT TTTATGTTCG ATACTTTCAT CCGGCAACCC
GATCTTTTTT TGCAGAGCGG TACGCCCGCT TTCGAATATC CGCGTCAGAC GATGAGCCCG
AACATTCGGT TCGTTGGCCC AATGTTGCCT CATAACAAAG GGGGGCGGCA TCCGTTCCGG
CAGGTGGAGT TGGCGAAGCA GTACAAAAAG GTGGTACTGG TAACGCAGGG AACCGTTGAG
CGCGATCCCG CCAAGATCAT CGTTCCTACG CTTGAGGCTT TTAAAGATGA TCCTAAAACG
CTGGTAGTTG TCACAACGGG GGGCTCACAG ACCGCCGAGC TGCGAGCGCG TTACCCGCAA
ACGAATTTTA TCATTGAAGA CTTTATTGAT TTCAATTCGG TCATGCCGCA TGTGCATGTA
TACGTGACCA ATGCGGGTTA TGGTGGGGTA ATGCTGGCCT TACAGCATGG ATTGCCGATG
GTGGCCGCCG GGGTGTATGA AGGCAAAAAC GACATTGCTG CCCGCATCGG GTACTTTAAA
GTGGGCGTAA ACCTGAAAAC GGAAACGCCA ACAGCCGCCC AGATTCGAAA AAGCGTGGCC
CAGGTGCTGG CCGACCGCAA TTACAAACGA AACGTGCAGC GTATAGGTGT CGACTTCATG
CAGTACGACG CAAACACGGT CTGCACAACG TACATCAACG AACTGCTGGG AAAGTTCGAA
CCTGAAGCGG AACTCGTGTA G
 
Protein sequence
MDGHLNPLTG LAVHLQQLGH DVRWYTGPTY ADKIKSLGIP YYPYQQAKEI NQLNMDTALP 
ERQHIKGTIA RLRFDLNNLF LLRAPEFVID LKAIYNEFPY DLLVCDMIFT GAPFIQKLLN
VPVAAVGVVP LSETGRDVPP GGLGMVPANG LFGKLKQDFI RYLTVNHLLK PCTDLFNHLL
EEHGLPTTTD FMFDTFIRQP DLFLQSGTPA FEYPRQTMSP NIRFVGPMLP HNKGGRHPFR
QVELAKQYKK VVLVTQGTVE RDPAKIIVPT LEAFKDDPKT LVVVTTGGSQ TAELRARYPQ
TNFIIEDFID FNSVMPHVHV YVTNAGYGGV MLALQHGLPM VAAGVYEGKN DIAARIGYFK
VGVNLKTETP TAAQIRKSVA QVLADRNYKR NVQRIGVDFM QYDANTVCTT YINELLGKFE
PEAELV