Gene Slin_0084 details


Gene Information

Locus tag: Slin_0084
Symbol:
ID: 8723812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 99936
End bp: 101099
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: 3-dehydroquinate synthase
Protein accession: YP_003384956
Protein GI: 284035026
COG category:
COG ID:
TIGRFAM ID:
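
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(99936, 101099)
print(length, protein_length_aa(length))  # prints "1164 387"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.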


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCATT TGCAGCAGTC TTTCACGGTT AAATTCGTTT ATAACGTCTT CTTTACCGGA 
AAACTTTTTG ATAGTACCAA TTCATTACTC GCCGATTTTT TTACTCAACA GGCCACCGGT
GATTTAAAGA AAAAACTCTT GTTCGTCGTT GATTCAGGCG TCATCGACTC GCATCCTGAT
TTGTCCGAAA AGATAAAATC CTATTTTTCT CATCATACCA ACCTTGTCGA GCTTGTCCCT
GATTTTCTGC TTATTCCGGG GGGCGAAGCA GCCAAGAATG ACCCTGACCT GGTCGAAAAA
ATTGTAGCCG CTGTTGATAC GTACGGCATT GACCGCCACT CGTATGTAGC CGCTATCGGG
GGCGGCTCCG TTCTCGATTT GGTGGGCTAC GCAGCCGCCA TCTCGCACCG GGGCATTCGC
CATATCCGTA TTCCAACCAC CGTTCTTTCG CAGAACGATT CGGGCGTGGG GGTAAAAAAT
GGCGTTAACT ACAAGGCCAA AAAGAATTTC CTCGGCACGT TTGTACCACC CGTAGCGGTT
TTCAACGACA GCGAATTTCT TCAGACGCTC GACGACCGCG ACTGGCGGGC GGGCATTTCG
GAAGCCATTA AAGTAGCGCT CATCAAAGAC AGGGCCTTCT TCGAGTGGAT CGAGGCAAAC
GCCGAAGCCC TGGCGGCCCG CGATATGACG GTTATGAGCT ACCTCATCCA CCGCTGCGCC
GACATGCACC TCCAGCACAT TGCCGGGGGC GACCCGTTTG AGATGGGTTC GTCGAGACCG
CTGGACTTTG GTCACTGGAG CGCACACAAG CTGGAACAAC TTACCAACTT CGAGCTGCGT
CATGGCGAAG CCGTTGCCAT TGGCATTGCC CTCGACAGCC TGTATTCGCA GCAACTCGGC
TGGCTTACCG AAGCCGATAC CAACCGTATT TTAAGCACGC TCCGTACCCT TGGCTTCTCG
CTCTACCAGC CCATGCTCGA CGCCGACAAC AGCGCGGGCG TCCTGAAAGG GCTGTCGGAG
TTCCGGGAGC ATCTGGGCGG ACAACTGACC ATCATGCTGC TAAAAGGCCT CGGTCATGGC
GTTGAAGTGC ATGAGATGGA TGCCGACCGG ATTCGGCAGG CGATTGCCAC CCTGCGCGAG
CACGAATTAT CAACCACAGT TTAG
 
Protein sequence
MSHLQQSFTV KFVYNVFFTG KLFDSTNSLL ADFFTQQATG DLKKKLLFVV DSGVIDSHPD 
LSEKIKSYFS HHTNLVELVP DFLLIPGGEA AKNDPDLVEK IVAAVDTYGI DRHSYVAAIG
GGSVLDLVGY AAAISHRGIR HIRIPTTVLS QNDSGVGVKN GVNYKAKKNF LGTFVPPVAV
FNDSEFLQTL DDRDWRAGIS EAIKVALIKD RAFFEWIEAN AEALAARDMT VMSYLIHRCA
DMHLQHIAGG DPFEMGSSRP LDFGHWSAHK LEQLTNFELR HGEAVAIGIA LDSLYSQQLG
WLTEADTNRI LSTLRTLGFS LYQPMLDADN SAGVLKGLSE FREHLGGQLT IMLLKGLGHG
VEVHEMDADR IRQAIATLRE HELSTTV
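
The sequences above are laid out in space-separated blocks of ten characters, so they need to be cleaned before any per-base statistic such as GC content can be computed. The following is a minimal sketch of that cleanup and of a GC percentage calculation; the helper names are hypothetical, and the exact rounding used for the GC content field of this record is not stated in the source.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage with the final line of the gene sequence only (not the whole gene).
last_line = clean_sequence("CACGAATTAT CAACCACAGT TTAG")
print(len(last_line), last_line.endswith("TAG"))
print(gc_percent(last_line))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` gives a value that can be compared with the GC content reported in the Gene Information section.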