Gene Slin_3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3966 
Symbol 
ID8727724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4761930 
End bp4763735 
Gene Length1806 bp 
Protein Length601 aa 
Translation table11 
GC content56% 
IMG OID 
ProductGlucose/sorbosone dehydrogenase-like protein 
Protein accessionYP_003388755 
Protein GI284038825 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAAAA TTTACATCAA AAGCATTGCC CTCGCCCTGC TCACGTTTGT CGGATTTACG 
CAGCGGGCGA CGAAAAACGG ATTACCGATT CAGACAGAAG CCAACGGCGG GCTCTTTCTG
CCCGAAGGGT TCGAAGCAAC GGTGGTGGTC GACAGCCTGC CCGGTCGGGC GCGGCATCTG
GCGGTCAACG AGAACGGCGA CATTTACGTA AAAGCGCGTT TCGCCCGGAA TAAAGATGAG
TCGGTGATTG CCCTGCGGGA TACGAACGGC GATGGCCGGG CCGACATCAT CAAAACCTTT
GGCGGTATCG GTAAAGAGCG GGCGTATGGT ACAGCTGTGC GGATCTACAA AGGTTACCTC
TACTTCAGTT CGGAGATGAA CGTGTTTCGC TATAAACTCA AGCCCGGCGA ACTGATCCCG
AGTAGTCCGA TGGAAACGAT CCTGACCGAC GACCATGAGC ACGGGATGCA CGAACACATC
GCCAAACCCA TTACGTTCGA CAACGACGGC CATATGTACG TAGCTTACGG TGCGCCTTCC
AACGGGTGCC AGCCTAAGAA CCGGACGCCC AACATGGCCG GTATCGACCC ATGCCCCATG
CTGGAAGACC ACGGCGGCAT CTGGCAGTTC GACGCCAACA AACCCAACCA GACCCAGCGG
GATGGCCGTC GGTATGCTAC CGGATTACGG TCTGTAGTGG GCATGGACTG GAACCCGACC
AACAATAGTT TGTTTGCCCT CCAGCATGGC CGCGACGATT TGCTGATGCT GTGGGCCGAA
AAATACAACC CCTGGCAGAG TGCCGTTTTC CCCGCCGAAG AGCTTTTCCA GGTGAAGGAC
GGCATGGACG GCGGCTGGCC CTATTGCTAT TACGACCAGA TCCAGGGCAA GAAACTGCTC
AACCCCGAGT ACGGTGGCGA TGGTAAACTG GTGGGCCGCT GTGGCGACTA CGAAAAACCA
CTGATCGGTT TTCCGGCACA CTGGGCTCCC AATGACATTC TGTTCTACCA GGGCAACAGC
GCATCGAACG GCTTCCCGGA GCATTACAAA AATGGTGCCT TCATTGCCTT TCACGGCTCG
ACCAACCGGG CACCGTATCC GCAGGCGGGG TATTTTATCG GCTTCGTACC GGCCAAAGGC
AATGGTTTAT CGAGCAGTTG GGAAGTCTTT GCCGATGGCT TTGCCGGTGT CGACCCGATT
GTCAATGTCA GCGACGCACA TTATCGTCCG ATGGGTGTGG CCATGGGGCC CGATGGTTCG
CTGTATTTCG CCGAAACCGA GAAAGGCAAA ATCTGGAAGG TTACCTACAA AGGCAACAAA
CAGAACTTCG GTGCGGCACA ACTGGCCCAG ATGGAGAAGC GCAAGACCCT GTCGAATATC
CGCGACCCAC ATATCATCAC CGACAATCTG GACCGGGATC GTCCCGTGGC GGGAGGAAAA
GTCTACGGTG TTTACTGCTC GGCCTGCCAC CAGCGGAACG GCCTGGGCGA CTCGCAACGG
TTTCCGCCCC TGGCCGGTTC GGAATGGGTA ACGGGCGATA AAAAGAAACT CATTACGGTG
CTTTTGAAAG GGCTGGAAGG GCCCATTGAG GTGAAAGGCC AGTCGTACAA CAACGCCATG
CCCCAGCACA GTTTTTTGAA AGATGAAGAG CTTTCGGAAG TATTGACCCA CATCCGGCAG
AACTTTGGCA ACACGGCCGA TGGCATCAGC GCGGCCGAGG TCAATGAAGT TCGGCTGGCG
ATCAACCAGC AGGAGCGTAA AGAGACTACC CCAAAACGCA AAAACAGTAC AAAAGCCAAA
CGATAA
 
Protein sequence
MTKIYIKSIA LALLTFVGFT QRATKNGLPI QTEANGGLFL PEGFEATVVV DSLPGRARHL 
AVNENGDIYV KARFARNKDE SVIALRDTNG DGRADIIKTF GGIGKERAYG TAVRIYKGYL
YFSSEMNVFR YKLKPGELIP SSPMETILTD DHEHGMHEHI AKPITFDNDG HMYVAYGAPS
NGCQPKNRTP NMAGIDPCPM LEDHGGIWQF DANKPNQTQR DGRRYATGLR SVVGMDWNPT
NNSLFALQHG RDDLLMLWAE KYNPWQSAVF PAEELFQVKD GMDGGWPYCY YDQIQGKKLL
NPEYGGDGKL VGRCGDYEKP LIGFPAHWAP NDILFYQGNS ASNGFPEHYK NGAFIAFHGS
TNRAPYPQAG YFIGFVPAKG NGLSSSWEVF ADGFAGVDPI VNVSDAHYRP MGVAMGPDGS
LYFAETEKGK IWKVTYKGNK QNFGAAQLAQ MEKRKTLSNI RDPHIITDNL DRDRPVAGGK
VYGVYCSACH QRNGLGDSQR FPPLAGSEWV TGDKKKLITV LLKGLEGPIE VKGQSYNNAM
PQHSFLKDEE LSEVLTHIRQ NFGNTADGIS AAEVNEVRLA INQQERKETT PKRKNSTKAK
R