Gene Slin_2631 details

Gene Information

Locus tag: Slin_2631
Symbol:
ID: 8726376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 3182404
End bp: 3184509
Gene Length: 2106 bp
Protein Length: 701 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: Beta-galactosidase
Protein accession: YP_003387447
Protein GI: 284037517
COG category:
COG ID:
TIGRFAM ID:
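
The length fields above can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of this record.

```python
# Cross-check of the length fields, under the assumptions stated above.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1

length_bp = gene_length(3182404, 3184509)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2106 701
```

Both values agree with the Gene Length and Protein Length fields in the record.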


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.214804
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAACCA CAATCGCCCG GTATGTACTG CTTACCATCT GTTTATTCCG TTCTCTGACG 
GGTTTTACAC AATCCACCCC GCCCGCAAAG CTGCTGTATG GCGTGGCGTA TTATGATGAG
TACATGCCCT ACGAACGGCT GGATAAGGAT ATTGCGATGA TGAAAGAGTC GGGTATCAAC
GTGGTCCGCA TTGCCGAGTC GACCTGGGGG ACGGTGGAAC CGCAGGATGG CGTATTTAAC
TTTTCGCATA TCGACCGTGT CCTGAACGCC ATGCACAAAG CCGGTATTCG GGTCATCGTC
GGTACGCCGA CCTACGCGGT GCCTACCTGG CTGGTTCGCA AATACCCCGA TGTGCTGGCT
ATCACCCCGA CCGGCCCGAA CCGCTACGGC CAGCGTCAGA ACATGGACAT AACCAATCCG
CATTACCGCT TCCATGCCGA ACGGGTCATT CGCAAGATCA TGGAGCACGT GAAAGACCAT
CCAGCCATTA TTGGCTATCA GATCGATAAC GAAACGAAGG CGTATAATAC CGCCGGGCCT
GACGTACAAA AGCAATTTGT CGAGTATGCT AAAGCCAAGT TTGGCTCGCT GGATTCGCTT
AACAAAGCCT TTGGTCTCGA CTACTGGAGT AACCGCATCA ATAGCTGGGA CGATTTTCCG
TCGATGGTCG GCTCGATCAA CGGCAGTCAG ACGGCCGAAT TCGCCAAATT TCAGCGGAAG
CTGGTGACGG ATTTTTTGGC CTGGCAAGCT GCCATCGTCA ACGAATACAA GCGGCCCGGC
CAGTTTGTTA CCCAGAATTT CGACCTGGAA TGGCGGGGGT ATTCCTTCGG GATTCAGCCC
GACGTGGATC ATTTTGCAGC CGCCAAACCG CTCGACATTG CCGGTATCGA CATCTACCAC
CCCACTCAGA ACGCGCTGAC CGGCATCGAA ATAGCGTTGG GCGGTGATCT GGCCCGGTCG
ATGAAAGTCG ATGGGTCGAA CCGGGGCCGG AATTACCTGG TGCTGGAAAC CGAAGCGCAG
GGTTTTGCGG AGTGGCTGCC GTACCCCGGT CAACTACGGT TACAGGCCTT CAGCCACCTG
GCGTCGGGCG CTAACATGGT TAGTTACTGG CCGTGGCATT CCGTTCATAA TGCGATAGAA
ACGTACTGGA AGGGCCTGCT GAGCCATGAT TTCCAGCCAA ACCCGACTTT TGATGAAGCC
AAAACCATTG GTAAGGATTT CGAGCGATTG AGTCCGCAGC TAGTCAACCT CAAAAAAACG
AATCAGGTAG CGGTTCTCTT CAGCAACGAA GCCCTGACCG CCTTTAACGC CTTCCGGTTC
GGCTGGTATA ATCGGGACAC GTACAACGAT ATTTTGCGGC CTATGTACGA TGCCCTCTAC
CGCATGAATG TGGGCGTCGA TTTTGTCGAT CCCAGCAGTA CCAACCTGCA ACAGTATAAA
CTAATCGTTG TGCCGGTTCT GTACGCAGCC TCCGATGAGC TACTGCACCG ACTGAACAAC
TACGTCAAAA ATGGCGGCCA CATTGTTTAT ACTTTCAAAA GTGGCTTCAC CGACCAGAAC
GTCAAGGTTC GCACGCAGGT GCAGCCGGCC ATCATCGGCG AGGCTCTGGG CATTACCTAC
AGTCAGTTCA CCACGCCCCA GAACGTAACG CTGAAAGGCG ACCCGTATGG CGTGGGGGCT
GAGCACAACA AGGTGAAAAC GTGGATGGAA CTCATCACGC CCACCACGGC GAAGGTGATG
GCCTATTACG ATCATCCGGT TTGGGGCAAG TACGCGGCCG TTACCCAGAA TGCCTTTGGC
AAGGGCCTGG CCACCTACAT AGGCTGCTGG ACTGACGACG CCATCACCGA AAAAATACTG
GCCGACGCGG TGAAGAAAGC CAACCTCTGG GGTGATGCCC AGTCGCTTGC GTTTCCCATC
ATTACCCGGC AGGGCGTCAA CCAGCAGGGC AAGACGATCC AGTACGTGTT CAATTATTCG
GCCAAACCGG TTAGCGTAAC CTACCCCTTT GCCAACGGCC GCGAATTGCT ACAAGGGCAA
TCGGTGCTGA AGAACGGCAA GCTGGAACTG GAACCCTGGG GGATTAAGAT TATTGAGGTC
AATTAA
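
The record does not say how its GC content value was calculated. A minimal sketch, assuming the usual definition (G plus C bases as a percentage of all bases) and that the spaces and line breaks in the sequence block are formatting only, could look like this. The function name `gc_content` is hypothetical.

```python
# GC percentage of a nucleotide sequence given as 10-base groups over
# several lines, as in the block above. Whitespace is treated as
# formatting and ignored.

def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)

# Small illustrative input, not taken from this gene.
print(gc_content("ATGC ATGC\nGGCC"))
```

Passing the full gene sequence to the function gives a value to compare with the GC content field in the record.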
 
Protein sequence
MQTTIARYVL LTICLFRSLT GFTQSTPPAK LLYGVAYYDE YMPYERLDKD IAMMKESGIN 
VVRIAESTWG TVEPQDGVFN FSHIDRVLNA MHKAGIRVIV GTPTYAVPTW LVRKYPDVLA
ITPTGPNRYG QRQNMDITNP HYRFHAERVI RKIMEHVKDH PAIIGYQIDN ETKAYNTAGP
DVQKQFVEYA KAKFGSLDSL NKAFGLDYWS NRINSWDDFP SMVGSINGSQ TAEFAKFQRK
LVTDFLAWQA AIVNEYKRPG QFVTQNFDLE WRGYSFGIQP DVDHFAAAKP LDIAGIDIYH
PTQNALTGIE IALGGDLARS MKVDGSNRGR NYLVLETEAQ GFAEWLPYPG QLRLQAFSHL
ASGANMVSYW PWHSVHNAIE TYWKGLLSHD FQPNPTFDEA KTIGKDFERL SPQLVNLKKT
NQVAVLFSNE ALTAFNAFRF GWYNRDTYND ILRPMYDALY RMNVGVDFVD PSSTNLQQYK
LIVVPVLYAA SDELLHRLNN YVKNGGHIVY TFKSGFTDQN VKVRTQVQPA IIGEALGITY
SQFTTPQNVT LKGDPYGVGA EHNKVKTWME LITPTTAKVM AYYDHPVWGK YAAVTQNAFG
KGLATYIGCW TDDAITEKIL ADAVKKANLW GDAQSLAFPI ITRQGVNQQG KTIQYVFNYS
AKPVSVTYPF ANGRELLQGQ SVLKNGKLEL EPWGIKIIEV N