Gene Slin_5281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5281 
Symbol 
ID8729046 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6438671 
End bp6440599 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content55% 
IMG OID 
Productmalto-oligosyltrehalose trehalohydrolase 
Protein accessionYP_003390049 
Protein GI284040119 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.64117 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACACC TCATCGATAG CGGCCAACGC TCGCTGGGCG TCACGTTTCC CAACGAGCAC 
GAAGCCAGCA TACAGCTATG GGCACCCCTG GCCAAGTATG TAGCCATAAA AATATACGGG
CATCCGACAG CCCTTCCTTT GACGTGCGAA GAACTGGGCT ACTGGCATTT GACAACCACC
CAGCTTAAAC CCGGCGATCT GTACACGTTT AAGCTCGACG GGCAGGAAGA ATATCCCGAT
CCGGTATCCC TTTGCCAGCC GCAGGGCGTA CACGGCCCCT CGCAGGCGGT CGATACAGGA
AGCTTCTCGT GGACCGATCA GGACTGGCAA AATCCGGCGC TGGACAGCTA CGTGCTTTAC
GAACTGCATA CCGGCACGTT TACCGAAGAA GGCACCTTTC AGTCGCTGGA GAGCAAGCTG
GATTACCTGA AAGCGCTGGG CGTAACGGCC ATTGAGATCA TGCCAGTAGC GCAGTTTTCC
GATTCGCGCA ACTGGGGTTA CGATGGCGTG TACACCTATG CCGTTCAGCA GTCGTACGGG
GGAGCCAATG GCCTCCATCA CCTGGTCGAT ACCTGCCATA AAAAAGGCAT TGCGGTGGTG
CTGGACGTAG TATACAACCA CTTCGGACCG GAAGGAAATT ACCTCGGCAA CTTCGGCCCT
TACCTGACCG ACAAATACTG CACCCCCTGG GGAAAGGCCG TTAACTTCGA CGATGCCTGG
TGCGATGGGG TTCGGCGGTA TGTGCTCGAA AATGCCCTGA TGTGGTTTCG GGATTTTCAC
ATCGACGCCC TGCGGCTCGA TGCCGTTCAT GCCATCAAAG ATTTCAGCCC GGTCCATATC
CTACAGGAAC TCCGGCAAAA AGTCGATGAA CTTATGGCCG CTACGGGTCG CCGGTACTAC
CTCATTGTCG AGAACGACCT AAACGATCCG CGCTACATCG ACCCGCTGTC TGAGCATGGT
TACGGCATGG ATGCCCAGTG GAACGACGAA TTTCACCACG CGCTCCGGGT AGCCGCTGGC
GAAGAAAAAA CCGGTTACTA CGCCGACTTC GACGGGCTGA GCCACTTGGC GAAATCGTAC
AGAGATTCTT ACGTATATGA TGGTCAGTAC TCAGCCGTTC GTAACCGGTT TTTCGGTGGC
AAAGCCGAGA CGAATCCGGG GCAGCAATTC ATTGTCTTTT CGCAGAATCA CGACCAGGTG
GGCAACCGCA AGTTGGGCGA GCGGTCGAGT CAGCTGTACA GCTTCGATGC GCTCAAGCTG
CTGGCGGGCG CAGTACTGGT CAGTCCCTAC ATTCCGCTAC TATTCATGGG TGAAGAATGG
GGCGAAACGA GTCCGTTCTT CTACTTTGTA AGCCATACGG AACCGGAGCT GGTCGAGGCC
GTTCGGCAGG GACGCAAGGA AGAATTTGCT TCCTTTCATT CCGACGGCGA CGACGTGCCC
GATCCGCAAA GCCACGAAAC CTACCAGCAG GCCAAACTCC AGTGGAACCT CATCGGGCAG
AAACCGCATC AGCTACTGCT TCGCTATTAC CAGACCTTAC TTGCCCTGCG CCGACAGTTA
CCCGCCCTGG CTCATCTGGA CCGGACCAAA CTCAACGTCA TTGACGATCT GAAGGCCGAA
ACGCTGGTGT TGCACCGCTG GCATGACGAC CAGCATGTGC TGTGCCTGAT GAATTTTTCC
AAACAACCCC AATCCATTGC CCTGCCAGCC GTTGGCGAGC CCAACACAAG CTGGCAAAAA
GTACTGGACT CTGCCGATGA ACTGTGGCAA CCGGAACCCG CATCCGATCT GAGCCAGGCA
CCCGAATCGG TAACGGGTTC CGAAACCGTT CCAGTCCGGC CCGAGTCATT TATTCTTTAC
GCACAATCTC ATGAAAAATC CCGTTTCCAC CTACCGGATC CAATTTCACA AGGACTTTAC
CTTTCGTGA
 
Protein sequence
MTHLIDSGQR SLGVTFPNEH EASIQLWAPL AKYVAIKIYG HPTALPLTCE ELGYWHLTTT 
QLKPGDLYTF KLDGQEEYPD PVSLCQPQGV HGPSQAVDTG SFSWTDQDWQ NPALDSYVLY
ELHTGTFTEE GTFQSLESKL DYLKALGVTA IEIMPVAQFS DSRNWGYDGV YTYAVQQSYG
GANGLHHLVD TCHKKGIAVV LDVVYNHFGP EGNYLGNFGP YLTDKYCTPW GKAVNFDDAW
CDGVRRYVLE NALMWFRDFH IDALRLDAVH AIKDFSPVHI LQELRQKVDE LMAATGRRYY
LIVENDLNDP RYIDPLSEHG YGMDAQWNDE FHHALRVAAG EEKTGYYADF DGLSHLAKSY
RDSYVYDGQY SAVRNRFFGG KAETNPGQQF IVFSQNHDQV GNRKLGERSS QLYSFDALKL
LAGAVLVSPY IPLLFMGEEW GETSPFFYFV SHTEPELVEA VRQGRKEEFA SFHSDGDDVP
DPQSHETYQQ AKLQWNLIGQ KPHQLLLRYY QTLLALRRQL PALAHLDRTK LNVIDDLKAE
TLVLHRWHDD QHVLCLMNFS KQPQSIALPA VGEPNTSWQK VLDSADELWQ PEPASDLSQA
PESVTGSETV PVRPESFILY AQSHEKSRFH LPDPISQGLY LS