Gene Slin_5014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5014 
Symbol 
ID8728779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6111984 
End bp6113591 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content52% 
IMG OID 
ProductAlpha,alpha-trehalase 
Protein accessionYP_003389790 
Protein GI284039860 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAC CACCGGCGAA CAGATTCATC CTTCTCTTCT CGCTACTTTG CCTGACACCC 
GCCTGGAGTC AGGCAGTTTT TGAGAAACCC CAGTCAACGA CATTGAATTT GGCGAGTCCT
GACGAACAGT TCGGGGCGTT GTTTGAAGCT GTTCAGTTAA AAGCCGTCTT TCCGGATTCG
AAAACATTTG CTGACTGCAC CCCCAAATTT CCAATAGCCA CCATTCTGGC AAGCTATGAA
AGTGCGCGGC AACGCAGCGA CTTCGATCTG AAAACGTTTG TTACCCAGAA TTTCACGCTA
CCCATCAAAC CGGCGTCTGG CTACACCAGC AAAGCAGGAC AAACGGCCCA GGAGCACATT
ACTGATTTAT GGTCCGTACT TACCCGACCG GCATCGACCG GCACTAAAGC GGGTACACCA
GCGGGTTCAT TAATTGCCTT GCCCAAGCCT TACGTGGTGC CGGGTGGGCG TTTTGGCGAG
ATCTACTATT GGGATAGTTA TTTTACCATG CTCGGCTTGA AAGCATCCGG CCAGACGGCC
CTGATTCGGA ATATGATCGA CAACTTCGCC TATCTGATCC GAACGTTCGG CTTTATTCCC
AATGGAAACC GGACGTATTT TTTAGGCCGG TCGCAGCCCC CGTTTTTTTC ATTAATGGTC
AACCTGCTTA GCGAAGTGCA GGGCCGTCGC GTTCTTGTTA CCTACCTGCC CGAGTTACAG
AAAGAGTATA ATTTCTGGAT GGATGGCAGA GACCAACTGA CCGACGAACG TCCGGCTTAC
CGACGCGTGG TGCGGCTCGA AGAAGGGGTT TACCTGAACC GATATTATGA TGATAAAATT
ACACCACGGC CGGAGTCGTA CAGGGAGGAT GTTCAACTGG CGAAACGAAC CAAAACCCCG
GCCATACTCT ACAAGCATAT CCGGGCCGGG GCCGAATCGG GCTGGGATTT CAGCAGCCGG
TGGTTTCGCG ATGGAAAGAA TCTGAAGACC ATCCATACAA CAGATTTCAT TCCGGTCGAT
TTAAATGCCC TCCTGGTCAA TTTAGAACAA ACACTTGCGG AAGGCTATCG GCTGAAGGGC
GATAAAGTTC AGGCCAAAAA ATACACCGTC CTGGCGCAGC AACGGCGCGA CGCTATCCTA
CGTTACTGCT GGAACGCCAA AAGCCAATTC TTTTTTGATT ACGATTTCGT TGCGGAGAAA
CTGTCGACGG TGTACTCACT TGCCGCTGTT TATCCCCTTT TTGTTCGAAT CGCGACACCC
TCGCAGGCGC AGGCGGTAGC TGTTACGCTG GAGAAATCGT TCCTGAAACC CGGTGGTCTA
ACAACGACGC TTGTCCGAAC CGGCGAGCAG TGGGATGCAC CCAACGGCTG GGCGCCCTTG
CAGTGGCTAT CCATCCGGGG CCTTCGTAAT TACAATCAGG TACAACTGGC CAACAAGGTC
AAGACCAACT GGGTCAATGA AAATTTGCGG GTGTATAAAG CTTCCGGAAA AATGGTGGAG
AAGTACGACG TCATCAGTAC GGCCGGAGCC AAAGGAGGGG AGTACCCCAA TCAGGACGGC
TTCGGCTGGA CAAACGGGGT GCTCCTGACG CTGCTGACCG AAAAATAG
 
Protein sequence
MKLPPANRFI LLFSLLCLTP AWSQAVFEKP QSTTLNLASP DEQFGALFEA VQLKAVFPDS 
KTFADCTPKF PIATILASYE SARQRSDFDL KTFVTQNFTL PIKPASGYTS KAGQTAQEHI
TDLWSVLTRP ASTGTKAGTP AGSLIALPKP YVVPGGRFGE IYYWDSYFTM LGLKASGQTA
LIRNMIDNFA YLIRTFGFIP NGNRTYFLGR SQPPFFSLMV NLLSEVQGRR VLVTYLPELQ
KEYNFWMDGR DQLTDERPAY RRVVRLEEGV YLNRYYDDKI TPRPESYRED VQLAKRTKTP
AILYKHIRAG AESGWDFSSR WFRDGKNLKT IHTTDFIPVD LNALLVNLEQ TLAEGYRLKG
DKVQAKKYTV LAQQRRDAIL RYCWNAKSQF FFDYDFVAEK LSTVYSLAAV YPLFVRIATP
SQAQAVAVTL EKSFLKPGGL TTTLVRTGEQ WDAPNGWAPL QWLSIRGLRN YNQVQLANKV
KTNWVNENLR VYKASGKMVE KYDVISTAGA KGGEYPNQDG FGWTNGVLLT LLTEK