Gene Slin_5052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5052 
Symbol 
ID8728817 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6155369 
End bp6157132 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content53% 
IMG OID 
Productglycoside hydrolase family 9 
Protein accessionYP_003389826 
Protein GI284039896 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.886971 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAGTTT CTGTAATCAT ACTTCTCTTG TTGGCCACCA GATTGGTCAA TGGACAATCT 
CTGTCCACAA CTATTAGGCT CAATCAGGTT GGGTTTTACC CTAATGCCGC AAAAGTGGCC
GTGATTGTAC ATGATGCAGG CGCATCTGCC GCGTCCGGTA AGTTTCAGCT TATGACTCCC
GACCAGAAAA AGGTTGTCTT TACGGGTACG TTGAGTGGCC CTAAACAGAA CCAGATTTCG
GGTAAGACAA GCCAATTCGC TGACTTTTCG GCTTTTAAAC AGAAGGGGAC TTATGTTGTA
GCCGTACCCG GTTTGGGCCA CTCGTACCCG GTCGAGATTC GGCCGGATGT TCACCGGGCC
GCAGCCATTG GTGCTCTGAA AGGATTTTAT TATCAGCGTA CGTCTACCGA CTTACCCGCC
AGGTATGCCG GTAAATGGGC CCGTCCGGCC GGGCATCCGG ACACTCGGGT CTTGATTCAC
CCATCGGCCG TTTCGCCAGG GCGCTCGGCG GGTACGGTTA TTTCGTCGCC CAGGGGGTGG
TACGATGCGG GCGATTACAA TAAGTACATC GTCAATTCGG GCATTACGGT GGGCACGCTG
CTTTCTCTGT ACGAAGACTT CCCTGCCTAT GCCAAATCGT TCGATACCAA CATTCCCGAA
ACAGGCAATG CTGTTCCCGA TCTGCTGGAC GAAGTGCTCT GGAACCTCCG CTGGATGCTG
ACCATGCAGG ACCCGGCTGA TGGGGGCGTT TACCATAAAT TGACGAATCC AAGCTTCGAT
GGGATGGTTA TGCCCGATAA AGCCACAAAA GATCGGTATG TGATTCAGAA AAGCATTACC
GCAACCCTCG ATTTTGCCGC TGTACTGGCC CAGGCCAGCC GGGTATTCAA ACAGTACAAC
AAAGCATTGC CCGGCCTATC GGACTCCTGC CGCACGGCGG CTGTGAAAGC CTGGAAATGG
GCTGAGCAAA ACCCCAACGC CGTCTATCGT CAGAACGAAA TGAATACGCA GTTCGACCCC
GACGTGGTAA CGGGGAGTTA CGAAGACCGC GATGCTAGCG ACGAGTGGGT TTGGGCTGCG
CTTGAACTGT ATGTGACGAC AAAAGACGAC GCCTATTATA CGGCTGCAAA ACTAAAATCG
GATGAGAAGC TAAGCTTACC CTCCTGGAAT CAGGTTCGCC TGTTGGGGTA TTATACACTA
GCCCGCTTTG CGAATGAGCT TACACCCCTT GGTAAACAGG GAAGCGAGTC GGTGAAGAAA
CAACTGCTTG CCTTTGCCGA TGACCTTATT CAGGGAACAG ACCAGCAGGC GTATGGTACG
GTGATGGGCA AATCGGCGCG GGATTACATT TGGGGGAGTA GCTCCGTTGC CGCTAACCAG
GGTATTGCGC TGATTCAGGC GTACAACCTC ACCAAAGATA ACCGCTACCT GCGGTTTGCC
TTAACCAATC TCGACTATCT GCTGGGCCGC AATGCAACGG GCTACTCGTT GTTGACGGGA
TTTGGCGCCA AACCGATTAT GAATCCGCAC CATCGGCCAT CCGTAGCCGA CGGAATCACC
GAACCAGTAC CAGGCCTACT ATCGGGCGGG CCGAATGGCA ATGCACCGAA GCAGGACAAA
TGCCCCGGAT ACACCGCCAC TTCGGCCGAC GAAATGCTTC TGGATGATTC CTGTTCGTAT
GCCTCCAACG AAATTGCCAT TAACTGGAAT GCCCCGTTGG TTTATCTGGC GACAGCTATC
GAGGCATTGC AAACAAAGCT TTAG
 
Protein sequence
MRVSVIILLL LATRLVNGQS LSTTIRLNQV GFYPNAAKVA VIVHDAGASA ASGKFQLMTP 
DQKKVVFTGT LSGPKQNQIS GKTSQFADFS AFKQKGTYVV AVPGLGHSYP VEIRPDVHRA
AAIGALKGFY YQRTSTDLPA RYAGKWARPA GHPDTRVLIH PSAVSPGRSA GTVISSPRGW
YDAGDYNKYI VNSGITVGTL LSLYEDFPAY AKSFDTNIPE TGNAVPDLLD EVLWNLRWML
TMQDPADGGV YHKLTNPSFD GMVMPDKATK DRYVIQKSIT ATLDFAAVLA QASRVFKQYN
KALPGLSDSC RTAAVKAWKW AEQNPNAVYR QNEMNTQFDP DVVTGSYEDR DASDEWVWAA
LELYVTTKDD AYYTAAKLKS DEKLSLPSWN QVRLLGYYTL ARFANELTPL GKQGSESVKK
QLLAFADDLI QGTDQQAYGT VMGKSARDYI WGSSSVAANQ GIALIQAYNL TKDNRYLRFA
LTNLDYLLGR NATGYSLLTG FGAKPIMNPH HRPSVADGIT EPVPGLLSGG PNGNAPKQDK
CPGYTATSAD EMLLDDSCSY ASNEIAINWN APLVYLATAI EALQTKL