Gene Slin_1420 details


Gene Information

Locus tag: Slin_1420
Symbol:
ID: 8725154
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 1726687
End bp: 1728279
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: glycoside hydrolase family 43
Protein accession: YP_003386269
Protein GI: 284036339
COG category:
COG ID:
TIGRFAM ID:
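
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the CDS includes its stop codon (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS that includes one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(1726687, 1728279)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```

Both values agree with the Gene Length and Protein Length fields listed above.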


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.394008
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value: 0.931127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTATTA GTAAAGGACT ACATAAGAAA GCACTGCTTT GGGCTGGTTT AATGGTCGTG 
TTGCTACAAA CACAAAATAG GGTAGTTGCG CAAACAGCCA CGAACCCGGT CATTTTCGCT
GACGTGCCCG ACATGGCCAT GATTCGGGTG GGCAACACTT ACTACATGAG CAGCACGACT
ATGCACCTGA GTCCGGGGCT GCCCATCATG AAGTCGAACG ACCTGATCAA CTGGAAGATG
GTGGGGTACG CTTACGATAC GCTGACGACA GCGGACGCGA TGAGCCTGAC CAACGGCAAA
AGCACCTACG GGCGCGGGTC GTGGGCCAGT AGCCTGCGCT ACCTCAACGG CCTGTATTAC
GTGACCACCT TTGCGCAGAC CAGCGGCAGA ACCCACATCT ACACCACCAA AGACATCGAA
AAAGGACCGT GGAAGGCCGT TTCATTCAAA CCGTCGTACC ATGACCACAG CCTGTTTTTC
GACGACGACG GCCGAACGTA TCTAATCTAC GGCGCGGGCA AACTCCGGCT GGTTGAACTG
AACACCGATG CGTCGGGGGT GAAACCCGGC ACCACGGAGC AGGTCATCAT CGAGAACGCC
AGCACACCCT CAGGAACCGG CGGTGGCCTG CCTGCAGAAG GGTCGCAACT GTTTAAAGTA
AAGGGTAAAT ATTATCTGTT CAACATCAGC TGGCCCCGGG GCGGTATGCG TACGGTGATT
ATTCACCGGG CCGACAAGAT CACCGGTCCC TGGGAAGGCC GCGTGGCCTT GCAGGACCTG
GGTGTGGCAC AGGGTGGACT CATCGACACG CCCGACGGCC GCTGGTTTGC CTACCTGTTC
CGCGATTTCG GCGGGGTGGG CCGGATTCCA TATTTGGTAC CCGTCGAGTG GAAAGACGGC
TGGCCGGTGC TGGGCGTCAA TGGCAAAGTG CCCGAAACCC TCGATCTCCC CGCCAGCAAA
GGCTTAATTC CCGGCATTGT CAACTCCGAC GAATTCACCC GCAAAAAAGG CGACCCCGCC
CTGCCGCTAG TCTGGCAGTG GAATCACAAC CCCGACAACG CCCTGTGGTC GGTTTCGGAG
CGAAAAGGAT TTCTGCGCCT GAAAACCGGA CGAACCGACA CCTCGTTTGT GATGGCCCGC
AACACGCTTA CACAGCGTAC TATCGGCCCC GAATGTACCG GGTCTACCCT ACTCGATGCG
TCGAACATGA AAGACGGCGA TTTTGCCGGT CTGAGCCTGT TGCAAAAGAA TTACGGGCTG
GTGGGAGTGA AAGTCGAAAA TGGAAGCCGG TCGATTGTCA TGGTGAGTGC CAGTTCGGGC
AAGCCGGTAG AGATACAGCG CGTTCCGTTG AGCCAGAAAA TGGTCTACCT GAAAGCGGAG
TGTGATTTCA ACGACCGTAA AGACACCGCC CATTTCTTCT ACAGCCTCGA CGGAAAAGCG
TGGAGCCCCA TTGGGGAACC GCTGAAAATG CCGTACACCA TCCCGCACTT CATGGGCTAT
CGGTTCGGGC TGTTCAACTA CGCCACCTTG CAAACGGGCG GTTTTGCCGA CTTCGATTAT
TTCCGGATTA CCAACACAAT CTCCGGGCAA TAA
 
Protein sequence
MIISKGLHKK ALLWAGLMVV LLQTQNRVVA QTATNPVIFA DVPDMAMIRV GNTYYMSSTT 
MHLSPGLPIM KSNDLINWKM VGYAYDTLTT ADAMSLTNGK STYGRGSWAS SLRYLNGLYY
VTTFAQTSGR THIYTTKDIE KGPWKAVSFK PSYHDHSLFF DDDGRTYLIY GAGKLRLVEL
NTDASGVKPG TTEQVIIENA STPSGTGGGL PAEGSQLFKV KGKYYLFNIS WPRGGMRTVI
IHRADKITGP WEGRVALQDL GVAQGGLIDT PDGRWFAYLF RDFGGVGRIP YLVPVEWKDG
WPVLGVNGKV PETLDLPASK GLIPGIVNSD EFTRKKGDPA LPLVWQWNHN PDNALWSVSE
RKGFLRLKTG RTDTSFVMAR NTLTQRTIGP ECTGSTLLDA SNMKDGDFAG LSLLQKNYGL
VGVKVENGSR SIVMVSASSG KPVEIQRVPL SQKMVYLKAE CDFNDRKDTA HFFYSLDGKA
WSPIGEPLKM PYTIPHFMGY RFGLFNYATL QTGGFADFDY FRITNTISGQ
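
The protein sequence and GC content in this record can be cross-checked against the gene sequence. The following is a minimal sketch, not the method used to generate this record. It assumes the sequence is pasted in as text in the 10-base blocks shown above. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as start codons. It makes no special case for alternative start codons, which is adequate here because the gene begins with ATG. The names `clean`, `gc_content`, and `translate` are illustrative.

```python
from itertools import product

# Codon -> amino acid map in the conventional TCAG ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError on codons with ambiguous bases such as N.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGATTATTA GTAAAGGACT ACATAAGAAA"))  # MIISKGLHKK
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_content` gives a fraction that can be compared with the GC content field.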