Gene Slin_5374 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_5374 
Symbol 
ID8729139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp6534873 
End bp6536201 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content51% 
IMG OID 
Productglycoside hydrolase family 28 
Protein accessionYP_003390141 
Protein GI284040211 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.558216 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACAA GATTAGTTGC ACTGTTGTGT GCGGTGGTGA TAAGTGCACC GCTTTGGGCA 
CAGAAGGTAG AGAAAACCTC ATTTCCGGAT GGAAGTCCGA TCAGTAACTG GTTTACAACA
GTTCGGAAGG TAAGCCTTAG CCAACTCGGC AAGCGGTACG TGATTACGGA CTATGGAGTA
GGGTCAGACA GCACACGTGT TCAGACCGAA GCCATTCAGA AGGTAATTGA CCTGGCAGCC
CGCAACGGTG GGGGCATAGT TGTCATTCCG AAGGGTACGT TTATGAGCGG GGCGTTGTTT
TTCAAACCCA AAACCCATCT GCACCTCGCA GAAGGGGCCG TTCTCAAAGG CTCGAACGAC
ATTGCCGATT ACCCAAAGCT ACCATCGCGG ATGGAGGGCC AGAATCTAGA TTACTTTGCC
GCGCTGGTCA ATGCTTATCA GGTAGACGGT TTTACAATTT CGGGAAAAGG GACGATTGAT
GGCAACGGGT TACGCTTCTG GGAGGCATTC TGGGCCCGAC GAAAAGAAAA CCCCAACTGC
ACAAATCTGG AGGTTTCGCG CCCCCGGCTG GTGTTTATCT GGAAATGCAA CAACGTACAG
GTGCAGGATG TAAAGCTTCA TAATGCGGGC TTCTGGACAA GCCACTACTA CCAGTGCACC
AACGTGAAAG TGCTCGACGC GCATATTTAT TCGCCGTACA AGCCCGTGAA AGCACCGAGT
ACGGACGCTA TTGATATCGA CGCCTGTTCA AACGTGCTCA TCAAAGGATG CTACATGTCG
GTCAATGACG ACGCTATTGC CCTCAAAGGC GGCAAAGGCC CCTGGGCCGA CCGGCAACCC
GGCAATGGAG CCAACACCAA TATTATCATT GAAGGCTGTG AGTTTGGCTT TTGCCATTCG
GCGCTTACCT GCGGCAGTGA AGCCATTCAT AACCGGAACA TCATCATGCG GAACTGTCAC
GTGAGCGAGG CCGACCGGGT CTTGTGGCTT AAAATGCGGC CCGATACGCC CCAACTGTAC
GAATACATCC GGGTCGAAAA CATAAAAGGT CAGGCCCATA GCCTCCTATA TGTGAAGCCC
TGGACGCAGT TTTTTGATTT GCAGGGCCGC CCGGACATAC CGCTTTCCTA TTCAGACCAC
GTAAGCCTTA AAAACATTGA GCTTACCTGC GATACGTTCT TTGATGTTGC TCCTTCGGAG
CATAATAAGC TCTCTAATTT TACGTTCGAG AACCTGGTGG TCGAGAGTAA AAACGCGGCC
ATTGACAAAA CGGTAGTTAA CGGGTTTACG CTCAAAAACG TAGCCGTCAA CCATAAACTC
GTAAATTAA
 
Protein sequence
MTTRLVALLC AVVISAPLWA QKVEKTSFPD GSPISNWFTT VRKVSLSQLG KRYVITDYGV 
GSDSTRVQTE AIQKVIDLAA RNGGGIVVIP KGTFMSGALF FKPKTHLHLA EGAVLKGSND
IADYPKLPSR MEGQNLDYFA ALVNAYQVDG FTISGKGTID GNGLRFWEAF WARRKENPNC
TNLEVSRPRL VFIWKCNNVQ VQDVKLHNAG FWTSHYYQCT NVKVLDAHIY SPYKPVKAPS
TDAIDIDACS NVLIKGCYMS VNDDAIALKG GKGPWADRQP GNGANTNIII EGCEFGFCHS
ALTCGSEAIH NRNIIMRNCH VSEADRVLWL KMRPDTPQLY EYIRVENIKG QAHSLLYVKP
WTQFFDLQGR PDIPLSYSDH VSLKNIELTC DTFFDVAPSE HNKLSNFTFE NLVVESKNAA
IDKTVVNGFT LKNVAVNHKL VN