Gene Slin_3866 details


Gene Information

Locus tag: Slin_3866
Symbol:
ID: 8727624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 4641916
End bp: 4643523
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 49%
IMG OID:
Product: hypothetical protein
Protein accession: YP_003388655
Protein GI: 284038725
COG category:
COG ID:
TIGRFAM ID:
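
The lengths in the table are consistent with the listed coordinates. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted as a residue (the record does not state either convention explicitly):

```python
# Values copied from the Gene Information table.
start_bp = 4641916
end_bp = 4643523

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1608 535
```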


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.991992
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTTG TACTAACGTC GATTTTATTT GTAATTTCTC TCACCTACAC AGTTGCCCAG 
AGTTGCGATT GTGACATTAC CATCTCAAAA GCAGCCATGT ATGATGGCGA TGCACTGGAC
TACAAACCTG GCCAGACCAT TTGTATTAAA GCTGGCAACT ACCCCTATCT CTACTTTAAA
AATATAGTAG GAAAAAGCAA CAAACCCATT ACGATCATTA ACTGTGGAGG GCTCGTTACG
GTCGGCACCT CGACTGGCAC AAACGGCATC CAGTTCTACG ACAGCAAGTA CGTTAAGCTT
ACCGGCTCGG GCGATGACCG CTACAAATAC GGCATTAAGC TCACTAAAAC CCCCAGCGGA
GCATCGGGCA TTAACGTAAC AAGCTTTAGC TCCGACTTCG AGATTGAGCG GGTCGAAGTC
TCGGGTACCG GATTTGCCGG GATCATGATC AAGATGGACC CTACCTGCGA TCCAGCCACC
TGGCGGGGCA ACTTCTCGAT GTACAACATC AAGATACACG ACAACTACGT ACACGATACC
TACGGCGAGG GGATGTATAT CGGTAACTCG TTCTGGAACA GCGGCATTAC CAAGACCTGC
GATGGGGAGA GTCAGGTGGT TTATCCGCAC AACATTTATG GTCTGGCCAT TTACAACAAT
CTGGTCGAGC GCACCGGTGC CGAAGGCATA CAGTATGGCT GCGCTCCAGA GTATCGGGTA
TACAACAATG TGGTGAAAAA TACCGGTATC TCTCCGTTTG CCAAGTACCA GAACAACGGC
ATTCAGGCGT CGGGGGGCGT GTCGGGCCGA CTATACAATA ACGTCGTACA GAATGTAAGC
GGCAACGGTA TTATTATTCT TGGCCATACA GGGACGAACA TTATTTATAA TAACCTGGTC
ACGAATATTG GCGGGATAGG TGTTTTCTGC GACAACCGGC CCGGAACTCC CGCTGGCAAC
AGCATTATAT TCACGAATAA CACACTGGTT AATTGCGGAC AGGAAGGATT TGCGCTGTAT
AACAAGCTGG ATAAAAGCAC ACTGGCCAAC AATGCGGTGA TTCAGACAGG GTCGGGTAAA
TTAGTAGCTA CGCTGCCCGG AGTTCAGGTA ACGCAGACAG CCAATTACTA CGAAGCGACA
TTAGGATCGG CCCAATCGAA CGGACTATTC GATGACGACT TTAAGCCGGT ATCTGGCTCT
CCGCTCATTG ACAAAGGTGT CAATAACGGC TATTGGGGTG TTAAGCAGGA TCTCGATGGC
AAAAGTCGCC CAAAAGGAAA GAGTATTGAT ATTGGCGCTT ATGAATTTGA GCCGGGTGGC
GGCCGACTGG GCGCTGACCT AGGAGGTGTA GAAGCAGGCG TTCGGGAGGT GTTATCGTTT
CCTTCGCCCT GTGTCAACGA AGTGTTTCTC AAGCTGACCG GCTATGAACT AACCATTTCT
AAGGTGACTA TTTACACTGT TGATGGCAAG CCGGTTATGA ACGTGGCCCC TGCTACTCTA
AGCGATGAGG TCCGGTTAGA AACGGAGTCT CTGGCAACGG GTGTCTATGT GTATAAAGTA
GTCACTTCAG ATCTTAAGGT GTTACAGGGC CGATTTGTGA AACAGTAA
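
The gene sequence is printed in whitespace-separated blocks of ten bases, so the blocks have to be joined before any composition statistic is computed. The sketch below shows one way to do that and to compute a GC fraction. The helper name `gc_fraction` is hypothetical, and only the first line of the sequence is used as sample input, so its result is not the whole-gene figure reported in the table.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence only (sample input, not the full gene).
first_line = "ATGAAGTTTG TACTAACGTC GATTTTATTT GTAATTTCTC TCACCTACAC AGTTGCCCAG"
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full blocked sequence instead of `first_line` would give a value comparable to the GC content field.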
 
Protein sequence
MKFVLTSILF VISLTYTVAQ SCDCDITISK AAMYDGDALD YKPGQTICIK AGNYPYLYFK 
NIVGKSNKPI TIINCGGLVT VGTSTGTNGI QFYDSKYVKL TGSGDDRYKY GIKLTKTPSG
ASGINVTSFS SDFEIERVEV SGTGFAGIMI KMDPTCDPAT WRGNFSMYNI KIHDNYVHDT
YGEGMYIGNS FWNSGITKTC DGESQVVYPH NIYGLAIYNN LVERTGAEGI QYGCAPEYRV
YNNVVKNTGI SPFAKYQNNG IQASGGVSGR LYNNVVQNVS GNGIIILGHT GTNIIYNNLV
TNIGGIGVFC DNRPGTPAGN SIIFTNNTLV NCGQEGFALY NKLDKSTLAN NAVIQTGSGK
LVATLPGVQV TQTANYYEAT LGSAQSNGLF DDDFKPVSGS PLIDKGVNNG YWGVKQDLDG
KSRPKGKSID IGAYEFEPGG GRLGADLGGV EAGVREVLSF PSPCVNEVFL KLTGYELTIS
KVTIYTVDGK PVMNVAPATL SDEVRLETES LATGVYVYKV VTSDLKVLQG RFVKQ
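
The protein sequence can be cross-checked against the gene sequence by translating codons with translation table 11. The sketch below builds the codon table from the amino-acid string for that table in TCAG base order and translates the first three blocks of the gene. The function name `translate` is hypothetical, and alternative start codons are not handled; that simplification does not matter here because the gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, codons in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_sequence: str) -> str:
    """Translate a whitespace-blocked DNA string, stopping at the first stop codon."""
    dna = "".join(blocked_sequence.split()).upper()
    residues = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence (30 bases, 10 codons).
print(translate("ATGAAGTTTG TACTAACGTC GATTTTATTT"))  # prints: MKFVLTSILF
```

The output matches the first block of the protein sequence, MKFVLTSILF.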