Gene Slin_1046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_1046 
Symbol 
ID8724776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp1277599 
End bp1280799 
Gene Length3201 bp 
Protein Length1066 aa 
Translation table11 
GC content55% 
IMG OID 
Productprotein of unknown function DUF1549 
Protein accessionYP_003385896 
Protein GI284035966 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTA CGACTGGATT CATTACTGTA GGCTATCAAA GGTTTCGATT CTTATTAGTC 
GTCTGTTGTA CCCTTCTGTT TCTTCAAAGT TGCGGGGTCG ACAAACCGGC GGAGATCGCT
CAACTGGACG ACAAGCTTCC AGAAACCATC GACTATAATC TGCACGTCAA GCCTATTCTG
TCTGATAAAT GTTTCTTCTG CCACGGGCCG GACAAAAACA GTCAGAAAGC CGGATTGGAA
CTTGCCACCC CAGAAGGGGC CATGGCCGCC CTCAAAAAAG CAAAAGGGAA ACACGCCATT
GTTCCCGGTG ATCTGGCAAA CAGCGAAGTT TACCACCGGA TTTTGAGCAC CGATGAGCAC
GAGATGATGC CGCCGAAAGC CTCGAACCGG TCGTTATCGG TTTACGAAAA GGCCGTTCTG
ATCAAATGGA TCGAGCAGGG AGCCGAATAT AAACCGCACT GGGCGCTCAT CAAACCGCAA
AAACCAACCT TCCCAACGGT AAAAAACACA AGCTGGCCCA AGAATCCAAT CGACTATTTT
ACGCTGAGCA AGCTGGAGGA AAAAGGACTC ACCCCTTCGC CCGAAGCCGA TAAGGAAACC
CTCCTTCGCC GGGTAACCCT CGACCTCACT GGGCTACCGC CTACCCTTGC CGAAATCGAC
GCTTTTCTGG CCGACACCTC GCCCAACGCC TACGAAAACG TAGTCAACCG GTTGTTGAAG
TCGCCCCATT ACGGCGAGAA AATGGCGGTC GACTGGCTGG ATTTATCGCG CTATGCCGAT
ACACATGGCT ACACCGTTGA CCGATACCGA CCCATGTGGC CCTGGCGCGA TTGGGTCATT
CAGGCATTTA ACCGCAACCG GCCATTCAGC CAGTTTATCA CCTGGCAACT GGCGGGCGAT
CTGCTTCCCA ACCCAACGCC CGAACAGCGG CTGGCAACGG CCTTCAACCG CAACCATTCG
CAGAATATGG AAGGCGGCAT TATCAACGAG GAGTTCCGGG TGGAGTACGT CGCCGACCGG
ACCAACACGC TGGGTACGGC CTTGATGGGT ATGACCGTCG AGTGCGCCCG CTGCCACGAC
CATAAGTTCG ACCCCATCTC GCAGAAAGAT TATTACAGCC TGTTCGCTTT TTTCAACAAC
GTGGACGAAG CCGGGCAGAT TTCCTGGGAC GATGCCATGC CCGTGCCCAC CATGCTGCTC
ACCCAGCCGA AAGAAGACAG CTTGCTGGCC TTCATCGACA CGAAAATCAG GAAGGCAGAA
ACCGATCTAA AACAAACCGT ACAAACCGAA CAGAAAGCCT TTGACGCCTG GCAGGCGAAG
ACCAATGCGG TGCCGTTCGA TGCCCGCAAA GGACTACAGG CACGTTTTAC CTTCGATAAA
CTGGTGAAGG GACGGTTCGT GAATGAAGTG AGCATCAAAG ACAGCGGTAA AGTAGCAGAC
CCCGTGCTTG TGCCCGGCAA GCTGGGTAAT GCCTTTCAAT CCAACGGCGA CGACATTCTG
ACACTCGGCC AGGCCGGTGT TTTCAATCGC TCGCAGCCGT TCTCCATTGG CGTGTGGGTC
AACATTCCGA AAGGCGTGTC GAAAGCGGCC CTTGTTCACA AAGGGAACGG TGATATTCTG
TATAATTTCC GGGGGTACTT CCTCAACCTC CGCGACGATA AAGCCGAACT GCTGATGGCC
CATACCTGGC CTTACAACAG TATTGTCCGG GTTAGTCGGC AGGTGCTGCC CAAGGAAAAG
TGGATTCAAC TGACCATGAC CTACGACGGC TCCAGCAAAG CGAACGACTT CCGGCTGTAC
GTCGACGGGC AGGAAATGGC CTTACAGACC GAACGCGACA ATCTCTACAA AGACATTTTA
TTCAACGGAA ATCAAACGGG TCTGATGGTG GGGGCAGATA TGCGCGGACC GGGTTTCAAG
AAGGGGCTGG TCGACGAACT GGTTGTCTAT AACCGTACGC TGAGTCCTCC TGAAGTGCAG
GCGCTGGCCC AGTCTGTGCA GCAGGTAACG CCGATGAAAG GCGACCCGTT GAAACAGTAT
TACTTCAGTC ATCTATCACC CGCTTACCAG CAGCAGCAGC AGGCTTTGCA GCAACTCCGG
GCTGAGCGGA ACAAACTGGT GGAGGCCATA CCGGAAATTA TGGTGATGGA GGAGATGAAA
ACGCCCCGTC CAACCTACCT GCTCAAACGG GGTGTTTACG ACGCCCACGG CCCGCAGGTA
AAACCCGATG TGCCGGCCAG CGTGCTGCCC TTTTCGCCGG ATCTTCCTCG CAATCGGCTG
GGGCTGGCCA AATGGCTGCT GCACCCCGAT AACCCGCTGA CGGCGCGGGT GGTGGTAAAC
CGGTACTGGC AAACGTATTT CGGCAACGCT ATTCAGAAGT CAGCCAACAA CTTCGGAAAT
CAGGGCGGGC TGCCTACTCA CCCGGAACTG CTCGACTGGC TGGCGGTTAC CTTCCGAGAG
AGCGGCTGGA ACATGAAAGC CATGCAGAAG CTGATTGTGA TGTCGGCCAC TTACCGGCAG
TCATCGTACG GCACCCGCGA AAGTTTGCAG AAAGACCCGG AAAACACCTG GTTGTCGCGC
GGACCGATTT CGCGCCTAAC CGCCGAGATG ATTCGCGACA ATGCGTTGGC CGCCAGCGGC
CTGTTAGTCA AACAACTGGG TGGCCCCAGT GTAAAGCCTT ATCAACCCGA AGGACTCTGG
GCGGTCAATA ACACGACGTA TGAACAGGAC AAAGGAGACA GCCTATACCG CCGGAGCCTG
TACACGTTCT GGCGTCGGAC CAACCCGCCC CCGTCCATGA ATACCTTCGA TGCGCCGTCG
CGCAGTTACT GCGTGGTTCA ACGACAGAAA ACTAGTACCC CATTACAAGC ACTGGTGTTG
CTCAACGATC CGCAATTTGT CGAAGCCGCC AAAGTCGTTG CCGAACGGTC GTTTGCCCAA
CACACCACCG TACCGGATCG GTTAACGCAC CTGTTTCGGC TTTTGGCCAG TCGGAAACCA
GCACCAAAGG AGCTGGCTAT CTTAACCCGC TTGTATCAGC AGGAATACCA GTTATTTACG
AAAAATCCGG CCAAAATGCA GGGCTGGCTG CAGGCCGGTG ACTATAAACT GAAGGATGTA
GCCGACAAGC CTGCTTTGGC CGCTGGTGCC GTTGTGGCCA GTACCGTCAT GAATTCCGAA
GCGTTCGTTA CCAAGCATTA G
 
Protein sequence
MAITTGFITV GYQRFRFLLV VCCTLLFLQS CGVDKPAEIA QLDDKLPETI DYNLHVKPIL 
SDKCFFCHGP DKNSQKAGLE LATPEGAMAA LKKAKGKHAI VPGDLANSEV YHRILSTDEH
EMMPPKASNR SLSVYEKAVL IKWIEQGAEY KPHWALIKPQ KPTFPTVKNT SWPKNPIDYF
TLSKLEEKGL TPSPEADKET LLRRVTLDLT GLPPTLAEID AFLADTSPNA YENVVNRLLK
SPHYGEKMAV DWLDLSRYAD THGYTVDRYR PMWPWRDWVI QAFNRNRPFS QFITWQLAGD
LLPNPTPEQR LATAFNRNHS QNMEGGIINE EFRVEYVADR TNTLGTALMG MTVECARCHD
HKFDPISQKD YYSLFAFFNN VDEAGQISWD DAMPVPTMLL TQPKEDSLLA FIDTKIRKAE
TDLKQTVQTE QKAFDAWQAK TNAVPFDARK GLQARFTFDK LVKGRFVNEV SIKDSGKVAD
PVLVPGKLGN AFQSNGDDIL TLGQAGVFNR SQPFSIGVWV NIPKGVSKAA LVHKGNGDIL
YNFRGYFLNL RDDKAELLMA HTWPYNSIVR VSRQVLPKEK WIQLTMTYDG SSKANDFRLY
VDGQEMALQT ERDNLYKDIL FNGNQTGLMV GADMRGPGFK KGLVDELVVY NRTLSPPEVQ
ALAQSVQQVT PMKGDPLKQY YFSHLSPAYQ QQQQALQQLR AERNKLVEAI PEIMVMEEMK
TPRPTYLLKR GVYDAHGPQV KPDVPASVLP FSPDLPRNRL GLAKWLLHPD NPLTARVVVN
RYWQTYFGNA IQKSANNFGN QGGLPTHPEL LDWLAVTFRE SGWNMKAMQK LIVMSATYRQ
SSYGTRESLQ KDPENTWLSR GPISRLTAEM IRDNALAASG LLVKQLGGPS VKPYQPEGLW
AVNNTTYEQD KGDSLYRRSL YTFWRRTNPP PSMNTFDAPS RSYCVVQRQK TSTPLQALVL
LNDPQFVEAA KVVAERSFAQ HTTVPDRLTH LFRLLASRKP APKELAILTR LYQQEYQLFT
KNPAKMQGWL QAGDYKLKDV ADKPALAAGA VVASTVMNSE AFVTKH