Gene Slin_2076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_2076 
Symbol 
ID8725814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp2503510 
End bp2506734 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table11 
GC content54% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003386914 
Protein GI284036984 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGATTAC TTGTAACAAC ATCAATAGCT GTATTATTCC TTTTTTGTGC TGGTTTGACG 
ACCTGGGCTC AGGTAACGAC CAGCGGACTT AGCGGTCGGA TCACTGACGC TAAAAATGAA
GCACTTGTAG GCGCCACTGT TCAGGCAACT TACACGCCCA CCGGCACCAA ATATGCCGCC
GTAACCGATG TGGAAGGACG CTACCGGATC AATAACATGA ATGCGGGTGG GCCTTACGAA
CTAGTAGTAA CCTACGTTAG CTATAAAACC GAAACCCGTT CGGACATTAC GCTACAGCTT
GGCGAAACGA CCAATTTGAA CATCACCCTT CAGGACGCCA GTACCCAACT CTCGGAGGTT
GTAGTGAAAG CAAACCGGGA AGGCGAACGC CAGGGCGCGG GCATTAGTGT CAACAGCGAG
ACGATTCGGC GGTTACCGAC CATCTCCCGC AGCCTGACCG ACATGACGCG GCTGACACCC
CAGGTTAGTA ACAACAACTC CTTTGCGGGC ACCAACTTCC GCTACAACAA CGTCACCATC
GACGGCGCTA TTAACAACGA CGCCATTGGA TTCAGCCCTT CGCTGGGTGG CTCAACGGGC
ACTACGGGGC AACCTGGTTC CAGCACACGT ACCAACCCGG TTAGTCTGGA TGCTATTCAG
GACATTCAGG TAGCCGTAGC TCCTTTTGAC GTTCGATTAG GTAACTTCCT GGGTGGTTCG
GTCAATGCCG TTACCCGCAG CGGTACCAAC CAGGTAACGG GTTCTGTTTA CGGCTTTGGC
CGTAATGCCG CCCTGACGGG TGCGTGGAAC GGTGCTTCCA ATGCCAAAGA GAAACTACCC
AGCACCTTCC ACGAGTATCA GACGGGCGTT CGGGTCGGTT TTCCGCTGAT CAAGAACAAA
CTGTTCTTCT TTACCAACGA AGAGATCACC CGTCGGCAGG ACCCCGTCCA GTTTCAGGCC
GGTTCACCCA GTTCGCTCAT CAAAGATGCG GCACTGGCTC AGCAACTGAG CGATTTTGTG
AAAGCCAATT ACGGTCTCGA TGCTGGCTCG TATGGTGACT ACGCCATATA CTCCAATAGC
ACCAAGTTCT TCAACCGGCT GGACTGGAAC ATCAACGACA AAAACCAGTT GACCATCCGC
AACAACACGG TTTTCTCGGA AGCGACTAAC CTGGAGCGCG ATGCGGCCAA CTTCCGCTTC
GGTAGTATCG ACTTCAAACA GGTCAACAAC CAGAGCAGCA CCGTGGCCGA ACTAAAAAGC
CAGTTCAGCG GCCGGGCGTC AAACAGTCTG ATTTTAGGGT ACTCCAGCAT CCACGACTAC
CGCAATACGC TGTCCAACGT GCGCACGTTC CCCCAGGTTG AGATCGGCTA CAATGGCGGT
ACCATCTTTC TGGGCAACGA CCGGGAGGCT TCCATTTTCA ACCTGCGTCA GAAAACCTTC
GAGCTCACCG ACAACTTTAC CTTCTACCTG GGCAAAAACA CGTTTACGGT GGGTACGCAC
AACGAGTTTT ACACCATCGA CTATGGCTTT GTCAACTCGC CCAATGGTCG GATTTCGTAC
CGCAACCCGA CTGAATTCCT GGCTAAACTA CCCAACCGGG TTCGGGGTTC GTATCCCTTC
GCCGATGGCA CCAACAACCT CGACAACCAG TTTAACAACC CCTATGCCCA CTTCAATGTC
AACCTGCTCA GCTTCTATGT ACAGGATGAT ATTCAGGTAT CGGATCGGCT GAAACTGTCG
CCGGGTGTGC GTCTGGACTA CACCGGCCTA CCCAATAAAC CAACGCTCAG TCCGCTGGTA
ACCGGGTCGG CCGGCGACCC GACCAACCTG GGCCGAACCT ACAGCTTCAC CCCGCTTAAC
CAGATCGCCA ATAGCTACCT CAACAACGTT CAGATTTCGC CCCGCCTTGG CTTCACCTTC
GATGTCAATG GCGACAAGAG CCTGGTTGTT CGGGGGGGAA CAGGCCTGTT CACGGGCCGT
ATTCCGTTTG CCTGGCTGGG CTATGCCTTC TACAATAATG GGGTAGGCTA TGGCGCGTAC
GATTTCAACA ACAACGCAAC GGCCACGACG AAGCTGGTGG GTGATCCGCT CGTTGCGAAC
GGCGGTCTGC TGATCAACAA CAACCCGGCC AACGGGGGCG TGACGCGTAC ACAGGTCGAT
TTGATTGACA ACAACTTCAA GATGCCGCAG ATGTTCCGTA ACAATCTGGC CGTTGATTAT
GTGGTAGGCG GCTATAAACT CACGGTAGAA GGCCTGTACA CAAAGGTGAT CCAGGACCTG
AAATTTCAAC AGGTTAATAC GAAGGATGTG GTTCGGTACT ACAGCTACGA TACCCAGCAG
CAACAGCCCA TTTATGTGGC CGCTAATGGC TCAGCGGGTG CACAGCGTAT CGACAACAAC
TTTGCCAACG CTTATCTGCT CTCTAACACG AACAAGGGAT ACCGCTACAG CCTGACGGGA
CAAATTCAGC GGAACTTCCC ATTCGGATTC GGCTTCTCAA CGGCTTATAC CTACGGCAAG
TCGTTCGACC TGACCAACGG TATCCGCAAC TCGATGGAAT CGAACTGGCA GTTAAACCAG
TCGCTGACGC CGAACGATCC GCAACTCTCG TACTCGAACT TCGATATCCG CCACCGCATT
GTTGGTACCG TTAACTACCG CCAGGTCTGG AATCCCAGGA ATGCAACGAC GGTAACGCTG
TTCTACTCGT TGCAATCGGG CACGCCTTTT TCGTGGGGCT ATGTTAACTC GACCATCGAT
GGTACCGGAC AGGCTAACAG CTTAGCCTAT ATCCCCCGTG ACCTGACCGA AGCCCAGAAA
CTGCTGCCAA CGGGTACCCA AGCCAGCGAC TTTATGGCGT TCGTTGAATC AGATTCGTAT
CTGAAAACCC GAAAAGGAAA CTTCACCGAA CGCAATGCCG GTCGTACGCC CTGGAATAAC
ACTATGGACC TGCGTTTCCT GCATGAGTTC AAGCTGAAAG GTCGTCAGTC GGTGCAGATC
AGCTACGACA TCATCAACTT CCTGAACCTG CTGGATAAGA CCCTGGGGTA CTCTTACTTT
TCACCCAACA CGTTCAATTC GACAGCTTCC ATTGGTCTGG CACGGGCCAC CAACCCAACC
AGTGGCGACC CGACCTTTAC CTGGACGGCT CCATCAGCTC CCTACTCCAT CGATCCGCTG
GGCTCACGCT GGCAGATGCA GCTTGGGGCG CGGTACTCTT TTTAG
 
Protein sequence
MRLLVTTSIA VLFLFCAGLT TWAQVTTSGL SGRITDAKNE ALVGATVQAT YTPTGTKYAA 
VTDVEGRYRI NNMNAGGPYE LVVTYVSYKT ETRSDITLQL GETTNLNITL QDASTQLSEV
VVKANREGER QGAGISVNSE TIRRLPTISR SLTDMTRLTP QVSNNNSFAG TNFRYNNVTI
DGAINNDAIG FSPSLGGSTG TTGQPGSSTR TNPVSLDAIQ DIQVAVAPFD VRLGNFLGGS
VNAVTRSGTN QVTGSVYGFG RNAALTGAWN GASNAKEKLP STFHEYQTGV RVGFPLIKNK
LFFFTNEEIT RRQDPVQFQA GSPSSLIKDA ALAQQLSDFV KANYGLDAGS YGDYAIYSNS
TKFFNRLDWN INDKNQLTIR NNTVFSEATN LERDAANFRF GSIDFKQVNN QSSTVAELKS
QFSGRASNSL ILGYSSIHDY RNTLSNVRTF PQVEIGYNGG TIFLGNDREA SIFNLRQKTF
ELTDNFTFYL GKNTFTVGTH NEFYTIDYGF VNSPNGRISY RNPTEFLAKL PNRVRGSYPF
ADGTNNLDNQ FNNPYAHFNV NLLSFYVQDD IQVSDRLKLS PGVRLDYTGL PNKPTLSPLV
TGSAGDPTNL GRTYSFTPLN QIANSYLNNV QISPRLGFTF DVNGDKSLVV RGGTGLFTGR
IPFAWLGYAF YNNGVGYGAY DFNNNATATT KLVGDPLVAN GGLLINNNPA NGGVTRTQVD
LIDNNFKMPQ MFRNNLAVDY VVGGYKLTVE GLYTKVIQDL KFQQVNTKDV VRYYSYDTQQ
QQPIYVAANG SAGAQRIDNN FANAYLLSNT NKGYRYSLTG QIQRNFPFGF GFSTAYTYGK
SFDLTNGIRN SMESNWQLNQ SLTPNDPQLS YSNFDIRHRI VGTVNYRQVW NPRNATTVTL
FYSLQSGTPF SWGYVNSTID GTGQANSLAY IPRDLTEAQK LLPTGTQASD FMAFVESDSY
LKTRKGNFTE RNAGRTPWNN TMDLRFLHEF KLKGRQSVQI SYDIINFLNL LDKTLGYSYF
SPNTFNSTAS IGLARATNPT SGDPTFTWTA PSAPYSIDPL GSRWQMQLGA RYSF