Gene Slin_5811 details


Gene Information

Locus tag: Slin_5811
Symbol:
ID: 8729586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 7053583
End bp: 7056705
Gene Length: 3123 bp
Protein Length: 1040 aa
Translation table: 11
GC content: 55%
IMG OID:
Product: membrane-bound dehydrogenase domain protein
Protein accession: YP_003390575
Protein GI: 284040645
COG category:
COG ID:
TIGRFAM ID:
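The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes (the record does not say so explicitly) that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The function names are hypothetical and introduced only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # subtract the stop codon


# Values copied from the Gene Information fields above.
length = gene_length_bp(7053583, 7056705)
print(length)                     # 3123
print(protein_length_aa(length))  # 1040
```

Both results match the Gene Length (3123 bp) and Protein Length (1040 aa) reported in the record.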


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATCGAT CAATATCCTT TAAACGCTCG GCCTTCTCTC GTCGGCTGAT TGCTCTGTCG 
GCCGCCGGGG TGGTAGTTGG TGGCGTATTT ATCGGGGCTT ATCAGAACAA AAACCTGAAC
GGAGCCTCCA ACCCCTATCT GACAAGCCTC TTCGCTCAGC TATCTGACGA TGACAAACAC
GACCCCAAAT ACGCGGTCGG TAGTCTGAAC GTAGCGCCCG GTCTGGAGGC AACGCTCTTT
GCCGCTGAGC CCATGCTGAC CAACCCCACC GACATCGACG TCGATGCTCG GGGGCGGGTC
TGGGTCTGCG AAGCGTACAA CTACCGGCCC GCTATCAACG GCAACCCAAC CCGAAAGGAG
GGCGACCGCA TCGTTATTCT GGAAGACACC AATGGCGACG GAAAAGCCGA TGTCTCGAAA
GTATTCTACC AGGACCCGAG CATCGAATCT CCCCTCGGTA TCTGGGTGCA GGGCAACAAA
GTCATCGTGT CTGACAGCCC GAACGTCTGG GTGCTGACCG ATGAAAACGG CGACGACAAA
GCCGATAAGA AAGAACTGCT CTTCACCGGC ATCGGCGGGG AGCAGCACGA CCACGGCATG
CATACCTTCG TGTTCGGTCC CGATGGTAAA TGGTATTTCA ATTTCGGCAA CGCAGGCGAG
CAACTGCTGG ATAAAGACGG CAAACCCGTT ATCGACATTG CAACCGGCAA GCCCATCAAC
AAGCAGAATT TCAAACAGGG CATGGTATTC CGTTGCGACC CCGACGGGAA AAATGTCGAG
CTGCTGGGCC AGAATTTCCG GAACAACTAC GAAGTGGCCG TTGACTCCTA CGGCACCCTT
TGGCAGTCGG ACAACGACGA TGATGGCAAC AAAGGCGTTC GCATTAACTA CGTCATGGAG
TACGGCAACT ACGGCTACAC CGACGAACTG ACTGGCGCGG GCTGGCAGGC GAACCGGGAA
AACATAGAGC CCGAAATTCC CCGGCGTCAC TGGCACCTCA ACGACCCCGG TGCTATGCCT
AACCTGCTTC AAACGGGCGC CGGTTCCCCA ACGGGTATGA TCGTGTACGA AGGCAATCTG
CTGCCCGAAG TGTTCCGAAA TCAGATGATT CACTGCGATG CCGGTCCGAA TGTGGTGCGG
TCGTATCCGG TTCAGAAAGA TGGTGCGGGC TATAAAGCCG AGATCGTGAA TGTACTGGAA
GGTGCCCGCG ACCAGTGGTT CCGACCCGCT GACGTTTGTG TGGCTCCCGA TGGCTCGCTC
ATCATTGCCG ACTGGTACGA TCCCGGTGTG GGCGGACACC AGGCGGGCGA CCAGAGCCGG
GGGCGCGTGT ATCGCGTAGC TCCACCGAAC TCACCCTATA AAATGCCGAA AGTAGACGTA
ACGACGGTCG ATGGAGCCAT CGAAGCACTG CAGAGCCCGA ACATGAATAT TCGGTATGCG
GGTTGGCAGT CGCTGCGGAA CATGGATAAA AAGGCCGAGA AAGCACTGGC TAAACTGTAT
AAAACATCGG CCAACCCACG CATGCAGGCG CGGGCATTAT GGTTGCTGAG TAAGCTGGAC
AAAGGACAGA AATACATCGA AACGGCCCTG AAAAGCGATA ATTCCGATCT GCGCATCACC
GCGCTTCGGG CCGCTCGTGA GCTGAAAGGT GACATTACGC CCTACATCAA ACAGTTGGTA
AATGACCCTG AGCCACAGGT TCGCCGGGAG TGTGCTATTG CCTTGCGGAA GAACCAGAGC
ACCGGACCGC AGTCGCCCGA AGCTCCGGCT TTGTGGGCGC AACTGGCCAG TCAGTACGAT
GGTAAAGACC GCTGGTATCT GGAAGCCCTC GGTATTGGTG CTGATGGTAG CTGGGATAGC
TACTACACCG CCTGGGTTAA ACAGATGAAT GGGGACCCGC TGGCCAACGC GGGTGGTCGT
GATATTGTGT GGCGTGCCCG TACCAAAGAG TCGATTCCGA TGCTGGCCAA ACTGGCTGGT
GACCCATCGG TGGGTGTCAG CCAGCGGTTG CGGTATTTCC GGGCGTTCGA TTTCAACCCC
GGCGCTATGG AGAAATCCAA CGCGCTGCTT GGCATCTTAC AAGCCAATAG CAACTCGACT
GATGTAACGA AGCTAGCCCT GCGCCACCTC GACCCGGCTT TTGTGAAAAA CTCCCCGGTG
GCGACAACAG CCCTGAACAA GGTAATGAAC GACGTGTACG GAACTCCCGA ATACATCGAT
CTGGTAAGTC GCTACGAACC TGCATCCGAA AACGCCCGTC TGAAGCAGTT AGCTGTTCAG
AAAGCGAGTG ATGGGATGGG CCGTGATGCT GCCCGGCAGC TGCTCAAGCA AAAAGGCGCT
TCGATGGCCT GGGAGGTGAT TAATGGCAAT GATGCCGATG CCGCTGCGGA CATGCTGGTG
GCTTTGCGCC GGGTGGGAAA TAAAGAATCC ATCGATATAC TGAAAACTGT TGCCCTGGCC
GACAAATATC CGGCAGCGTT GCGTCGGGAG GCAACCCGTT CGCTGGGAGG TAGTTCCGAA
GGAGCCGATA TGGTTGTGGC CCTGCTTAAG TCGGGCGATA TTAAAGGCGA GTTCAAAAAG
TCAGCCGTAC AGGGCGTCAG CAACGACTGG CGGAAAAGCA TCCGGCAGCA GGCTGCCAGC
TTCCTGGATG GTGGACAGAG TGCCGAAGGC AAAAAGCTGC CCAACATTCA GGAGTTACTG
GCCATGAATG GCGACGCAGC CCGTGGCGTA TCGGTGTTCA AAAACAACTG TAACATCTGC
CATCAGGTGA ATGGCGAAGG CATGGACTTC GGACCAAAAC TGTCGGAGAT TGGCTCCAAA
CTCCCGAAAG AAGGGCAGTA TCTGGCCATC CTGCACCCCG ACGCTGGTAT TAGCTTCGGC
TACGAAGGCT GGGAAGTGAA GTTCAAAGAT GGTAGCTCTA TGACCGGTAT CGTATCGAGC
AAAACCGAAA CTGATTTGCA AATGAAGTTT CCGGGGGGCG TAGTGCAGAA TTACAAAATG
GCTGATGTCG TTAAGATGAA GCAGATTGAA AACTCCATGA TGCCGTCCGG CTTACAGGAG
GCCATGAGCA CCAAAGATTT AGTGGATTTA GTAGAGTATT TAGCCAGTTT AAAGAAAAAG
TAA
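
The GC content field can be recomputed from the gene sequence as it is printed here, in space-separated blocks of ten bases. The sketch below is one hedged way to do that; `gc_content_percent` is a hypothetical helper, not part of any database tooling, and it simply counts G and C over all non-whitespace characters.

```python
def gc_content_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the spaces and line breaks used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First row of the gene sequence above, copied verbatim as a small input.
first_row = "ATGAATCGAT CAATATCCTT TAAACGCTCG GCCTTCTCTC GTCGGCTGAT TGCTCTGTCG"
print(round(gc_content_percent(first_row), 1))
```

Passing the full gene sequence (all rows joined into one string) instead of `first_row` gives the value to compare against the reported 55%.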
 
Protein sequence
MNRSISFKRS AFSRRLIALS AAGVVVGGVF IGAYQNKNLN GASNPYLTSL FAQLSDDDKH 
DPKYAVGSLN VAPGLEATLF AAEPMLTNPT DIDVDARGRV WVCEAYNYRP AINGNPTRKE
GDRIVILEDT NGDGKADVSK VFYQDPSIES PLGIWVQGNK VIVSDSPNVW VLTDENGDDK
ADKKELLFTG IGGEQHDHGM HTFVFGPDGK WYFNFGNAGE QLLDKDGKPV IDIATGKPIN
KQNFKQGMVF RCDPDGKNVE LLGQNFRNNY EVAVDSYGTL WQSDNDDDGN KGVRINYVME
YGNYGYTDEL TGAGWQANRE NIEPEIPRRH WHLNDPGAMP NLLQTGAGSP TGMIVYEGNL
LPEVFRNQMI HCDAGPNVVR SYPVQKDGAG YKAEIVNVLE GARDQWFRPA DVCVAPDGSL
IIADWYDPGV GGHQAGDQSR GRVYRVAPPN SPYKMPKVDV TTVDGAIEAL QSPNMNIRYA
GWQSLRNMDK KAEKALAKLY KTSANPRMQA RALWLLSKLD KGQKYIETAL KSDNSDLRIT
ALRAARELKG DITPYIKQLV NDPEPQVRRE CAIALRKNQS TGPQSPEAPA LWAQLASQYD
GKDRWYLEAL GIGADGSWDS YYTAWVKQMN GDPLANAGGR DIVWRARTKE SIPMLAKLAG
DPSVGVSQRL RYFRAFDFNP GAMEKSNALL GILQANSNST DVTKLALRHL DPAFVKNSPV
ATTALNKVMN DVYGTPEYID LVSRYEPASE NARLKQLAVQ KASDGMGRDA ARQLLKQKGA
SMAWEVINGN DADAAADMLV ALRRVGNKES IDILKTVALA DKYPAALRRE ATRSLGGSSE
GADMVVALLK SGDIKGEFKK SAVQGVSNDW RKSIRQQAAS FLDGGQSAEG KKLPNIQELL
AMNGDAARGV SVFKNNCNIC HQVNGEGMDF GPKLSEIGSK LPKEGQYLAI LHPDAGISFG
YEGWEVKFKD GSSMTGIVSS KTETDLQMKF PGGVVQNYKM ADVVKMKQIE NSMMPSGLQE
AMSTKDLVDL VEYLASLKKK