Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_5811 |
Symbol | |
ID | 8729586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | + |
Start bp | 7053583 |
End bp | 7056705 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | |
Product | membrane-bound dehydrogenase domain protein |
Protein accession | YP_003390575 |
Protein GI | 284040645 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCGAT CAATATCCTT TAAACGCTCG GCCTTCTCTC GTCGGCTGAT TGCTCTGTCG GCCGCCGGGG TGGTAGTTGG TGGCGTATTT ATCGGGGCTT ATCAGAACAA AAACCTGAAC GGAGCCTCCA ACCCCTATCT GACAAGCCTC TTCGCTCAGC TATCTGACGA TGACAAACAC GACCCCAAAT ACGCGGTCGG TAGTCTGAAC GTAGCGCCCG GTCTGGAGGC AACGCTCTTT GCCGCTGAGC CCATGCTGAC CAACCCCACC GACATCGACG TCGATGCTCG GGGGCGGGTC TGGGTCTGCG AAGCGTACAA CTACCGGCCC GCTATCAACG GCAACCCAAC CCGAAAGGAG GGCGACCGCA TCGTTATTCT GGAAGACACC AATGGCGACG GAAAAGCCGA TGTCTCGAAA GTATTCTACC AGGACCCGAG CATCGAATCT CCCCTCGGTA TCTGGGTGCA GGGCAACAAA GTCATCGTGT CTGACAGCCC GAACGTCTGG GTGCTGACCG ATGAAAACGG CGACGACAAA GCCGATAAGA AAGAACTGCT CTTCACCGGC ATCGGCGGGG AGCAGCACGA CCACGGCATG CATACCTTCG TGTTCGGTCC CGATGGTAAA TGGTATTTCA ATTTCGGCAA CGCAGGCGAG CAACTGCTGG ATAAAGACGG CAAACCCGTT ATCGACATTG CAACCGGCAA GCCCATCAAC AAGCAGAATT TCAAACAGGG CATGGTATTC CGTTGCGACC CCGACGGGAA AAATGTCGAG CTGCTGGGCC AGAATTTCCG GAACAACTAC GAAGTGGCCG TTGACTCCTA CGGCACCCTT TGGCAGTCGG ACAACGACGA TGATGGCAAC AAAGGCGTTC GCATTAACTA CGTCATGGAG TACGGCAACT ACGGCTACAC CGACGAACTG ACTGGCGCGG GCTGGCAGGC GAACCGGGAA AACATAGAGC CCGAAATTCC CCGGCGTCAC TGGCACCTCA ACGACCCCGG TGCTATGCCT AACCTGCTTC AAACGGGCGC CGGTTCCCCA ACGGGTATGA TCGTGTACGA AGGCAATCTG CTGCCCGAAG TGTTCCGAAA TCAGATGATT CACTGCGATG CCGGTCCGAA TGTGGTGCGG TCGTATCCGG TTCAGAAAGA TGGTGCGGGC TATAAAGCCG AGATCGTGAA TGTACTGGAA GGTGCCCGCG ACCAGTGGTT CCGACCCGCT GACGTTTGTG TGGCTCCCGA TGGCTCGCTC ATCATTGCCG ACTGGTACGA TCCCGGTGTG GGCGGACACC AGGCGGGCGA CCAGAGCCGG GGGCGCGTGT ATCGCGTAGC TCCACCGAAC TCACCCTATA AAATGCCGAA AGTAGACGTA ACGACGGTCG ATGGAGCCAT CGAAGCACTG CAGAGCCCGA ACATGAATAT TCGGTATGCG GGTTGGCAGT CGCTGCGGAA CATGGATAAA AAGGCCGAGA AAGCACTGGC TAAACTGTAT AAAACATCGG CCAACCCACG CATGCAGGCG CGGGCATTAT GGTTGCTGAG TAAGCTGGAC AAAGGACAGA AATACATCGA AACGGCCCTG AAAAGCGATA ATTCCGATCT GCGCATCACC GCGCTTCGGG CCGCTCGTGA GCTGAAAGGT GACATTACGC CCTACATCAA ACAGTTGGTA AATGACCCTG AGCCACAGGT TCGCCGGGAG TGTGCTATTG CCTTGCGGAA GAACCAGAGC ACCGGACCGC AGTCGCCCGA AGCTCCGGCT TTGTGGGCGC AACTGGCCAG TCAGTACGAT GGTAAAGACC GCTGGTATCT GGAAGCCCTC GGTATTGGTG CTGATGGTAG CTGGGATAGC TACTACACCG CCTGGGTTAA ACAGATGAAT GGGGACCCGC TGGCCAACGC GGGTGGTCGT GATATTGTGT GGCGTGCCCG TACCAAAGAG TCGATTCCGA TGCTGGCCAA ACTGGCTGGT GACCCATCGG TGGGTGTCAG CCAGCGGTTG CGGTATTTCC GGGCGTTCGA TTTCAACCCC GGCGCTATGG AGAAATCCAA CGCGCTGCTT GGCATCTTAC AAGCCAATAG CAACTCGACT GATGTAACGA AGCTAGCCCT GCGCCACCTC GACCCGGCTT TTGTGAAAAA CTCCCCGGTG GCGACAACAG CCCTGAACAA GGTAATGAAC GACGTGTACG GAACTCCCGA ATACATCGAT CTGGTAAGTC GCTACGAACC TGCATCCGAA AACGCCCGTC TGAAGCAGTT AGCTGTTCAG AAAGCGAGTG ATGGGATGGG CCGTGATGCT GCCCGGCAGC TGCTCAAGCA AAAAGGCGCT TCGATGGCCT GGGAGGTGAT TAATGGCAAT GATGCCGATG CCGCTGCGGA CATGCTGGTG GCTTTGCGCC GGGTGGGAAA TAAAGAATCC ATCGATATAC TGAAAACTGT TGCCCTGGCC GACAAATATC CGGCAGCGTT GCGTCGGGAG GCAACCCGTT CGCTGGGAGG TAGTTCCGAA GGAGCCGATA TGGTTGTGGC CCTGCTTAAG TCGGGCGATA TTAAAGGCGA GTTCAAAAAG TCAGCCGTAC AGGGCGTCAG CAACGACTGG CGGAAAAGCA TCCGGCAGCA GGCTGCCAGC TTCCTGGATG GTGGACAGAG TGCCGAAGGC AAAAAGCTGC CCAACATTCA GGAGTTACTG GCCATGAATG GCGACGCAGC CCGTGGCGTA TCGGTGTTCA AAAACAACTG TAACATCTGC CATCAGGTGA ATGGCGAAGG CATGGACTTC GGACCAAAAC TGTCGGAGAT TGGCTCCAAA CTCCCGAAAG AAGGGCAGTA TCTGGCCATC CTGCACCCCG ACGCTGGTAT TAGCTTCGGC TACGAAGGCT GGGAAGTGAA GTTCAAAGAT GGTAGCTCTA TGACCGGTAT CGTATCGAGC AAAACCGAAA CTGATTTGCA AATGAAGTTT CCGGGGGGCG TAGTGCAGAA TTACAAAATG GCTGATGTCG TTAAGATGAA GCAGATTGAA AACTCCATGA TGCCGTCCGG CTTACAGGAG GCCATGAGCA CCAAAGATTT AGTGGATTTA GTAGAGTATT TAGCCAGTTT AAAGAAAAAG TAA
|
Protein sequence | MNRSISFKRS AFSRRLIALS AAGVVVGGVF IGAYQNKNLN GASNPYLTSL FAQLSDDDKH DPKYAVGSLN VAPGLEATLF AAEPMLTNPT DIDVDARGRV WVCEAYNYRP AINGNPTRKE GDRIVILEDT NGDGKADVSK VFYQDPSIES PLGIWVQGNK VIVSDSPNVW VLTDENGDDK ADKKELLFTG IGGEQHDHGM HTFVFGPDGK WYFNFGNAGE QLLDKDGKPV IDIATGKPIN KQNFKQGMVF RCDPDGKNVE LLGQNFRNNY EVAVDSYGTL WQSDNDDDGN KGVRINYVME YGNYGYTDEL TGAGWQANRE NIEPEIPRRH WHLNDPGAMP NLLQTGAGSP TGMIVYEGNL LPEVFRNQMI HCDAGPNVVR SYPVQKDGAG YKAEIVNVLE GARDQWFRPA DVCVAPDGSL IIADWYDPGV GGHQAGDQSR GRVYRVAPPN SPYKMPKVDV TTVDGAIEAL QSPNMNIRYA GWQSLRNMDK KAEKALAKLY KTSANPRMQA RALWLLSKLD KGQKYIETAL KSDNSDLRIT ALRAARELKG DITPYIKQLV NDPEPQVRRE CAIALRKNQS TGPQSPEAPA LWAQLASQYD GKDRWYLEAL GIGADGSWDS YYTAWVKQMN GDPLANAGGR DIVWRARTKE SIPMLAKLAG DPSVGVSQRL RYFRAFDFNP GAMEKSNALL GILQANSNST DVTKLALRHL DPAFVKNSPV ATTALNKVMN DVYGTPEYID LVSRYEPASE NARLKQLAVQ KASDGMGRDA ARQLLKQKGA SMAWEVINGN DADAAADMLV ALRRVGNKES IDILKTVALA DKYPAALRRE ATRSLGGSSE GADMVVALLK SGDIKGEFKK SAVQGVSNDW RKSIRQQAAS FLDGGQSAEG KKLPNIQELL AMNGDAARGV SVFKNNCNIC HQVNGEGMDF GPKLSEIGSK LPKEGQYLAI LHPDAGISFG YEGWEVKFKD GSSMTGIVSS KTETDLQMKF PGGVVQNYKM ADVVKMKQIE NSMMPSGLQE AMSTKDLVDL VEYLASLKKK
|
| |