Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Slin_0821 |
Symbol | |
ID | 8724552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Spirosoma linguale DSM 74 |
Kingdom | Bacteria |
Replicon accession | NC_013730 |
Strand | - |
Start bp | 995878 |
End bp | 999066 |
Gene Length | 3189 bp |
Protein Length | 1062 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | |
Product | transporter, hydrophobe/amphiphile efflux-1 (HAE1) family |
Protein accession | YP_003385682 |
Protein GI | 284035752 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.265502 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.847415 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAAT TGTTCATCGA ACGCCCGGTA CTGGCCACGG TCATTTCGAC CCTGCTGGTT ATTCTGGGCG TCATCTCCCT GCTGTCGCTC CCTGTCACGC AGTTTCCCGA AATCGCTCCC CCCAGCGTGC AAGTGGCCGC ATCGTACCCT GGTGCCAACG CCGACGTGGT GGCCCGTTCC GTCGCTACGC CCCTCGAAGA GGCCATCAAC GGGGTCGAGA ATATGACGTA CATGACCTCC TCGTCGGGTA ATGACGGGTC GGTGGCGATC AACATTTATT TCAAGCTGGG CACCAACCCC GATCTGGCGG CTGTGAACGT ACAGAACCGG GTGGCCAAAG CCACGAGCCT GCTGCCGGCC GAAGTAATTC AGGCCGGCAT TTCGACGCAG AAGCAGCAGA ACAGCATGAT CATGATCCTC AACCTGAACA GCAACGAAGA GGCTTACGAC GAAACGTTTT TGCAGAACTA CGCCAAAATA AACCTGATTC CGGAGTTGCA GCGGGTCAAC GGCGTAGGAC AGGTGATGGT CTTTGGCGTG AAGGATTATT CGATGCGGGT CTGGCTCAAA CCGGACCGGT TGGTCGCGCT TGGTTTATCG CCCCAGGAAG TACTAAGTGC CATTCGGGAG CAGAATCTGG AAGCGGCACC GGGTAAAATT GGGGAGAACA GCCGGGAAGC CTTCGAGTAT GTGATTAAGT ACAAGGGAAA ACTCAACCAG CCGGAGCAGT ACGAAAACAT CATCCTGAAA GCGAATACCG ATGGCTCGCT CATTCAACTC AAGGATGTCG CGCGGATTGA ATTTGGCTCC TTCACCTACA GTGGCGACAC CCGCGTGAAT GGCAAGCCCA GCGTCGGCAT TGCGATTAAC CAGATGGCAG GCTCGAACGC CAACGACATT CAGGTAGCGA TTCTGTCCAT TATGGACAAA GCCGCCGGAG CCTTTCCCAA AGGCATAAAC TATACCATCG GCTACAGCAC CAAAACGTTC CTTGATGAAT CCATCGATCA GGTAACGCAC ACGCTGATCG AAGCCTTTAT CCTGGTATTT ATTGTCGTAT TTCTGTTTCT ACAGGACTTT CGCTCCACGC TTATTCCGGC CATTGCGGTG CCCGTTGCCA TTGTCGGTAC GTTCTTTTTC ATGCAGTTGT TTGGCTTTAC GATCAACTTG CTGACGCTTT TTGCGCTGGT ACTGGCCATC GGTATCGTGG TCGATGATGC GATTGTGGTG GTCGAAGCCG TTCACGCCAA GATGGAAAAA AGCCGACAGT CGGCGCGGTC GGCAACGATC CAATCCATGC AGGAAATTTC AGGAGCTATC ATTTCCATTA CACTGGTGAT GGCGGCTGTA TTCGTGCCGG TCGGGTTCAT GAATGGGCCG GCGGGGGTTT TCTATCAGCA GTTCGCCTTC ACGCTGGCTA TCGCCATTCT GATTTCAGCG GTAAACGCGC TCACGCTGAG TCCGGCGCTC TGTGCACTAC TTTTAAAAAA CCCGCATGGC GACGACGACC ACGTTTACGC AAAAAAAGGC TTCCTGACTC GCTTCTTCGA CGCCTTCAAC GCGGGCTTTA CCTCCCTGAC CAGCAAGTAT GTCAGGAGCC TTCGGTTTCT AATCCGCAAC AAATGGGTCG GTCTGAGCGG ACTGACGTTG GTAACGGCCG TCACGGTTTT TCTGATGCGG ACCACACCAA CGGGTTTTAT TCCCTCGGAG GATCAGGGGT TTATCGCCTA CTCGCTGAAA CTTCCGGCGG GATCATCGTT GCAGCGAACT CAGAAAGTAG CCGACAAAAT TGAAGGCATT CTGCACAAAA CGCCCGCCGT CGAGCAGCAT ATCGAAATCA GTGGGTTCAA CATGATTGCC AACTCCGCCA GCCCATCCTA TGCCGCCGGG TTCGTTAAAA TGAAGCCCTA CGAAGACCGG GGAGCCGTCA AGGACCTTCA GCAGGTGGTC GATTCGGTGA GCAAGCAGGT GGCGGGGGTC GAAGAGGGGC GGGTCGATGT ATTTACCATG CCGACGGTTC CGGGATTCAG CAACGTCGAT GGCTTTGAGT TGTTGCTACA GGATCGGACC GGCGGCAAAC TCGATAAGCT CAGTGCCACG GCCAACGCCT TTATCGAGGA ACTACAAAAG CGTCCCGAAA TCGCGGCTGC CTTCACAACG TTCGATACGG GCACGCCCCA GTTTGAGCTG GAACTGGACG TAAAGAAAGC AAAACAACTC GGCGTTTCGA CCAGCGATAT TCTGCAAACG ATGCAGGTGT ATTACGGCAG CACGTTTGCC TCGGACTTCA ACCGGTTCGG TAAATTCTAC CGCGTCATTG CGCAGGCGGA TGCCGCGTAT CGGGCTGACC CATCGTCGCT GAACAGTATT TACGTGAAGA ACGCCACCGG ACAAATGGTG CCGATGACGA CATTCGTTAC CTTGAAGCGC GTCTATGGAC CGGAAGCCAT CACCCGGAAT AATCTGTTTA CCTCGGTCGC CATCAACGGA CAGGCCAAGC CGGGGTACAG CACGGGGGAT GCCATCCGGG CGGTGGAAGA AGTGGCAAAG CAAAGCCTGC CCGTGGGCTA CACCTACGAA TGGACGGGCA TGACGCGCGA AGAAATCGCG GCTGGTAGTC AGTCGAGTCT TATTTTTGGG CTCAGTCTGG TGTTTGTTTA TTTCCTGCTG GCGGCTCAGT ACGAAAGTTA CGTACTGCCG TGGGCGGTGT TATTGTCCAT TCCAACCGGG ATTCTGGGCG TTTTCGGGTT TATCAATCTG GCAGGCATCG ACAACAATAT TTACGTCCAG GTGGGTTTGA TCATGCTGAT CGGGTTGCTG GCCAAAAATG CCATTCTGAT TGTCGAATTT GCTATCCAGC GACGGCAGGC GGGTATGGGT TTAGTGGCGT CGGCACTAGA CGCGGCCAAA CTGCGGCTTC GCCCGATTCT GATGACCTCG TTTGCGTTCA TTGTTGGTCT GGTACCGTTG ATGAGTGCCA CGGGAGCATC GGCCAAAGGA AACCATTCGA TCAGTATCGG GACAGCGGGC GGTATGCTAA CCGGCGTACT GCTGGGTCTG TTTATCATTC CCGTGCTGTT CGTCATTTTT CAGGGAATTC AGGAAAAAAT CATTCGGCCC AAAACCGCCG AAGAACGGAA AGCGCTGGCC GAAGAGGCCT TTGCCAACAA TCCGCTAACC CGTAATTAA
|
Protein sequence | MFKLFIERPV LATVISTLLV ILGVISLLSL PVTQFPEIAP PSVQVAASYP GANADVVARS VATPLEEAIN GVENMTYMTS SSGNDGSVAI NIYFKLGTNP DLAAVNVQNR VAKATSLLPA EVIQAGISTQ KQQNSMIMIL NLNSNEEAYD ETFLQNYAKI NLIPELQRVN GVGQVMVFGV KDYSMRVWLK PDRLVALGLS PQEVLSAIRE QNLEAAPGKI GENSREAFEY VIKYKGKLNQ PEQYENIILK ANTDGSLIQL KDVARIEFGS FTYSGDTRVN GKPSVGIAIN QMAGSNANDI QVAILSIMDK AAGAFPKGIN YTIGYSTKTF LDESIDQVTH TLIEAFILVF IVVFLFLQDF RSTLIPAIAV PVAIVGTFFF MQLFGFTINL LTLFALVLAI GIVVDDAIVV VEAVHAKMEK SRQSARSATI QSMQEISGAI ISITLVMAAV FVPVGFMNGP AGVFYQQFAF TLAIAILISA VNALTLSPAL CALLLKNPHG DDDHVYAKKG FLTRFFDAFN AGFTSLTSKY VRSLRFLIRN KWVGLSGLTL VTAVTVFLMR TTPTGFIPSE DQGFIAYSLK LPAGSSLQRT QKVADKIEGI LHKTPAVEQH IEISGFNMIA NSASPSYAAG FVKMKPYEDR GAVKDLQQVV DSVSKQVAGV EEGRVDVFTM PTVPGFSNVD GFELLLQDRT GGKLDKLSAT ANAFIEELQK RPEIAAAFTT FDTGTPQFEL ELDVKKAKQL GVSTSDILQT MQVYYGSTFA SDFNRFGKFY RVIAQADAAY RADPSSLNSI YVKNATGQMV PMTTFVTLKR VYGPEAITRN NLFTSVAING QAKPGYSTGD AIRAVEEVAK QSLPVGYTYE WTGMTREEIA AGSQSSLIFG LSLVFVYFLL AAQYESYVLP WAVLLSIPTG ILGVFGFINL AGIDNNIYVQ VGLIMLIGLL AKNAILIVEF AIQRRQAGMG LVASALDAAK LRLRPILMTS FAFIVGLVPL MSATGASAKG NHSISIGTAG GMLTGVLLGL FIIPVLFVIF QGIQEKIIRP KTAEERKALA EEAFANNPLT RN
|
| |