Gene Slin_0821 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_0821 
Symbol 
ID8724552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp995878 
End bp999066 
Gene Length3189 bp 
Protein Length1062 aa 
Translation table11 
GC content54% 
IMG OID 
Producttransporter, hydrophobe/amphiphile efflux-1 (HAE1) family 
Protein accessionYP_003385682 
Protein GI284035752 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.265502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.847415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTAAAT TGTTCATCGA ACGCCCGGTA CTGGCCACGG TCATTTCGAC CCTGCTGGTT 
ATTCTGGGCG TCATCTCCCT GCTGTCGCTC CCTGTCACGC AGTTTCCCGA AATCGCTCCC
CCCAGCGTGC AAGTGGCCGC ATCGTACCCT GGTGCCAACG CCGACGTGGT GGCCCGTTCC
GTCGCTACGC CCCTCGAAGA GGCCATCAAC GGGGTCGAGA ATATGACGTA CATGACCTCC
TCGTCGGGTA ATGACGGGTC GGTGGCGATC AACATTTATT TCAAGCTGGG CACCAACCCC
GATCTGGCGG CTGTGAACGT ACAGAACCGG GTGGCCAAAG CCACGAGCCT GCTGCCGGCC
GAAGTAATTC AGGCCGGCAT TTCGACGCAG AAGCAGCAGA ACAGCATGAT CATGATCCTC
AACCTGAACA GCAACGAAGA GGCTTACGAC GAAACGTTTT TGCAGAACTA CGCCAAAATA
AACCTGATTC CGGAGTTGCA GCGGGTCAAC GGCGTAGGAC AGGTGATGGT CTTTGGCGTG
AAGGATTATT CGATGCGGGT CTGGCTCAAA CCGGACCGGT TGGTCGCGCT TGGTTTATCG
CCCCAGGAAG TACTAAGTGC CATTCGGGAG CAGAATCTGG AAGCGGCACC GGGTAAAATT
GGGGAGAACA GCCGGGAAGC CTTCGAGTAT GTGATTAAGT ACAAGGGAAA ACTCAACCAG
CCGGAGCAGT ACGAAAACAT CATCCTGAAA GCGAATACCG ATGGCTCGCT CATTCAACTC
AAGGATGTCG CGCGGATTGA ATTTGGCTCC TTCACCTACA GTGGCGACAC CCGCGTGAAT
GGCAAGCCCA GCGTCGGCAT TGCGATTAAC CAGATGGCAG GCTCGAACGC CAACGACATT
CAGGTAGCGA TTCTGTCCAT TATGGACAAA GCCGCCGGAG CCTTTCCCAA AGGCATAAAC
TATACCATCG GCTACAGCAC CAAAACGTTC CTTGATGAAT CCATCGATCA GGTAACGCAC
ACGCTGATCG AAGCCTTTAT CCTGGTATTT ATTGTCGTAT TTCTGTTTCT ACAGGACTTT
CGCTCCACGC TTATTCCGGC CATTGCGGTG CCCGTTGCCA TTGTCGGTAC GTTCTTTTTC
ATGCAGTTGT TTGGCTTTAC GATCAACTTG CTGACGCTTT TTGCGCTGGT ACTGGCCATC
GGTATCGTGG TCGATGATGC GATTGTGGTG GTCGAAGCCG TTCACGCCAA GATGGAAAAA
AGCCGACAGT CGGCGCGGTC GGCAACGATC CAATCCATGC AGGAAATTTC AGGAGCTATC
ATTTCCATTA CACTGGTGAT GGCGGCTGTA TTCGTGCCGG TCGGGTTCAT GAATGGGCCG
GCGGGGGTTT TCTATCAGCA GTTCGCCTTC ACGCTGGCTA TCGCCATTCT GATTTCAGCG
GTAAACGCGC TCACGCTGAG TCCGGCGCTC TGTGCACTAC TTTTAAAAAA CCCGCATGGC
GACGACGACC ACGTTTACGC AAAAAAAGGC TTCCTGACTC GCTTCTTCGA CGCCTTCAAC
GCGGGCTTTA CCTCCCTGAC CAGCAAGTAT GTCAGGAGCC TTCGGTTTCT AATCCGCAAC
AAATGGGTCG GTCTGAGCGG ACTGACGTTG GTAACGGCCG TCACGGTTTT TCTGATGCGG
ACCACACCAA CGGGTTTTAT TCCCTCGGAG GATCAGGGGT TTATCGCCTA CTCGCTGAAA
CTTCCGGCGG GATCATCGTT GCAGCGAACT CAGAAAGTAG CCGACAAAAT TGAAGGCATT
CTGCACAAAA CGCCCGCCGT CGAGCAGCAT ATCGAAATCA GTGGGTTCAA CATGATTGCC
AACTCCGCCA GCCCATCCTA TGCCGCCGGG TTCGTTAAAA TGAAGCCCTA CGAAGACCGG
GGAGCCGTCA AGGACCTTCA GCAGGTGGTC GATTCGGTGA GCAAGCAGGT GGCGGGGGTC
GAAGAGGGGC GGGTCGATGT ATTTACCATG CCGACGGTTC CGGGATTCAG CAACGTCGAT
GGCTTTGAGT TGTTGCTACA GGATCGGACC GGCGGCAAAC TCGATAAGCT CAGTGCCACG
GCCAACGCCT TTATCGAGGA ACTACAAAAG CGTCCCGAAA TCGCGGCTGC CTTCACAACG
TTCGATACGG GCACGCCCCA GTTTGAGCTG GAACTGGACG TAAAGAAAGC AAAACAACTC
GGCGTTTCGA CCAGCGATAT TCTGCAAACG ATGCAGGTGT ATTACGGCAG CACGTTTGCC
TCGGACTTCA ACCGGTTCGG TAAATTCTAC CGCGTCATTG CGCAGGCGGA TGCCGCGTAT
CGGGCTGACC CATCGTCGCT GAACAGTATT TACGTGAAGA ACGCCACCGG ACAAATGGTG
CCGATGACGA CATTCGTTAC CTTGAAGCGC GTCTATGGAC CGGAAGCCAT CACCCGGAAT
AATCTGTTTA CCTCGGTCGC CATCAACGGA CAGGCCAAGC CGGGGTACAG CACGGGGGAT
GCCATCCGGG CGGTGGAAGA AGTGGCAAAG CAAAGCCTGC CCGTGGGCTA CACCTACGAA
TGGACGGGCA TGACGCGCGA AGAAATCGCG GCTGGTAGTC AGTCGAGTCT TATTTTTGGG
CTCAGTCTGG TGTTTGTTTA TTTCCTGCTG GCGGCTCAGT ACGAAAGTTA CGTACTGCCG
TGGGCGGTGT TATTGTCCAT TCCAACCGGG ATTCTGGGCG TTTTCGGGTT TATCAATCTG
GCAGGCATCG ACAACAATAT TTACGTCCAG GTGGGTTTGA TCATGCTGAT CGGGTTGCTG
GCCAAAAATG CCATTCTGAT TGTCGAATTT GCTATCCAGC GACGGCAGGC GGGTATGGGT
TTAGTGGCGT CGGCACTAGA CGCGGCCAAA CTGCGGCTTC GCCCGATTCT GATGACCTCG
TTTGCGTTCA TTGTTGGTCT GGTACCGTTG ATGAGTGCCA CGGGAGCATC GGCCAAAGGA
AACCATTCGA TCAGTATCGG GACAGCGGGC GGTATGCTAA CCGGCGTACT GCTGGGTCTG
TTTATCATTC CCGTGCTGTT CGTCATTTTT CAGGGAATTC AGGAAAAAAT CATTCGGCCC
AAAACCGCCG AAGAACGGAA AGCGCTGGCC GAAGAGGCCT TTGCCAACAA TCCGCTAACC
CGTAATTAA
 
Protein sequence
MFKLFIERPV LATVISTLLV ILGVISLLSL PVTQFPEIAP PSVQVAASYP GANADVVARS 
VATPLEEAIN GVENMTYMTS SSGNDGSVAI NIYFKLGTNP DLAAVNVQNR VAKATSLLPA
EVIQAGISTQ KQQNSMIMIL NLNSNEEAYD ETFLQNYAKI NLIPELQRVN GVGQVMVFGV
KDYSMRVWLK PDRLVALGLS PQEVLSAIRE QNLEAAPGKI GENSREAFEY VIKYKGKLNQ
PEQYENIILK ANTDGSLIQL KDVARIEFGS FTYSGDTRVN GKPSVGIAIN QMAGSNANDI
QVAILSIMDK AAGAFPKGIN YTIGYSTKTF LDESIDQVTH TLIEAFILVF IVVFLFLQDF
RSTLIPAIAV PVAIVGTFFF MQLFGFTINL LTLFALVLAI GIVVDDAIVV VEAVHAKMEK
SRQSARSATI QSMQEISGAI ISITLVMAAV FVPVGFMNGP AGVFYQQFAF TLAIAILISA
VNALTLSPAL CALLLKNPHG DDDHVYAKKG FLTRFFDAFN AGFTSLTSKY VRSLRFLIRN
KWVGLSGLTL VTAVTVFLMR TTPTGFIPSE DQGFIAYSLK LPAGSSLQRT QKVADKIEGI
LHKTPAVEQH IEISGFNMIA NSASPSYAAG FVKMKPYEDR GAVKDLQQVV DSVSKQVAGV
EEGRVDVFTM PTVPGFSNVD GFELLLQDRT GGKLDKLSAT ANAFIEELQK RPEIAAAFTT
FDTGTPQFEL ELDVKKAKQL GVSTSDILQT MQVYYGSTFA SDFNRFGKFY RVIAQADAAY
RADPSSLNSI YVKNATGQMV PMTTFVTLKR VYGPEAITRN NLFTSVAING QAKPGYSTGD
AIRAVEEVAK QSLPVGYTYE WTGMTREEIA AGSQSSLIFG LSLVFVYFLL AAQYESYVLP
WAVLLSIPTG ILGVFGFINL AGIDNNIYVQ VGLIMLIGLL AKNAILIVEF AIQRRQAGMG
LVASALDAAK LRLRPILMTS FAFIVGLVPL MSATGASAKG NHSISIGTAG GMLTGVLLGL
FIIPVLFVIF QGIQEKIIRP KTAEERKALA EEAFANNPLT RN