Gene Slin_4444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_4444 
Symbol 
ID8728204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp5385376 
End bp5388510 
Gene Length3135 bp 
Protein Length1044 aa 
Translation table11 
GC content52% 
IMG OID 
ProductCation/multidrug efflux pump-like protein 
Protein accessionYP_003389224 
Protein GI284039294 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00034162 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCATA GCTACCAGTT CATCCTGCTG TTTACCATCT TGTCGGTCAT GGGAGTGACG 
TTGTTGCCCC GGCTGTCGGT GCAATTAGCG CCTTCGCCGG GAGGGCGTTC TATTACAGTG
AGTTATCCCT GGCGGGGAGC CGCACCAGAG GCCCTTGAAC GGCAAGTATC GTCTCGCCTG
GAAGGCGCCT TTAGTACACT GGCCAACATT AAAAAAGTGA GGTCGGTGTC GGCTTACAAC
CGAGGGTATG TTACCATTGA ATTGGATCGT TCGGCCGATG CCGACGCCCT GCGCTTCGAA
CTGGCCGCGC TGGTTAGGCA GGTGTACCCC AAACTCCCGC CTGATGTTTC TTATCCTCAA
ATTAGCCTGA ATGCTCCCGA TGAGCAGAGC CAAACCAAAC CGCTGCTTAC CCTTCAACTG
AGCGGCCCCA GTTCTACTGC CGAGTTGCAA CGCTATGCCA ACGAACAACT AAAGCCCCGG
CTGGCCGCAA CAGAGGGCGT TGGTAGCGTG GCTGTGTTTG GGGGCACCCA ACCCGAATGG
GTCCTTACCT ACAACGCCGA TGCACTGGCT ACCCTGCAAC TAACGGAGAA CGACCTGCGG
GTGGCCGTAC AGCATTATTT TCAGCGTGAA CCTCTGGGCA AGGTTGTTTC GGCAACCGGG
CAGACGTTGC GCATTCGGCT GGATAATACA GTACAAAATG GCACCAACCA CTGGTCGCAG
ATTCCGGTAG CCAATCGGGC CGGGCGGATC ATTTACCTGA CCGATCTGGT GAGCGTAAGT
CGGCAGGAAC CACCGTCCGA CCAGTTTTAT CGCATCAATG GCAAAACAGC CGTAAACATG
GTGCTAACAG CTGCAGCAGG GGCCAATCAG CTAACGGTTG CGCAGACACT AAACCGGCAG
GTAGCGGAAC TGAACTTACC GCCCGGCTAC CGCCTGGATG TCGATTATGA TGCCACGGTG
TACATTCGGG AAAACCTGCG TAAAATTGGC ATTCAGACCA GTGTAGCCAT TGTTATTTTA
TTGCTGTTTG TTGCCCTGAC AACACGGAAC TGGTACTATG TGCTCTTGAT CGTGACCAGC
ACCGTTGTGT CGTTACTGCT GTCTGTCCTG GTTTTTGTTT GGCTACGGGT CGAGATTCAC
CTCTATTCCC TGGCTGCGCT GACGACATCG TTAGGTATTG TGATGGACAA TGTCATCGTG
ATGATTGACC ATTACCGCCG ATACCGTGAC CTGCGCGTTT TTACGGCCCT GCTAGGTGCC
ACCCTCACCA CCTGCGCCGG CCTGGTGGTT GTCTGGTTTC TGCCCGAAGA AAACCGCCAG
GCCATGAGCG ATTTTTCGGT CGTGATGGCC GTTACCCTGT TTATTTCCCT GCTGGTCGCT
ATGGCGTTTA CGCCTGCTAT GATGGAGCAG TTCTGGCCAA AAGCACAGGA AGGGCAGGAG
AGAATTATCC AAAAGCAGAC AAAAACTGCC AGACAGAAGC TTTGGGGAGA GCAATTTTAT
GGACGTGTCA TCCGTCTGCT GCTGCGGTAT CGGCGGTGGG CGCTTGCGGG GGCTGTTCTC
CTGTTCGGTC TGCCTGTGTT CTGGTTACCA ATTACCTTAG ACTCAAAGAA CCCGCTGGCT
CCTTATTACA AAGCCACGAT AGGCAGCGAC CTGTATGCCG ACAACCTGCA ACCGTATGTC
AACAAGTGGT TAGGGGGCAC GCTTCGGCTG TTTGTCAATT ACGTATATGA AGGATCATAC
CAGCGTGAGC CCGAACGTAC AGCCCTATAC GTTATTGCTG AGTTACCTAA CCAAAGTACG
CCCGAACAAA TGGATGCTAT CTTCCGGCGG TTTGAGTCTA CCTTAGGCCA ATATGGAGAA
ATTGACAAAT TTATTACCCA GATTAACAAT GGGCAGGAGG GCAATATGGT TGTTTACTTC
AAACAGTCGC ACGATACGGA AATTTTCCCC TATCAATTGA AGAACCGGGG TATCCTGCTT
TCAACCGAAA TGAGTGGTAT CGACTGGAAT ATTTATGGTG TGGGACAGGG ATTTAGCCAG
AGTCTGAACG AAGACGAATC GTCCACCTTC AATGTCGAGC TGTTTGGCTA TAATTACCGG
CAGTTGGAGC AACAGGCCAC GCTACTGAAA CAAATGCTGG AGAGCCACCC CCGCATTCAG
GAGGTCAATA TCAACCGCAG CCCTAACCTG TTTCAGCGAA AGCGACTCTA TGAATTTGTC
CTACAAACCG ACCCTCAGTT GCTGGCACTG CGGGGCATCG GCGCTTCCCA ACTCTATGAA
CGTCTGGCCG ACCTCAACGC CCGCCCCCAA CCCGACCAGT ATGCATTCAT CAACGGAGAC
TACGAACCTG TTAAACTGAT TCCTGTTCAA AGCCGAAGCG TAGATGTCTG GCAATTACAG
AACCAACCGC TAACGGTTGG CTCAGGGTCA GCCCATCTGC GCGATATTGG CACTATTACC
CGGCAGAAAG TTACCCCCGA AATTCATAAG GAAGACCAGC AGTACAAACG ACTGGTTAGC
TTTGAATACT TTGGCAGCTA TAATTTCGGT GAGACGTTTC TGACCAAAAC TCTCGACGAA
CTACGGCTAC AAATGCCGCT TGGCTACACG GCCAAAGCGG TAGACCGCTT CTGGTTTGGC
ACTGACCAAC GAACTCCCTA CGAATTAATT GGGCTGGTTG TCCTGATAAT CTACATCATT
TGCGCGATAA TTTTCGAGAG TTTATGGCAA CCCCTGGCCC TCATCGGGCT GATTCCGCTA
TCGTACATCG GTGTGTTTTT AGCCTTTTAC TGGACAGACA GTAATTTTGA TCAGGGCGGC
TACGCATCGT TTATTTTGCT GGCGGGCAAT GTTGTTTGTG CGGGCATCTT TATCGTGGCC
GAAACAAACC GGCTTGGCAA GCGGTACCCC AATCTGTCGT CATTTACAGT TTATCAAAAA
GCAGTCAGGC ATAAAATTGG CCCAGTATTG CTGACCGTCT TATCTACCGT AGTCGGGATG
GTTCCATTTC TGCTGTATGA GCAGGAAGCT TTCTGGTATG CGCTGGGTAT TGGCACCATT
GGCGGCTTAC TTATGTCCCT TGTCGCTGTA GGAATTTATT TACCTGTATT TTTATTGCCC
CAAAATCAGG TCTAA
 
Protein sequence
MRHSYQFILL FTILSVMGVT LLPRLSVQLA PSPGGRSITV SYPWRGAAPE ALERQVSSRL 
EGAFSTLANI KKVRSVSAYN RGYVTIELDR SADADALRFE LAALVRQVYP KLPPDVSYPQ
ISLNAPDEQS QTKPLLTLQL SGPSSTAELQ RYANEQLKPR LAATEGVGSV AVFGGTQPEW
VLTYNADALA TLQLTENDLR VAVQHYFQRE PLGKVVSATG QTLRIRLDNT VQNGTNHWSQ
IPVANRAGRI IYLTDLVSVS RQEPPSDQFY RINGKTAVNM VLTAAAGANQ LTVAQTLNRQ
VAELNLPPGY RLDVDYDATV YIRENLRKIG IQTSVAIVIL LLFVALTTRN WYYVLLIVTS
TVVSLLLSVL VFVWLRVEIH LYSLAALTTS LGIVMDNVIV MIDHYRRYRD LRVFTALLGA
TLTTCAGLVV VWFLPEENRQ AMSDFSVVMA VTLFISLLVA MAFTPAMMEQ FWPKAQEGQE
RIIQKQTKTA RQKLWGEQFY GRVIRLLLRY RRWALAGAVL LFGLPVFWLP ITLDSKNPLA
PYYKATIGSD LYADNLQPYV NKWLGGTLRL FVNYVYEGSY QREPERTALY VIAELPNQST
PEQMDAIFRR FESTLGQYGE IDKFITQINN GQEGNMVVYF KQSHDTEIFP YQLKNRGILL
STEMSGIDWN IYGVGQGFSQ SLNEDESSTF NVELFGYNYR QLEQQATLLK QMLESHPRIQ
EVNINRSPNL FQRKRLYEFV LQTDPQLLAL RGIGASQLYE RLADLNARPQ PDQYAFINGD
YEPVKLIPVQ SRSVDVWQLQ NQPLTVGSGS AHLRDIGTIT RQKVTPEIHK EDQQYKRLVS
FEYFGSYNFG ETFLTKTLDE LRLQMPLGYT AKAVDRFWFG TDQRTPYELI GLVVLIIYII
CAIIFESLWQ PLALIGLIPL SYIGVFLAFY WTDSNFDQGG YASFILLAGN VVCAGIFIVA
ETNRLGKRYP NLSSFTVYQK AVRHKIGPVL LTVLSTVVGM VPFLLYEQEA FWYALGIGTI
GGLLMSLVAV GIYLPVFLLP QNQV