Gene Slin_5025 details

Gene Information

Locus tag: Slin_5025
Symbol:
ID: 8728790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Spirosoma linguale DSM 74
Kingdom: Bacteria
Replicon accession: NC_013730
Strand:
Start bp: 6122862
End bp: 6126032
Gene Length: 3171 bp
Protein Length: 1056 aa
Translation table: 11
GC content: 54%
IMG OID:
Product: putative transcriptional regulator, Crp/Fnr family
Protein accession: YP_003389801
Protein GI: 284039871
COG category:
COG ID:
TIGRFAM ID:
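
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the source record.

```python
# Values copied from the Gene Information record above.
start_bp = 6122862
end_bp = 6126032

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 3171 1056
```

Both results match the listed Gene Length (3171 bp) and Protein Length (1056 aa).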


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00524189
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGCTG GTGCAGATTC TACTTCTCAA CCTATTTCTA CTCCTCAGCG ATGGCATCGT 
GCGTTAGGCA TCCGGCCGGA GGAAAGCCGA ACGGTGGGGT TATTTTTCAT TCATAACTTT
TTGCTTGGCA TCGGCACTAT TCTCATTTAT GTAGCAGCCA ACGCAATTCT GCTTGAAAAT
CAACCTGATA CAAGTCTTCC TATTGCCTAT ATAGCCTCGG CAGCAGCCAT GATCGGCATT
GGCCGAATCT ATGGCTACTT TGAACATCAC CTGGCCTTAC GCAGTCTGGC CGTTCGGGTG
TTGATCGCGG CTGTTGTGAT GACGATTGTG GTGGGTATTC TGGTCGTTGT TGGGCATTCC
ATTATGGCGG CTGTAGCCAT CATGACGGGC TATCGAATTA TTTACCTGCT TACCAACCTC
GAATTCTGGG GCGTTTCGGC CGTTGTATTT GATACGCGTC AGAGTAAGCG TTTATTTGGC
GTGATCAGCT CCGGCGATAT GCCCGCCAAA GCCATAGGCG CTATCCTGGC TGCTCTGGTA
CATGCCCATG CCGATGTATT GCGGTTGTTG CTGGTTGCTT TCGGGGCCTT TCTGGCAGCA
CTTTATATTC TTCAGCTAAC TATTCAGTCG CACGATGTAC ACGCGCCACA GGGGGCCGAC
CGGGTCGCCC GCCGGGAGCC GTCCCGATTA ATTGGCAAGC GGTTCGGCGG GAGTGAACTG
ATTTTCTACA TGTGCCTGAG TATGGCCTTT CTGGCCGCCG TGGCAACCGA GATCGAATAC
AATTTCTTCA TTAACGTGAA GCACCGCTTT CACGACCAAA CGGATGTTAT CCGCTACGTG
AGTTATGTAC TGGCGTTGAC GTACGGAGTG GCCATGCTGG CTAAATTGCT GATGTCGAGG
CAGGTACTCG ACCGATTTGG GATACAGCGC TCGTTGCTGG TATTGCCCTG GCTGGCCTTG
ACGGGGCTAA TTGGCTTAAT CGGACTACGC TATTTTACCA CCAGTGAAAC CGTCTTGCTG
GTTTACTTCT GTGGCTTATA TCTGCTCTTT GAAGTAGCCC GCCGGGCGCT GTTCGATCCC
GTATTTCTCG TTTTATTTCA ACCCCTGTTG CCCCCTCAGC GACTCAAGGG ACACACACTG
GCCAAAGGTT TATATGAACC CATTGGCTTA GGTGTGGCCG GTCTGCTCAT TTATGTTTTG
CATACAACGC TCGAATCGGG TGGCGTACTG GTATGGATTT TACTCCTGGC GGGCGCTATC
TGGCTGCTGC GCCACACGTA TAAACGGTAT CTGCATGAGT TGACGGATGC CATTGGCCGA
CGCTTTCTGG AAAGCGATCA ACTAGCGATG CCAACGGTAG CGCAGGCCAG TCTGGTCAGG
CAGTTGCAAA GTGACCGACC CGGCGACGTG CTAACGGCAA TTAGTTGGCT GGATACGCAT
AATCCGGACG AATTAAGTCG GCAAATCCCT TCGCTGCTTA CGCATACTAA TGCCAGTATC
CGCCGAAGAA CGCTGGCCGC AGCGGCCCGG ATGCAGCAAC CCCTGCCCGT CCGTCAGGTT
ACGCACATTG CCCTGACGGA CGATGACCCG GCCTTACGTC AGCAAGCGGC CTACCTGTTG
GGACGACGCA TAAATGCAGA AGCCGCTGAA CTGGCCCCGC TTATTAATCA CACCGATCTC
GCTATCCGGC AGGGGGCTAT CAAGGGGGTG CTGGAATTAA ACCCACACGA TCCGGCGGCT
CTTGCGGCCC TGACTAAACT GGCCACCGCT GCGGACGTTG TTCTTCAGCA ACTAGCTTTA
TCCCTCATTG CCACTCTCAG ACTGACCAGC TTTGCCGCCC TGGTTGGCGA ACGGCTGGAG
AGCCCGAGTG CCGATGTACG AAAGGCCGCC ATCAATGCGG CAGGCCATTT GCCCAGTACC
AATCTGACCC GGTATTTACT GGATAACCTT ACTCACAAGA CCATCGGGCG AGCTGTAACC
CAGGCACTGA AAGCCCGCGG CAGCGAAACC ATTGCAGCCC TACGTCAGGT GCGGCAGTCG
ACTACAGATC GACTCGTATT GGAACGTATT GCGGCCGTTT GCGGAGCTAT ACCTACGCCG
GAGAGCCGCC ACGTGCTCAA TGGGCTGGCG CAGCAACCCG ATGTATTGGT GCGGAGTGCC
GCCCTCCGGG CCTTGCGCCG GTTCCCGAAC GAACCCGAAG ACGACGCACT TTTTCGCGCG
GTCATGCAGG AAGAGGTGTT ACTGGCCCAG CGGCTCCTGC ACGGGAGTTT GACGGATACA
GCCCTGACGA AGACGCTGGA TTATGAACTG TCGGTACTGC TACAGCGGCT GTTCGATGTG
CTGATGCAGC TTTACGACTC CGAAACCATT GCCGGGGCTC GCATGGGTAT TGGCCATCCT
GCCCGCGAAC GGCGGGCCAA CTCGCTCGAA ATGCTGGATA ACCTGATTCC ACGAGCTACC
TACCAAACGC TTCAAGTACT CGTAGATGAT CTGCCCCGCA CCGAAAAAAA TCGTATTCTT
GATGCGGAAT TAGGGCCTTT CGACGACCGG GAACCCATCC GGGCGTTTGT GTTGCGAATG
GGTGAAACCG TTTTTTCGTC CTGGACGGTC GATGTAGTGC TGCGTACATT ACCACCGGCT
GACGCTTTGC TGGCCCGGCA GTCTTTAGAA TCCCGTTTTG GTACTTTATC CCCTTTGCTT
ATGTCGCATG CCCCTCATTC TACCGAGCAG ATTTCAGCCT ACGACCGGGT ATTGCTATTA
TCCCAGGCCA GTCTTTTTTC CCAAACGCCC GAGAATGTAC TGGCGAGTAT CACGCCAATT
ATGAAAGAAG AGGAACATCA AGCCGGTGTA TCCATTTTCC GTAAAGGCGA TCTGGGCACG
GGCATGTACG TTATCTACAC CGGCGAAGTA GCTATTCTGG ATGGCGATAC CGAACTGGCT
CGTTTTGGGC GGGGCGACTT CTTTGGCGAG CTGGCGCTGC TGGATACCGA AGCCCGCTCG
GCTACGGCAG AGGCCATTAC CGATGTTCGG CTGCTGCGTA TTGACCAGGA TGATTTTTAC
GACCTGATGG AAGAACGGAG CGAGGTTCTT CGTAGTATTG TTAAAAGCTT GTCAGGGCGT
ATTCGGCGGC AGAACGAATT GCTGTCAAAC CGGGCTACAA CACCGCAGTA G
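
The gene sequence is displayed in space-separated groups of ten bases, so the groups need to be joined before any analysis. The following is a minimal Python sketch that uses only the first and last rows shown above as sample input. The helper names `clean_sequence` and `gc_fraction` are hypothetical and introduced here for illustration.

```python
def clean_sequence(block: str) -> str:
    """Remove all whitespace from a grouped sequence block and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First and last rows of the gene sequence shown above (not the full gene).
first_row = "ATGATTGCTG GTGCAGATTC TACTTCTCAA CCTATTTCTA CTCCTCAGCG ATGGCATCGT"
last_row = "ATTCGGCGGC AGAACGAATT GCTGTCAAAC CGGGCTACAA CACCGCAGTA G"

first = clean_sequence(first_row)
last = clean_sequence(last_row)

print(first[:3])               # start codon of the gene
print(last[-3:])               # final codon of the gene
print(len(first), len(last))   # number of bases in each sample row
```

Passing the complete sequence block through `clean_sequence` and then `gc_fraction` should give a value that can be compared with the GC content listed in the Gene Information section.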
 
Protein sequence
MIAGADSTSQ PISTPQRWHR ALGIRPEESR TVGLFFIHNF LLGIGTILIY VAANAILLEN 
QPDTSLPIAY IASAAAMIGI GRIYGYFEHH LALRSLAVRV LIAAVVMTIV VGILVVVGHS
IMAAVAIMTG YRIIYLLTNL EFWGVSAVVF DTRQSKRLFG VISSGDMPAK AIGAILAALV
HAHADVLRLL LVAFGAFLAA LYILQLTIQS HDVHAPQGAD RVARREPSRL IGKRFGGSEL
IFYMCLSMAF LAAVATEIEY NFFINVKHRF HDQTDVIRYV SYVLALTYGV AMLAKLLMSR
QVLDRFGIQR SLLVLPWLAL TGLIGLIGLR YFTTSETVLL VYFCGLYLLF EVARRALFDP
VFLVLFQPLL PPQRLKGHTL AKGLYEPIGL GVAGLLIYVL HTTLESGGVL VWILLLAGAI
WLLRHTYKRY LHELTDAIGR RFLESDQLAM PTVAQASLVR QLQSDRPGDV LTAISWLDTH
NPDELSRQIP SLLTHTNASI RRRTLAAAAR MQQPLPVRQV THIALTDDDP ALRQQAAYLL
GRRINAEAAE LAPLINHTDL AIRQGAIKGV LELNPHDPAA LAALTKLATA ADVVLQQLAL
SLIATLRLTS FAALVGERLE SPSADVRKAA INAAGHLPST NLTRYLLDNL THKTIGRAVT
QALKARGSET IAALRQVRQS TTDRLVLERI AAVCGAIPTP ESRHVLNGLA QQPDVLVRSA
ALRALRRFPN EPEDDALFRA VMQEEVLLAQ RLLHGSLTDT ALTKTLDYEL SVLLQRLFDV
LMQLYDSETI AGARMGIGHP ARERRANSLE MLDNLIPRAT YQTLQVLVDD LPRTEKNRIL
DAELGPFDDR EPIRAFVLRM GETVFSSWTV DVVLRTLPPA DALLARQSLE SRFGTLSPLL
MSHAPHSTEQ ISAYDRVLLL SQASLFSQTP ENVLASITPI MKEEEHQAGV SIFRKGDLGT
GMYVIYTGEV AILDGDTELA RFGRGDFFGE LALLDTEARS ATAEAITDVR LLRIDQDDFY
DLMEERSEVL RSIVKSLSGR IRRQNELLSN RATTPQ