Gene Slin_3946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSlin_3946 
Symbol 
ID8727704 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSpirosoma linguale DSM 74 
KingdomBacteria 
Replicon accessionNC_013730 
Strand
Start bp4729714 
End bp4732845 
Gene Length3132 bp 
Protein Length1043 aa 
Translation table11 
GC content55% 
IMG OID 
ProductTonB-dependent receptor plug 
Protein accessionYP_003388735 
Protein GI284038805 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAAAA CTCTAATCTT TCTGGGTTGT CTGCTACTGA GTTGTGGCGT TACGCTGGCC 
CAGACAAACC GAATAACCGG TAAGGTAACC GGCCCCGATA AACAGGGGTT ACCCGGCGTA
AACGTCCTCG TGGGCGGCAC TTCGGTCGGA ACGGCCACTG ATGCGTCGGG AAATTACGCC
ATCAATGCAC CCGCAAATGC TTCGCTGATT TTTTCATTCA TCAGCTATGT AACGCAAACT
GTACCCGTCA ATAACCGATC CATAGTGAAC GTAGAACTGG CCGAAGATGC CAAGGCCATC
GATGAGGTGG TTGTTACGGC CCTCGGTATC AAGCGGGAAG CCAAAACGCT GGGGTATGCC
ACCGCAACGG TCAACGCCGA GCAGATCAAC GTAAACCGGA CGCCCAATTT TATGAGTGGA
CTACAGGGCA AAATGGCTGG TGTCAACATC ACGTCTATGG GTACTGGTCC CGCCGGAACG
GCCAAAATCC GCATTCGGGG GCAGTCGTCG TTCAGCGGAC AGAACAACCC GCTGATTGTC
GTCAACGGGG TGCCCATCGA CAACTCCAAC TACTCCCTCG GGGGCGATTT CGGCAACCGG
GCTTCCAACA GTTCGGATGG GGGCGATGGA CTGAGCAGTA TCAATCCCGA CGATATCGAG
ACCATGACCG TACTGAAAGG TGCTACGGCG GCTGCCCTGT ACGGCTCACG CGCTAAAGAC
GGCGTGGTCA TGATTACGAC CAAAAGCCGG GGGTCGGGCA AAGGCTTCGG TGTGACCTAC
AACGCCAACT TCACCACCGA TACCCCGTTG GATTTTACGG ACTTTCAGTA CGAATACGGG
CAGGGCGAAG GCGGCAAACG ACCTACAACG GAGAAACCGA CCTCGGGCGT ATGGAGTTTC
GGCGAGAAGT TCCAGCCGGG CATGACACAG ATTCTGTTCG ATAACAAAAC GTACCCCTAT
GAACCCGTTT ATAATCGGGT ACGGCAGTTT TACCGCGTAG GCACCAACTT CACGAATACC
GTAACCGTGT CGAACAACGG CCAGAATGGC GGTTTCAGCC TGTCGTTTGG CAACACCGAC
AACCGGGGTA TCATGGAAAA CAACACCTTC AACCGAAAGG TGATCAACCT GGGATTCACG
CAGAACATCA CGCAAAAGCT AACCGCGTTG GGCAACATTA ATTACTCGCT GGAAAACAAC
GTCAATCCGC CCCAGCTAAA CACCCAGGAC CTGTCTGTAT CGACGGTGAT TTTTACGCTG
GCCAACTCCA TGCCTTTCGA CGCACTGCGC GACAACCAGA CATTGCCCAA CGGCGATGAG
TTCGTCTTCT CCCGTTTTCT GGTTCGGAAT AACCCCTACT ATTCCATGAG CCACAAATTC
GAGAACGTCA ATCGCAGTCG ACTGTTCGGG AACGTTGCCC TTAAATACCA GTTCACCGAC
TGGCTGTACG CACAGGCCCG CCTGGCCCAG GATTATTATG TGCGTAACCA GGAGTATAAC
ATCCCGAACG GCTACGCCCC CATTGCCCGC GCCCCGGTGG GCTTTGTGAA CGGTTCCTAC
ACGCAGGATG TCCGCCAGAA CACCGAACGG AACCTCGACT TGATTCTCGG AATGAACAAG
ACGTTTGGCA CGATCGGCGT GGACGTCACG CTGGGTGGCA ACCACCGCTA TGCACGCAAC
GACTACAACA GTGTAACGGT GCAGGATTTT GTACAGCCGG GCCTGTATAC CGTCATGAAC
GGCCGTATTA AAGACCCGCT CTACAGTCTG GCCGAGAAAA AGATCAACTC CGTGTTTGGG
GCGGCTACGG TATCGTACAA GGATTTCCTG TTTTTGAGTG CTACGGCCCG CAATGACTGG
TTTTCGACGC TGGCCCCCTC CAACCGGAGC ATTCTGTACC CATCCGTTAC GAGTAGTTTC
GTATTCTCGC AGGCGTTCGA CAACATGCCC GCCTGGCTGT CGTTCGGAAA GCTACGCGCG
GCTTATGCGC AGGTTGGCTC GGACAACGTC GACCCCTATT CCAACGCGCT GTATTTTTCC
GTTGATAACA ACTCATTCCC GAATCCTTCT GGCGCTCTGG TACCCGTGGG CGGCATCAAT
GCAACGGTTG TTCCCAACAA AAACCTGCGT CCGCTGCGCA TTCAGGAGGC CGAAGTGGGG
CTGGAGCTGA AGCTGTTTGC CAATAAAGTC GGCTTTGATT TCACGTACTA CCACAAAACG
ACGGACGACC AGATTCTGGC CGCTCAGGTT TCCGATGCCT CGTCGTACAC CAGCAAGCTG
ATCAACGTAG GCCGGAGTAT GAACCAGGGC CTTGAGATGC TGCTGACCTT TTCGCCCGTC
CGGACGACCA CGTTCCGTTG GGATGTCAGC GCCAACGTGT CGTACAATAC ATCCAAAGTG
CTGAAACTTG GTTTATCGCC CAACGACACT GTCATTACCG TCAGCAGCGG GGGCGGCCGG
ACGCTGAATC AGGTAGTGGG CAAGCCCATT GGGCAGTTGT ACACCTTCAC TTACCTGCGG
GATGCGCAGG GTCGGCAGGT TTTCGACGCC AACAGCGGGA TGCCGCTGCG CAACAATACC
CTTAAGAATG TGGGCAACGC CCTGCCAAGT TATTTTGGCG GTATCACGAA CACCTTCACG
TATCGAGGCA TTGTGCTGTC GGCGCTGATC GACTTCAAGC TGGGCCATAA ACTGATTGCG
GGCCGCAACA TCAACTACAT GCGCCACGGC CTGTCGAAGC GGACATTACC GGGTCGGGAT
GTGGGGTATG TAATTGGCAA CGGGGTCAAC CCAAACGGAG AAATCAACCA GACGAGAGCC
GCCGTACAAC CTTTCTACGA ATCCATTAAC CCGCTGGGCA TCAACGAAGA TTTCGTGTTC
AACGCTGGGT TCTGGAAACT GCGTCAGATA TCGCTGGGCT ATGACTTCGA CAAACTCCTA
CCCCAGCGTT TTTTCCTGAA AGGTCTGCGG TTAAATGCGG TGGCCAACAA CGTCCTGATC
ATCAAAAAGT GGACCGAGAA TATGGACCCC GAAGAAGTGC TGGTGTCGTC GGACAACGCC
GTGGGACTGG ATTTCTGGCC GGGCCTGCCG CCTACCCGCA GCATAGGCTT CAACCTCAAC
GCCCGCTTTT GA
 
Protein sequence
MNKTLIFLGC LLLSCGVTLA QTNRITGKVT GPDKQGLPGV NVLVGGTSVG TATDASGNYA 
INAPANASLI FSFISYVTQT VPVNNRSIVN VELAEDAKAI DEVVVTALGI KREAKTLGYA
TATVNAEQIN VNRTPNFMSG LQGKMAGVNI TSMGTGPAGT AKIRIRGQSS FSGQNNPLIV
VNGVPIDNSN YSLGGDFGNR ASNSSDGGDG LSSINPDDIE TMTVLKGATA AALYGSRAKD
GVVMITTKSR GSGKGFGVTY NANFTTDTPL DFTDFQYEYG QGEGGKRPTT EKPTSGVWSF
GEKFQPGMTQ ILFDNKTYPY EPVYNRVRQF YRVGTNFTNT VTVSNNGQNG GFSLSFGNTD
NRGIMENNTF NRKVINLGFT QNITQKLTAL GNINYSLENN VNPPQLNTQD LSVSTVIFTL
ANSMPFDALR DNQTLPNGDE FVFSRFLVRN NPYYSMSHKF ENVNRSRLFG NVALKYQFTD
WLYAQARLAQ DYYVRNQEYN IPNGYAPIAR APVGFVNGSY TQDVRQNTER NLDLILGMNK
TFGTIGVDVT LGGNHRYARN DYNSVTVQDF VQPGLYTVMN GRIKDPLYSL AEKKINSVFG
AATVSYKDFL FLSATARNDW FSTLAPSNRS ILYPSVTSSF VFSQAFDNMP AWLSFGKLRA
AYAQVGSDNV DPYSNALYFS VDNNSFPNPS GALVPVGGIN ATVVPNKNLR PLRIQEAEVG
LELKLFANKV GFDFTYYHKT TDDQILAAQV SDASSYTSKL INVGRSMNQG LEMLLTFSPV
RTTTFRWDVS ANVSYNTSKV LKLGLSPNDT VITVSSGGGR TLNQVVGKPI GQLYTFTYLR
DAQGRQVFDA NSGMPLRNNT LKNVGNALPS YFGGITNTFT YRGIVLSALI DFKLGHKLIA
GRNINYMRHG LSKRTLPGRD VGYVIGNGVN PNGEINQTRA AVQPFYESIN PLGINEDFVF
NAGFWKLRQI SLGYDFDKLL PQRFFLKGLR LNAVANNVLI IKKWTENMDP EEVLVSSDNA
VGLDFWPGLP PTRSIGFNLN ARF