Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shel_28420 |
Symbol | |
ID | 8396730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | - |
Start bp | 3146157 |
End bp | 3149120 |
Gene Length | 2964 bp |
Protein Length | 987 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644987580 |
Product | hypothetical protein |
Protein accession | YP_003145177 |
Protein GI | 257065505 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.553187 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.116138 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAACC CTATTGATCA TGTAGCGTAC CTGGCCGATG AAATCGGTCC CCGTCCCTAC GGTACGGAGG AAGAGCAGCA AGGCGCGCTG TATATTGTCG AGCGCCTGCA GAAGGACGCA CATCTTTCCG TCAATCTTGA AGATTTTAGC GCGAGCATCG AGGCCAACGC CTACAAGATG ATCTGCTTTG GCGTGACGAT CGTTGCCGCC ATCGTGGCCA TGATTGTTTC ACGTGCCGAG CTGGTTGCTG CGATTCTGGC TCTTGCATCG TCGGCTCTCT ATTTTTTGGA GATGTTCGAC ATCCCTGTGC TGTCGAGGTT CTTCAAGAAG GGCGTCAGCC AGAACGTGGT AGCGAAGTAC GATCCGCCGC GTAGGGAAAA CGCCGCTGGC ACGCGCCGCC GCAAAGTCAT TCTTACGGCT AACTACGACA GCGGCAAAGT GCGTCTTGAT TACAACCGTG GCGTCATCCG TTTCCTGAAG CCGTTGCAAC AGGCCACAGC CGTGTCGATG ATCGCTGTTC CTGTGTTCAT GCTGATTCGC GCCTTCTTCA TCCACGGTCT TACCGGCACT GCAGCCGGTG TCGCCGATGT GTTCGAAGGC ATCTTCTTGC TCTGCATCGC CATTTCGCTT GTTTTCCTGG TTGTTGAGAA GTTTGCTCCT TACAATGACG CCGCCAACGA CAATGCGGCC GGCGTTGCGG TTCTGCTTGA AGTTGCCCGA CGCCTGAGCG AAGGCCAGAC CGACACGGCC GTTACCGAGC AGCGCGGCAT CACCCATGAC GAGGACACCC TTCGCGACGA GGGGCTTATT CCGCAGGATG CCACCATCGT GTATGAGGAC GACGATCCCG ATGCGTACGT GAAGGATGCC CAGCAGCCCA TGTATGACAT CGCGGGCAAC CTGGTCCGCG TCGACCGCGA CAACATGGAT GAACAGCTGA GTGCTGCCCA GGCCGCAGCC GCTGGTTCTA CCACAGCTTT CGCTCCTGTT TCACAGGAAA CATTCGAGGA GCTTCGTTCC GCCGTGGACG CCGGCAGCGT CGAGGATATG ATCGACAAGG AAACCGCTCT TGATGCCGTG GCTCCTGTTC AGCCCGCTCC CGCTCCGGTT GCGCAGCCTG CCGAGTCTGC GCCTGTGGTG AAGGAAGCTT CTGCAGAGGG CGTTTCTGCC GAAGAGCCCG CAGCCGAACC GGTGGTCGAG CCTGAGGTTG TAGAAGACGA TGCGAACGTT CCCGCGTGGT ACAAGAAGGC CATGCAGCGC GCCCGCAAAG ACGAGCAAGC CGCTCCTGAA ACCACGCAGC GTTCCCGCTA TGCCGACTAT CCGACGGTTC CCGCAGCTGC ATTTGGCGCA GCTGCCGAAG CCTCTGCGGT TGCCGCGCCC GAGCCGGAGC CCGAGCCCGC GCCCGAACCG GTTGCCGAGG TCGAAGCTGT TGCTGAGCCT GTTGTGGTTG AAGAACAGCC GGCCGCAGTT GAGCCTGTGA TCGAGTCTGT TGCTGAGCCG GCACCTGAAC CCGAACCAGA ACCTGTGGTG GAACCTGAGT TTGAGCCGGT TGTCATGTCG GATACCGAAC CTGAACCTGA ACCCGAGCCT GAGCCGGAAC CTGAACCCGA GCCCGAGCCG GAACCGGAAC CCGAACCCGA ACCTGAACCT GAAATTGACT CGTCGAAGGC CGTCACGCAG CCTCTGCCAT TCTTGACCCA GGCTGACAAG CGTACCAAGG ACCGTATTCT GGTCGATAGC ACCGAGGCTC AGGCCACGAT CATGATGCCC CCCATCAACG CCGATGCGGC CCGTGCCGAT ACGGCCCGCA CCTTCGATAT TCCGTCCATT TCTTACGGCG GGAAGACCGA AGCGATCGAT CCGGCCCAGC TGCAGCAGCG CGCGCCGCTG GCCGAAGTCA ACGAGCAGCA GGGTAAAGAA GCCGCCAAGA AGCTGTTGGC GACCACGTTG CCTTCCATCG AAGATGACCC CGACAAGGAC GAGTCGTCTG GCCAGACGAA CACGAATGTC AGCCTGACAG GTTCGTTCTC GGCCATTGCA GCTACGGGTG CCGCAACGTC GGTCGGTGAT GAGCTGCTTG CCGATGTTGA CCCTGATGAC ATCTTTATCG ACGATGCCGA CGACTCCATC TTCGACGAGG AATTCACCGA GACCGGTGCC TTCGCAGGCA AGGGCTACGT CGATATGCCT CAGTCTCGCT TGGGCCGTTT CTTCAACCGC TTCCGCCGCA AGGACAAGAA AGAAGAAGAA TCCGCGCATG ACTGGTTGGG CGTCGACGAT GATTTCGACG CACGCAAGGT CGGTAAGGAG CGTGGCGGTT GGGAAAGCTT CCGCGAGGAT GATGATGAGT GGCTCGGCGG TGCGTTCGAC GGCATTCGCG ACCGTCTGTC TGGCGGCGGC GAAGACCGCA CCGTGGGTGG CCAGGACCGC CACGTGCGTA AGAGCATCGC TTCGCCTTTC GAGGGCCTGC CGCTGCATAT GGACGAAACT GCCGATCAGG TATATGCCTT TGCGGGCGCA GACCAGGTGA CCACCGAGGT GTGGTTCGTG GCTCTGGGCT CGCAGGGCAG CGACCAAGCC GGCATCAAAG CCTTTATGGC CGAACATGCC GATGATATGC GTGGCGCCAT TGTTGTGAAC CTCGAGGCGT TGGGCGACGG CGACACCTGC TATCTTGAGA GCGAAGGCGA AATCTTCCAG CGTCCCGCTG CGAGCCGCGT GAAACGGTTC GTTCGCCAGG CCGCACAGCG TACGGGAGTG AATGTGCATT CCGCCAAGAT TGATTGGCGT GAATCCGCTG CGAGCTATGC CTTGAAGCAT AACCTTCCTG CCATCACCTT GGTTGGTATG GATGGCGACA AGCCGGCTGG CCTGGGCGAA GCAGGCGACA CGCTTGAAGG CGTGAATCCC CAGAAGCTGG AGGAAAGCGC CAACTTCGTC ATTGAGGTTC TGAAGAACGT CTAG
|
Protein sequence | MSNPIDHVAY LADEIGPRPY GTEEEQQGAL YIVERLQKDA HLSVNLEDFS ASIEANAYKM ICFGVTIVAA IVAMIVSRAE LVAAILALAS SALYFLEMFD IPVLSRFFKK GVSQNVVAKY DPPRRENAAG TRRRKVILTA NYDSGKVRLD YNRGVIRFLK PLQQATAVSM IAVPVFMLIR AFFIHGLTGT AAGVADVFEG IFLLCIAISL VFLVVEKFAP YNDAANDNAA GVAVLLEVAR RLSEGQTDTA VTEQRGITHD EDTLRDEGLI PQDATIVYED DDPDAYVKDA QQPMYDIAGN LVRVDRDNMD EQLSAAQAAA AGSTTAFAPV SQETFEELRS AVDAGSVEDM IDKETALDAV APVQPAPAPV AQPAESAPVV KEASAEGVSA EEPAAEPVVE PEVVEDDANV PAWYKKAMQR ARKDEQAAPE TTQRSRYADY PTVPAAAFGA AAEASAVAAP EPEPEPAPEP VAEVEAVAEP VVVEEQPAAV EPVIESVAEP APEPEPEPVV EPEFEPVVMS DTEPEPEPEP EPEPEPEPEP EPEPEPEPEP EIDSSKAVTQ PLPFLTQADK RTKDRILVDS TEAQATIMMP PINADAARAD TARTFDIPSI SYGGKTEAID PAQLQQRAPL AEVNEQQGKE AAKKLLATTL PSIEDDPDKD ESSGQTNTNV SLTGSFSAIA ATGAATSVGD ELLADVDPDD IFIDDADDSI FDEEFTETGA FAGKGYVDMP QSRLGRFFNR FRRKDKKEEE SAHDWLGVDD DFDARKVGKE RGGWESFRED DDEWLGGAFD GIRDRLSGGG EDRTVGGQDR HVRKSIASPF EGLPLHMDET ADQVYAFAGA DQVTTEVWFV ALGSQGSDQA GIKAFMAEHA DDMRGAIVVN LEALGDGDTC YLESEGEIFQ RPAASRVKRF VRQAAQRTGV NVHSAKIDWR ESAASYALKH NLPAITLVGM DGDKPAGLGE AGDTLEGVNP QKLEESANFV IEVLKNV
|
| |