Gene Shel_28420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShel_28420 
Symbol 
ID8396730 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSlackia heliotrinireducens DSM 20476 
KingdomBacteria 
Replicon accessionNC_013165 
Strand
Start bp3146157 
End bp3149120 
Gene Length2964 bp 
Protein Length987 aa 
Translation table11 
GC content60% 
IMG OID644987580 
Producthypothetical protein 
Protein accessionYP_003145177 
Protein GI257065505 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0810] Periplasmic protein TonB, links inner and outer membranes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.553187 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.116138 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACC CTATTGATCA TGTAGCGTAC CTGGCCGATG AAATCGGTCC CCGTCCCTAC 
GGTACGGAGG AAGAGCAGCA AGGCGCGCTG TATATTGTCG AGCGCCTGCA GAAGGACGCA
CATCTTTCCG TCAATCTTGA AGATTTTAGC GCGAGCATCG AGGCCAACGC CTACAAGATG
ATCTGCTTTG GCGTGACGAT CGTTGCCGCC ATCGTGGCCA TGATTGTTTC ACGTGCCGAG
CTGGTTGCTG CGATTCTGGC TCTTGCATCG TCGGCTCTCT ATTTTTTGGA GATGTTCGAC
ATCCCTGTGC TGTCGAGGTT CTTCAAGAAG GGCGTCAGCC AGAACGTGGT AGCGAAGTAC
GATCCGCCGC GTAGGGAAAA CGCCGCTGGC ACGCGCCGCC GCAAAGTCAT TCTTACGGCT
AACTACGACA GCGGCAAAGT GCGTCTTGAT TACAACCGTG GCGTCATCCG TTTCCTGAAG
CCGTTGCAAC AGGCCACAGC CGTGTCGATG ATCGCTGTTC CTGTGTTCAT GCTGATTCGC
GCCTTCTTCA TCCACGGTCT TACCGGCACT GCAGCCGGTG TCGCCGATGT GTTCGAAGGC
ATCTTCTTGC TCTGCATCGC CATTTCGCTT GTTTTCCTGG TTGTTGAGAA GTTTGCTCCT
TACAATGACG CCGCCAACGA CAATGCGGCC GGCGTTGCGG TTCTGCTTGA AGTTGCCCGA
CGCCTGAGCG AAGGCCAGAC CGACACGGCC GTTACCGAGC AGCGCGGCAT CACCCATGAC
GAGGACACCC TTCGCGACGA GGGGCTTATT CCGCAGGATG CCACCATCGT GTATGAGGAC
GACGATCCCG ATGCGTACGT GAAGGATGCC CAGCAGCCCA TGTATGACAT CGCGGGCAAC
CTGGTCCGCG TCGACCGCGA CAACATGGAT GAACAGCTGA GTGCTGCCCA GGCCGCAGCC
GCTGGTTCTA CCACAGCTTT CGCTCCTGTT TCACAGGAAA CATTCGAGGA GCTTCGTTCC
GCCGTGGACG CCGGCAGCGT CGAGGATATG ATCGACAAGG AAACCGCTCT TGATGCCGTG
GCTCCTGTTC AGCCCGCTCC CGCTCCGGTT GCGCAGCCTG CCGAGTCTGC GCCTGTGGTG
AAGGAAGCTT CTGCAGAGGG CGTTTCTGCC GAAGAGCCCG CAGCCGAACC GGTGGTCGAG
CCTGAGGTTG TAGAAGACGA TGCGAACGTT CCCGCGTGGT ACAAGAAGGC CATGCAGCGC
GCCCGCAAAG ACGAGCAAGC CGCTCCTGAA ACCACGCAGC GTTCCCGCTA TGCCGACTAT
CCGACGGTTC CCGCAGCTGC ATTTGGCGCA GCTGCCGAAG CCTCTGCGGT TGCCGCGCCC
GAGCCGGAGC CCGAGCCCGC GCCCGAACCG GTTGCCGAGG TCGAAGCTGT TGCTGAGCCT
GTTGTGGTTG AAGAACAGCC GGCCGCAGTT GAGCCTGTGA TCGAGTCTGT TGCTGAGCCG
GCACCTGAAC CCGAACCAGA ACCTGTGGTG GAACCTGAGT TTGAGCCGGT TGTCATGTCG
GATACCGAAC CTGAACCTGA ACCCGAGCCT GAGCCGGAAC CTGAACCCGA GCCCGAGCCG
GAACCGGAAC CCGAACCCGA ACCTGAACCT GAAATTGACT CGTCGAAGGC CGTCACGCAG
CCTCTGCCAT TCTTGACCCA GGCTGACAAG CGTACCAAGG ACCGTATTCT GGTCGATAGC
ACCGAGGCTC AGGCCACGAT CATGATGCCC CCCATCAACG CCGATGCGGC CCGTGCCGAT
ACGGCCCGCA CCTTCGATAT TCCGTCCATT TCTTACGGCG GGAAGACCGA AGCGATCGAT
CCGGCCCAGC TGCAGCAGCG CGCGCCGCTG GCCGAAGTCA ACGAGCAGCA GGGTAAAGAA
GCCGCCAAGA AGCTGTTGGC GACCACGTTG CCTTCCATCG AAGATGACCC CGACAAGGAC
GAGTCGTCTG GCCAGACGAA CACGAATGTC AGCCTGACAG GTTCGTTCTC GGCCATTGCA
GCTACGGGTG CCGCAACGTC GGTCGGTGAT GAGCTGCTTG CCGATGTTGA CCCTGATGAC
ATCTTTATCG ACGATGCCGA CGACTCCATC TTCGACGAGG AATTCACCGA GACCGGTGCC
TTCGCAGGCA AGGGCTACGT CGATATGCCT CAGTCTCGCT TGGGCCGTTT CTTCAACCGC
TTCCGCCGCA AGGACAAGAA AGAAGAAGAA TCCGCGCATG ACTGGTTGGG CGTCGACGAT
GATTTCGACG CACGCAAGGT CGGTAAGGAG CGTGGCGGTT GGGAAAGCTT CCGCGAGGAT
GATGATGAGT GGCTCGGCGG TGCGTTCGAC GGCATTCGCG ACCGTCTGTC TGGCGGCGGC
GAAGACCGCA CCGTGGGTGG CCAGGACCGC CACGTGCGTA AGAGCATCGC TTCGCCTTTC
GAGGGCCTGC CGCTGCATAT GGACGAAACT GCCGATCAGG TATATGCCTT TGCGGGCGCA
GACCAGGTGA CCACCGAGGT GTGGTTCGTG GCTCTGGGCT CGCAGGGCAG CGACCAAGCC
GGCATCAAAG CCTTTATGGC CGAACATGCC GATGATATGC GTGGCGCCAT TGTTGTGAAC
CTCGAGGCGT TGGGCGACGG CGACACCTGC TATCTTGAGA GCGAAGGCGA AATCTTCCAG
CGTCCCGCTG CGAGCCGCGT GAAACGGTTC GTTCGCCAGG CCGCACAGCG TACGGGAGTG
AATGTGCATT CCGCCAAGAT TGATTGGCGT GAATCCGCTG CGAGCTATGC CTTGAAGCAT
AACCTTCCTG CCATCACCTT GGTTGGTATG GATGGCGACA AGCCGGCTGG CCTGGGCGAA
GCAGGCGACA CGCTTGAAGG CGTGAATCCC CAGAAGCTGG AGGAAAGCGC CAACTTCGTC
ATTGAGGTTC TGAAGAACGT CTAG
 
Protein sequence
MSNPIDHVAY LADEIGPRPY GTEEEQQGAL YIVERLQKDA HLSVNLEDFS ASIEANAYKM 
ICFGVTIVAA IVAMIVSRAE LVAAILALAS SALYFLEMFD IPVLSRFFKK GVSQNVVAKY
DPPRRENAAG TRRRKVILTA NYDSGKVRLD YNRGVIRFLK PLQQATAVSM IAVPVFMLIR
AFFIHGLTGT AAGVADVFEG IFLLCIAISL VFLVVEKFAP YNDAANDNAA GVAVLLEVAR
RLSEGQTDTA VTEQRGITHD EDTLRDEGLI PQDATIVYED DDPDAYVKDA QQPMYDIAGN
LVRVDRDNMD EQLSAAQAAA AGSTTAFAPV SQETFEELRS AVDAGSVEDM IDKETALDAV
APVQPAPAPV AQPAESAPVV KEASAEGVSA EEPAAEPVVE PEVVEDDANV PAWYKKAMQR
ARKDEQAAPE TTQRSRYADY PTVPAAAFGA AAEASAVAAP EPEPEPAPEP VAEVEAVAEP
VVVEEQPAAV EPVIESVAEP APEPEPEPVV EPEFEPVVMS DTEPEPEPEP EPEPEPEPEP
EPEPEPEPEP EIDSSKAVTQ PLPFLTQADK RTKDRILVDS TEAQATIMMP PINADAARAD
TARTFDIPSI SYGGKTEAID PAQLQQRAPL AEVNEQQGKE AAKKLLATTL PSIEDDPDKD
ESSGQTNTNV SLTGSFSAIA ATGAATSVGD ELLADVDPDD IFIDDADDSI FDEEFTETGA
FAGKGYVDMP QSRLGRFFNR FRRKDKKEEE SAHDWLGVDD DFDARKVGKE RGGWESFRED
DDEWLGGAFD GIRDRLSGGG EDRTVGGQDR HVRKSIASPF EGLPLHMDET ADQVYAFAGA
DQVTTEVWFV ALGSQGSDQA GIKAFMAEHA DDMRGAIVVN LEALGDGDTC YLESEGEIFQ
RPAASRVKRF VRQAAQRTGV NVHSAKIDWR ESAASYALKH NLPAITLVGM DGDKPAGLGE
AGDTLEGVNP QKLEESANFV IEVLKNV