Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shel_06030 |
Symbol | |
ID | 8394495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Slackia heliotrinireducens DSM 20476 |
Kingdom | Bacteria |
Replicon accession | NC_013165 |
Strand | - |
Start bp | 702282 |
End bp | 705185 |
Gene Length | 2904 bp |
Protein Length | 967 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644985365 |
Product | YhgE/Pip-like protein |
Protein accession | YP_003143011 |
Protein GI | 257063339 |
COG category | [S] Function unknown |
COG ID | [COG1511] Predicted membrane protein |
TIGRFAM ID | [TIGR03061] YhgE/Pip N-terminal domain [TIGR03062] YhgE/Pip C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.646381 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.719714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAG TACTTGAAAT ACTCGGGCGA GATCTTGGCC GCCTTATCCG CAACCCTCTT GCAACAGTGG TCGTACTTGC CATGTGCATC CTGCCGTCGC TGTACACCTG GTACTGCGAC GTCGCTTTTT GGGACCCCTA CGGCAACACC GGCAACATCC CGGTTGCGGT GGCGTCCGCC GACGAAGGCT ACGATCTGGC GAATCTGGCC GACGAGCTGC CGGTCGACGC GTCCGAAGGG CTCGGCGCAT CGTCTGAGGC CCGCGAGGAG AACTACGTCA ACATCGGCGA GCAGCTCGAG GACCAGCTTC GCGAAAACAA CCAGCTCAAG TGGATATTCG TGAACGAAGA CGAAGCCATC CAAGGCGTGA AGGCCGGCGA CTACTACGCG GCCATCGTCA TTCCCGAAGA CTTCACCAAC CAGTTGATCC AGGTCATCCT TCAGAACAGC GACGACATGC CGAAGATCCA ATACTACGTG AACGAGAAGA GCAACGCCGT GGCGCCGAAG ATCACCGACG CCGGCTCACG CACCATCCAG CAGCAGATCA ACGCGTCCTT CGTCGAGCTG CTCACCCAAA CCATCCTGGA AACGCTGCAG TCCGCTGGCT TTGACCTGGA CGCATCCGCT ACCGATGCCG AACAGACCCT GACCCGAAAG CTTTCGGGGG TCCAAGGCTC CCTCGACGAG GTGCAGGCTT CCCTTGACGG GTTGGGCGAA TCCGTCAGCA AGGCGCGCAC CGCCACGGCA GACGCGCGGC TCACGTTAAC CGACCTGCAG GCATCGCTAC CCACCATCGT ATCCGGCGTG AACCAGGCCG ACGACCTTTT GGGAGACATC CGCACCCAAG GCGGGGCCTA TGCCACAGAA GCTTCTGCAG CCCTGGGCGA AGGCGCCTCG AAGCTCGGCG GGGCCTCGGC GCAGGCGTCG AGCACCGTGG GGGCCGCAAG CGCCGACGTG CTGACCCTCA AGGCGAAGGT CGATGCCGCC ATCGAAGCCA TGCAGGACGT GATCGACCGC AACACCGAAA CCATCGCCGA GCTGCAGGAC ATCGTGGACG CGGCCTCTGC CCGCCTGGCG GAAGCCACGG GCGACGACAC GGTCCTCGAT CCCGAATCCC CGCTGTCGGG CATCACCGAC GAAACGGGGC AAATCGCCCA GGACGCCATC GACGACGCCC AGGCGATCAT CGACCAGCTC ACGGCCGAAA ACGCGAAATA CCAGGACGTG GTCGACCGCC TGACCCGCGC TAGCGACAAC CTGGGCGCCG TGGCCACGTC CACCGACGCC TCCGTGGAGC AGATCGACGC CGCCGTGCAG AACGCCATAG CGGTGTTGGG CGGCGCCCAG ACCACCTTCA ACACCGACCT GCTGCCCCAG CTGTCAAGCG GCCTGGACTC GCTGTCCAGC GCCGGTTCCA ACATGACCGC CGCCGCGGCG GGCGCCGACA CCACCATCGA CCAGGCCGTG GCCTCCCTCG ACGCCCTGGA CGGCATCCTG GACCAGACCG AAACCTCGCT TTCCACCTGG CAGGACGACA TCCAGCGCAT CCAGGACCGC CTGGACGGCA TCGCCACCGA CATCCGGTCC CTATACAACG CCAACTCGCT GGAGACCCTG GCCGGGCTTC TGGGTATGGA CGTGCAGGAG ATTGCATCGT TCATCGCGTC GCCCGTGACC CTGACCACCG AGAAGGTGTA CCCCGTCGAA TCCTACGGCT CCAGTGTGAA CCCCTTCTTC ACCAACCTGT CGCTGTGGAT CGGCGGCTTG GTGCTGATCG CCATCCTGAA GGTTGAGGTT GACCGCACGG GCCTGGGCGA CGTCAAGCCC TACCAGGCGT ATCTGGGACG CTGGCTCTTC TTCGTGCTGA TCGGCATCGT CCAATCCGTC GTGCTGTGCA CAGGCGAGCT GCTCATCGGC GTGCAATGCA ACCACGTTCC GGCCATGTAT CTGGCTGCCA TGGTGGCGTC CGTCGTCTAT GTGAGCATCT CCTACTCGCT CACCATCACG TTCAAGCACA TCGGCAAAGC CATCGCGGTC ATGCTGCTTA TCGCGCAAAT CCCCGGCGGT TCGGGCATGT ACCCCGTCGA AATGCTGCCG GACTTCTACC AACGGCTGCA CCCGCTGGTG CCCTTCACCT ACGGAATCAA CGCAATACGC GAGGCCATGT TCGGCTTCTA CGGGAATTAC TACGTCCACA ACCTGCTCGT CCTGGCGCTG TTCTTCATCC CCACCCTCAT CGTGGGCATC GCGCTGCGAC CCTACCTCAT GAACATCAAC CTGCTGTTCG ACCGCGAGCT GAAGAGCACC GGCGTCATGA TTCACGAGGA GCACACCATC GGTATCGAGC GCTTCCGCAT GCGCACCATC ATCCGCGCGC TCATGAACGC CGACGAATAC CGCTCGCAGC TGACCGTCCG AGCCGCGCAG TTCGACGCGT ACTACCCGAT CATGCGCAGG CTGGGACTGA TCTTCATGAT CTCGCTCCCG CTCATCCCGC TGACGGTCAT GGCCGTGGTC GACCTGGGCA TCGACGGGCG CATCGCGGCT CTGGTCCTGT GGATCCTGCT GGTCATCTTC ACGGTGGCCA TGCTCATCAT GCTCGAATAC GCGAAAACGA ACATCTCCAA CCAGATGAAG CTCAACGGCA TGAGCCAAGA CGAGCTGGCT GCCTCCTTGG CGATGCATGC CAACATGACC CGCACCGGGC ACGCGTTGAA GCTGTCGAAG CTGCTGACCC GAATGGCCGG CATCCAAACC GACCAGAGCG AAGCCACAGG GCCGGTGCGG AACGCCGCGA CCGAAGCCGT ACTGGCGGAT GCGGCCGCAT CCGAGGCTGA GCCCGAACCT GCGGCCGAGC CCGAGCCCGT GACCAACGAG GACAAGGACG GCAGCCATGC ATAA
|
Protein sequence | MGKVLEILGR DLGRLIRNPL ATVVVLAMCI LPSLYTWYCD VAFWDPYGNT GNIPVAVASA DEGYDLANLA DELPVDASEG LGASSEAREE NYVNIGEQLE DQLRENNQLK WIFVNEDEAI QGVKAGDYYA AIVIPEDFTN QLIQVILQNS DDMPKIQYYV NEKSNAVAPK ITDAGSRTIQ QQINASFVEL LTQTILETLQ SAGFDLDASA TDAEQTLTRK LSGVQGSLDE VQASLDGLGE SVSKARTATA DARLTLTDLQ ASLPTIVSGV NQADDLLGDI RTQGGAYATE ASAALGEGAS KLGGASAQAS STVGAASADV LTLKAKVDAA IEAMQDVIDR NTETIAELQD IVDAASARLA EATGDDTVLD PESPLSGITD ETGQIAQDAI DDAQAIIDQL TAENAKYQDV VDRLTRASDN LGAVATSTDA SVEQIDAAVQ NAIAVLGGAQ TTFNTDLLPQ LSSGLDSLSS AGSNMTAAAA GADTTIDQAV ASLDALDGIL DQTETSLSTW QDDIQRIQDR LDGIATDIRS LYNANSLETL AGLLGMDVQE IASFIASPVT LTTEKVYPVE SYGSSVNPFF TNLSLWIGGL VLIAILKVEV DRTGLGDVKP YQAYLGRWLF FVLIGIVQSV VLCTGELLIG VQCNHVPAMY LAAMVASVVY VSISYSLTIT FKHIGKAIAV MLLIAQIPGG SGMYPVEMLP DFYQRLHPLV PFTYGINAIR EAMFGFYGNY YVHNLLVLAL FFIPTLIVGI ALRPYLMNIN LLFDRELKST GVMIHEEHTI GIERFRMRTI IRALMNADEY RSQLTVRAAQ FDAYYPIMRR LGLIFMISLP LIPLTVMAVV DLGIDGRIAA LVLWILLVIF TVAMLIMLEY AKTNISNQMK LNGMSQDELA ASLAMHANMT RTGHALKLSK LLTRMAGIQT DQSEATGPVR NAATEAVLAD AAASEAEPEP AAEPEPVTNE DKDGSHA
|
| |