Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E1447 |
Symbol | |
ID | 6270148 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 1320668 |
End bp | 1322524 |
Gene Length | 1857 bp |
Protein Length | 618 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641725548 |
Product | YjhS |
Protein accession | YP_001880054 |
Protein GI | 187733174 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 39 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATTTA AACACTATGA CGTGGTCAGG GCGGTATCGC CGTCAGACCT TGCGAAACGA CTGACACAAA AACTGAAGGA GGGCTGGCAG CCGTTTGGTA GTCCGGTGGC CATAACCCCT TATACCCTGA TGCAGGCGAT TGCAGCAGAA GGTGATGTGG TGGTCAGTGG TGCAACTGAG CCGGATTGGT ACTACGTCAT CGTACTGGCC GGGCAGTCCA ATGCCATGGC TTACGGTGAA GGGCTTCCGC TGCCGGATTC ATACGATGCG CCCCATCCGC GCATTAAGCA ACTGGCCCGT CGTAACACAG TGACTCCCGG TGGTGAAGTA TGCGTATTTA ACGACATCAT TCCTGCTGAC CATTGTCTGC ATGATGTTCA GGATATGAGT ACGATTAACC ATCCCCGGGC TGACCTGAGC AAAGGGCAGT ACGGCTGTGT CGGACAGGGC TTACATATTG CCAAAAAACT GCTTCCGTAT ATCCCTAATA ACGCGGGGAT CCTGCTGGTA CCATGCTGTC GTGGTGGTTC GGCATTCACC CAGGGCACGG AGGGGACATT CAGCGAGTCC ACGGGGGCCA GTCAGGATTC GGCTCGCTGG GGAGTGGGTA AGCCGTTATA TCAGGATCTG CTTTTCCGCA CGAAGGCAGC ATTGCAGAAA AACCCGAAAA ACGTTTTGCT GGCGATATGC TGGATGCAGG GGGAATTCGA TATGACGAAT GCCAGTTACG CCCAGCAGCC AGCAGCATTT CTTGCAATGG TACAGCAGTT CCGTGCTGAC CTTGCCGGGC TGGCGGCGCA GTGTCACGGT GGAAGTCCGG CATCAGTCCC CTGGATTTGT GGCGACACGA CATACGCGTG GAAACAAGAA CACGGTACGC AATATGAAGT GGTATATGGT GCATATAAAG GTAAAGAATC CCAGCAGATT TATTTTGTTC CCTTTATGAC CGATGGTAGC GGAGTTAATA CACCGACAAA CAACCCGTCA GAAGATCCTG ATATTGCCGG GTCTGGTTAT TACGGTTCGG CATCCCGAAC GAACAAAAAC TGGGTATCAT CAAATCGCCC GACGCATTTC AGCTCATGGG CGCGTCGTGG CATTATTCCC GATCGTATGG CAACTGCTAT TCTGAACGTA GCCGGTCGCA CCTTAGCCTT CATTAGTGGT AAGGCACCGG AAATCAAACC CTCGCCCGGC GGCGACACTC CATCGGGGCC GTCTGATGGT GACACATCCG TTCGTACAGT CTCCCTGCTG CCGACAGCCG GAGAGGCTGC TGCGCAGGGC TGGACCATCA CCGGCGGCAG TGTTGCGCTG GAAGATGGTG TGTTTAAGGT TACCAAGCAG AGCAATAAAA CCTGGTCCCT GATGCATCCG GTGGATGACG CAGTCTCCCT GCTGACACGG GGTGGCAGAC TGAGCTGTAA GTTTCGACTG TCAGGCGCAC TGACCAACAA CCAGTTCGGT CTGGGAATTT ATCTGTATAC CGATGTAGCG TTACCTGACG TCGTGGCGAT GACCGGGACT GGTAACCCGT TCCTGATGTC GTTCTTCACC CAGACCACAG ACGGCAAACT GAATCTGATG CATCACAGGA AAGCAGGAAA CACAAAGTTG GGCGAGTTCG GGAATTACAG TAACGACTGG CAGACGCTGG AGCTGGTGTT CACCGCCGGC AGTGCCACGG TTACTCCGAA ACTGAATGGA GTGGCTGGCC CGGCATTCCA GGTCATAAAA GACAGTCTGA CACTGGGGCT GAATGCGCTG ACGCTGACGG ATATTACCAA AAATGCAGCG TATGGCGTTG AGATAGAAAG TCTGGTGCTG GAGATAAATG CACCAGCATC ATCATAA
|
Protein sequence | MAFKHYDVVR AVSPSDLAKR LTQKLKEGWQ PFGSPVAITP YTLMQAIAAE GDVVVSGATE PDWYYVIVLA GQSNAMAYGE GLPLPDSYDA PHPRIKQLAR RNTVTPGGEV CVFNDIIPAD HCLHDVQDMS TINHPRADLS KGQYGCVGQG LHIAKKLLPY IPNNAGILLV PCCRGGSAFT QGTEGTFSES TGASQDSARW GVGKPLYQDL LFRTKAALQK NPKNVLLAIC WMQGEFDMTN ASYAQQPAAF LAMVQQFRAD LAGLAAQCHG GSPASVPWIC GDTTYAWKQE HGTQYEVVYG AYKGKESQQI YFVPFMTDGS GVNTPTNNPS EDPDIAGSGY YGSASRTNKN WVSSNRPTHF SSWARRGIIP DRMATAILNV AGRTLAFISG KAPEIKPSPG GDTPSGPSDG DTSVRTVSLL PTAGEAAAQG WTITGGSVAL EDGVFKVTKQ SNKTWSLMHP VDDAVSLLTR GGRLSCKFRL SGALTNNQFG LGIYLYTDVA LPDVVAMTGT GNPFLMSFFT QTTDGKLNLM HHRKAGNTKL GEFGNYSNDW QTLELVFTAG SATVTPKLNG VAGPAFQVIK DSLTLGLNAL TLTDITKNAA YGVEIESLVL EINAPASS
|
| |