Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SbBS512_E3482 |
Symbol | |
ID | 6271714 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3232852 |
End bp | 3234480 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641727366 |
Product | SPFH/band 7 domain protein |
Protein accession | YP_001881815 |
Protein GI | 187732141 |
COG category | [S] Function unknown |
COG ID | [COG2268] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.632144 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTACCG CGATTATTGC CGTATGCATT CTGTTTATTA TTGGAATTAT TTTCGCCAGG CTCTATCGTC GCGCTTCGGC AGAGCAAGCT TTTGTTCGTA CTGGTTTAGG TGGGCAAAAA GTGGTGATGA GCGGTGGCGC AATCGTGATG CCGATCTTTC ATGAAATAAT CCCCATCAAT ATGAATACTC TGAAGCTGGA AGTCAGCCGC TCAACCATTG ATAGCCTGAT TACGAAAGAT CGTATGCGTG TCGATGTAGT AGTCGCTTTC TTTGTACGGG TAAAACCGTC TGTAGAAGGC ATCGCCACTG CGGCGCAGAC GTTGGGGCAA CGCACGCTAT CACCAGAAGA CTTACGTATG TTGGTTGAAG ATAAATTTGT CGATGCCCTC CGTGCAACAG CTGCACAAAT GACCATGCAT GAGTTACAGG ATACCCGCGA GAACTTCGTG CAGGGAGTAC AAAATACTGT CGCAGAAGAC CTGTCGAAAA ACGGTCTTGA GCTGGAAAGC GTTTCACTTA CCAACTTTAA CCAGACCTCG AAAGAACATT TCAATCCGAA CAATGCCTTT GACGCCGAAG GTTTAACCAA ACTGATTCAG GAGACGGAGC GCCGTCGTCG CGAACGTAAC GAAGTTGAAC AGGATGTAGA AGTTGCGGTG CGTGAGAAAA ACCGTGATGC ACTTTCGCGC AAGTTGGAGA TTGAACAACA AGAAGCGTTT ATGACGCTTG AGCAGGAGCA GCAGGTTAAA ACCCGTACTG CCGAACAGAA TGCACGTATT GCGGCTTTTG AAGCTGAACG TCGTCGTGAA GCAGAGCAGA CGCGAATTCT GGCTGAACGA CAAATTCAGG AAACGGAGAT CGAGCGCGAG CAGGCCGTCC GCTCAAGAAA GGTTGAGGCT GAACGTGAAG TTCGTATTAA AGAGATCGAA CAGCAGCAGG TCACCGAAAT CGCCAACCAG ACGAAATCGA TCGCTATTGC CGCCAAATCG GAACAACAGT CCCAGGCAGA AGCGCGTGCT AACCTTGCAC TCGCAGAAGC AGTAAGTGCG CAACAAAACG TAGAAACCAC TCGCCAGACC GCAGAAGCCG ATCGTGCTAA ACAAGTTGCC CTAATCGCTG CCGCGCAGGA TGCAGAAACC AAAGCGGTTG AACTGACCGT GCGGGCGAAA GCAGAAAAAG AAGCCGCAGA GATGCAGGCG GCGGCTATCG TTGAGTTAGC CGAAGCTACA CGTAAAAAGG GTCTGGCGGA AGCAGAAGCA CAACGTGCGC TGAACGATGC TATCAACGTA CTTTCTGATG AACAAACCAG CCTTAAATTC AAACTGGCCT TGTTGCAGGC GCTACCTGCG GTAATAGAAA AATCCGTTGA GCCGATGAAA TCTATTGACG GTATCAAGAT TATTCAGGTC GATGGTCTGA ATCGTGGCAG CGCTGCGGGT GATGCAAACA CGGGTAATGT GGGGGGCGGA AACCTGGCGG AACAAGCATT ATCAGCCGCT CTCTCTTACC GCACACAGGC ACCGCTGATT GACTCCTTGC TCAATGAAAT TGGCGTTTCA GGCGGCTCAC TGGCGGCATT GACTTCACCC TTAACCTCAA CAACTCCCGT CGCCGAAAAC GTAGAATAA
|
Protein sequence | MFTAIIAVCI LFIIGIIFAR LYRRASAEQA FVRTGLGGQK VVMSGGAIVM PIFHEIIPIN MNTLKLEVSR STIDSLITKD RMRVDVVVAF FVRVKPSVEG IATAAQTLGQ RTLSPEDLRM LVEDKFVDAL RATAAQMTMH ELQDTRENFV QGVQNTVAED LSKNGLELES VSLTNFNQTS KEHFNPNNAF DAEGLTKLIQ ETERRRRERN EVEQDVEVAV REKNRDALSR KLEIEQQEAF MTLEQEQQVK TRTAEQNARI AAFEAERRRE AEQTRILAER QIQETEIERE QAVRSRKVEA EREVRIKEIE QQQVTEIANQ TKSIAIAAKS EQQSQAEARA NLALAEAVSA QQNVETTRQT AEADRAKQVA LIAAAQDAET KAVELTVRAK AEKEAAEMQA AAIVELAEAT RKKGLAEAEA QRALNDAINV LSDEQTSLKF KLALLQALPA VIEKSVEPMK SIDGIKIIQV DGLNRGSAAG DANTGNVGGG NLAEQALSAA LSYRTQAPLI DSLLNEIGVS GGSLAALTSP LTSTTPVAEN VE
|
| |