Gene SbBS512_E4817 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4817 
Symbol 
ID6269091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4488321 
End bp4489823 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content53% 
IMG OID641728559 
Producthypothetical protein 
Protein accessionYP_001882953 
Protein GI187730306 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.00312271 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGAAC CCCTATTAAT TGCCCGCACG CCGGACACAG AACTTTTTTT ACTGCCGGGA 
ATGGCTAACC GTCACGGGCT GATTACTGGC GCAACGGGGA CGGGTAAAAC CGTTACGCTG
CAAAAACTGG CAGAGTCATT GTCGGAAATC GGCGTGCCGG TGTTTATGGC TGATGTGAAA
GGCGATCTGA CCGGTATCGC GCAGGCAGGA ACGGCGTCGG AAAAACTGCT CACAAGGCTT
AAAAATATCG GCGTCAATGA CTGGCAACCG CATGCTAATC CGGTGGTGGT GTGGGATATC
TTTGGCGAGA AAGGCCATCC GGTGCGGGCG ACGGTTTCAG ACCTGGGGCC GCTGTTGCTG
GCGCGGCTGT TGAATCTCAA CGATGTGCAA TCTGGCGTGC TGAATATCAT CTTCCGTATT
GCTGACGATC AGGGATTGTT GCTGCTCGAC TTTAAAGATC TGCGGGCGAT TACCCAGTAC
ATCGGCGATA ACGCCAAATC TTTCCAGAAT CAGTACGGTA ATATCAGTAG CGCATCGGTT
GGTGCCATCC AGCGCGGTTT ACTGTCGCTG GAAAAGCAAG GTGCGGCGCA TTTCTTTGGC
GAGCCGATGC TGGATATCAA AGACTGGATG CGCACCGATA CCAACGGTAA AGGCGTTATC
AATATCCTCA GCGCCGAGAA GCTTTATCAG ATGCCGAAAC TGTACGCCGC CAGCCTGCTG
TGGATGCTTT CAGAGTTGTA TGAACAATTG CCGGAAGCGG GCGATCTGGA GAAACCAAAA
CTGGTGTTTT TCTTCGACGA GGCACATCTG CTGTTTAACG ATGCACCGCA GGTACTGCTG
GATAAGATTG AGCAGGTAAT AAGGCTTATT CGCTCAAAAG GCGTGGGCGT CTGGTTCGTT
TCGCAAAACC CGTCTGATAT TCCGGATAAT GTGCTCGGGC AGCTCGGTAA TCGCGTTCAA
CACGCTTTGC GGGCTTTTAC GCCCAAAGAT CAGAAAGCGG TAAAAGCTGC GGCGCAAACC
ATGCGGGCCA ATCCGGCGTT TGATACCGAA AAGGCAATCC AGGAACTGGG GACCGGCGAG
GCGTTAATCT CGTTTCTCGA TGCAAAAGGA AGTCCTTCTG TGGTGGAACG GGCGATGGTG
ATTGCGCCTT GTTCGCGGAT GGGACCGGTG ACGGAAGATG AGCGCAATGG CTTGATTAAC
CACTCCCCGG TGTATGGCAA GTACGAGGAT GACGTGGACC GGGAATCCGC CTATGAGATG
TTGCAAAAAG GCTTTCAGGC CAGTACCAAG CAGCAAAATA ATCCTCCCGC GAAAGGGAAA
GCGGTGGCGG TGGATGACGG TATTCTTGGT GGATTGAAGG ATATTTTGTT TGGCACTACC
GGACCACGCG GCGGGAAGAA AGATGGTGTG GTGCAAACAA TGGCGAAAAG CGCCGCCCGC
CAGGTGACGA ATCAGATTGT ACGTGGGATG TTGGGGAGTT TGCTGGGGGG AAGAAGAAGG
TAA
 
Protein sequence
MSEPLLIART PDTELFLLPG MANRHGLITG ATGTGKTVTL QKLAESLSEI GVPVFMADVK 
GDLTGIAQAG TASEKLLTRL KNIGVNDWQP HANPVVVWDI FGEKGHPVRA TVSDLGPLLL
ARLLNLNDVQ SGVLNIIFRI ADDQGLLLLD FKDLRAITQY IGDNAKSFQN QYGNISSASV
GAIQRGLLSL EKQGAAHFFG EPMLDIKDWM RTDTNGKGVI NILSAEKLYQ MPKLYAASLL
WMLSELYEQL PEAGDLEKPK LVFFFDEAHL LFNDAPQVLL DKIEQVIRLI RSKGVGVWFV
SQNPSDIPDN VLGQLGNRVQ HALRAFTPKD QKAVKAAAQT MRANPAFDTE KAIQELGTGE
ALISFLDAKG SPSVVERAMV IAPCSRMGPV TEDERNGLIN HSPVYGKYED DVDRESAYEM
LQKGFQASTK QQNNPPAKGK AVAVDDGILG GLKDILFGTT GPRGGKKDGV VQTMAKSAAR
QVTNQIVRGM LGSLLGGRRR