Gene SbBS512_E3277 details


Gene Information

Locus tag: SbBS512_E3277
Symbol:
ID: 6270614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 3049294
End bp: 3053094
Gene Length: 3801 bp
Protein Length: 1266 aa
Translation table: 11
GC content: 53%
IMG OID: 641727183
Product: hypothetical protein
Protein accession: YP_001881636
Protein GI: 187731278
COG category: [S] Function unknown
COG ID: [COG3164] Predicted membrane protein
TIGRFAM ID: [TIGR02099] conserved hypothetical protein TIGR02099
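
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. Both assumptions are consistent with the listed values but are not stated in the record itself, and the function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of a CDS, assuming the length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


gene_len = span_length(3049294, 3053094)
print(gene_len)                  # 3801
print(protein_length(gene_len))  # 1266
```

Both results agree with the Gene Length and Protein Length fields.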


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.682788
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAGGCGAT TGCCGGGGAT TTTACTGCTT ACTGGAGCCG CGCTCGTTGT GATCGCAGCC 
CTGCTGGTTA GCGGCCTGCG TATTGCTTTA CCGCATCTTG ACGCCTGGCG TCCGGAAATC
CTCAACAAAA TAGAATCCGC GACTGGCATG CCGGTAGAAG CCAGTCAGCT CTCAGCCAGC
TGGCAGAATT TTGGCCCGAC GCTTGAAGCA CACGACATCC GTGCAGAACT AAAAGATGGC
GGCGAATTTT CGGTTAAACG CGTTACTCTG GCGCTGGATG TCTGGCAGAG CTTGTTACAT
ATGCGCTGGC AGTTTCGCGA CCTCACTTTC TGGCAGCTGC GCTTTCGCAC CAACACTCCT
ATCACCAGCG GTGGTGGTAA TGATAGCCTG GAAGCCAGTC ACATCAGCGA TCTGTTTCTT
CGTCAATTTG ACCATTTCGA TCTCCGCGAC AGTGAAGTCA GTTTTCTGAC GCCATCCGGT
CAGCGCGCCG AGCTGGCGAT CCCACAACTC ACCTGGCTGA ACGATCCACG TCGACACCGT
GCGGAAGGCC TGGTAAGCCT CTCCAGCCTT ACCGGACAGC ACGGCGTGAT GCAGGTGCGC
ATGGATTTGC GCGATGATGA GGGGTTGTTA AGCAATGGTC GCGTCTGGCT ACAGGCGGAT
GACATCGACC TGAAGCCGTG GCTCGGTAAA TGGATGCAGG ACAATATTGC TCTGGAAACG
GCACAGTTCT CCCTTGAAGG CTGGATGACG ATCGACAAAG GCGATGTAAC CGGCGGTGAC
GTCTGGCTGA AACAGGGCGG TGCCAGCTGG TTGGGCGAGA AGCAAACGCA TACGCTGTCG
GTGGATAATC TGACCGCGCA TATTACGCGT GAAAATCCGG GCTGGCAGTT CTCTATTCCC
GATACACGGA TCACGATGGA CGGCAAACCC TGGCCGAGCG GAGCATTGAC GCTGGCCTGG
ATACCGGAAC AGGACGTTGG CGGCAAAGAC AATAAACGCA GTGACGAACT CCGGATTCGC
GCCAGTAATC TGGAGCTGGC AGGCCTGGAG GGCATACGCC CGCTGGCCGC GAAACTTTCA
CCTGCACTGG GTGATGTTTG GCGCTCCACA CAACCGAGCG GCAAGATTAA CACTCTGGCG
CTGGATATCC CGCTTCAGGC GGCAGACAAG ACCCGTTTTC AGGCATCGTG GAGCGATCTG
GCCTGGAAGC AATGGAAATT ATTACCGGGT GCGGAACACT TCTCCGGGAC GCTTTCCGGC
AGCGTTGAAA ATGGTTTGCT TACCGCGTCG ATGAAGCAGG CAAAGATGCC TTACGAAACG
GTATTCCGTG CGCCACTGGA AATCGCCGAC GGCCAGGCAA CTATAAGCTG GCTGAACAAT
GACAAAGGTT TCCAGCTGGA TGGGCGTAAT ATTGACGTTA AAGCCAAAGC CGTCCATGCG
CGCGGCGGTT TTCGTTACCT GCAACCTGCT AACGATGAAC CCTGGCTGGG TATTCTGGCT
GGCATCAGTA CCGATGATGG TTCACAAGCC TGGCGCTATT TCCCGGAAAA CTTGATGGGT
AAAGACCTAG TTGATTACTT AAGTGGCGCG ATTCAGGGCG GTGAAGCGGA TAACGCGACG
CTGGTTTATG GTGGCAATCC GCAACTCTTC CCCTATAAAC ACAACGAAGG TCAGTTTGAA
GTGCTGGTGC CGCTGCGTAA CGCGAAGTTT GCCTTCCAGC CGGACTGGCC TGCATTAACT
AACCTTGATA TTGAACTGGA CTTTATTAAC GACGGTTTAT GGATGAAAAC CGATGACGTT
AATCTGGGCG GCGTGCGCGC GAGTAATCTC ACCGCAGTGA TCCCTGACTA CTCGAAAGAA
AAACTGCTGA TTGACGCTGA CATTAAAGGT CCGGGTAAAG CCGTTGGCCC TTACTTTGAT
GAGACACCGC TGAAAGATTC TCTGGGTGCG ACCCTGCAAG AACTCCAGCT CGACGGCGAT
GTGAATGCTC GCTTACATCT TGATATCCCG CTGAACGGCG AACTGGTAAC CGCGAAAGGT
GAAGTGACGC TGCGTAATAA CAGTCTGTTT ATCAAACCAC TCGACAGCAC CCTGAAAAAT
TTGAGCGGTA AATTCAGCTT TATCAATGGC GATCTGCAAA GTGAACCACT GACAGCAAGC
TGGTTTAATC AGCCGTTGAA CGTGGATTTT TCCACCAAAG AAGGGGCAAA AGCCTACCAG
GTTGCGGTAA ACCTCAACGG TAACTGGCTA CCGGCGAAAA CCGGCGTTCT GCCTGCAGCG
GTGAACGAAG CATTGAGTGG CAGCGTGGCG TGGGATGGTA AAGTGGGCAT TGATCTGCCT
TATCATGCTG GCGCGACCTA TAACGTAGAG CTGAACGGCG ATCTGAATAA TGTGAGCAGT
CACTTACCTT CACCGTTAGC CAAACCTGCG GGTGAACCAC TGCCGGTAAA CGTTAAGGTT
GATGGCAATC TCAGCAGCTT TGAATTAACC GGACAGGCTG GTGCGGATAA CCATTTCAAT
AGCCGCTGGT TGCTTGGTCA AAAGCTGACG CTCGATCGTG CTATTTGGGC GGCAGACAGT
AAAACGCTCC CGCCGTTGCC GGAACAAAGT GGCGTTGAAC TCAATATGCC GCCGATGAAT
GGTGCCGAGT GGCTGGCCCT GTTCCAGAAA GGCGCTGCGG AGAGTGTCGG TGGTGCAGCG
AGTTTCCCAC AACACATAAC GTTACGTACG CCTATGTTGT CACTGGGAAA TCAGCAATGG
AATAACCTGA GTATTGTTTC GCAACCGACG GCAAATGGCA CCCTGGTTGA GGCGCAGGGG
CGTGAAATCA ATGCCACGCT GGCGATGCGT AATAACGCGC CGTGGCTGGC GAATATCAAA
TATCTTTATT ACAACCCGAG CGTGGCGAAA ACTCGTGGTG ATTCAACACC GTCATCACCT
TTCCCGACAA CGGAGCGCAT TAACTTCCGT GGCTGGTCGG ACGCACAAAT ACGATGCACA
GAGTGCTGGT TCTGGGGGCA AAAATTCGGT CGTATTGACA GTGATCTCAC CATTTCTGGC
GATACGTTAA CGCTGACCAA TGGACTGATT GATACTGGTT TCTCGCGGCT TACTGCCGAT
GGTGAATGGG TTAATAATCC GGGGAATGAA CGTACCTCGC TGAAAGGAAA ACTGCGCGGG
CAGAAAATTG ATGCCGCCGC AGAATTTTTT GGTGTCACGA CGCCCATACG CCAGTCGTCA
TTTAATGTGG ATTACGATTT ACACTGGCGC AAAGCACCCT GGCAACCAGA TGAGGCGACG
TTGAATGGCA TCATTCATAC TCAACTGGGT AAAGGCGAAA TTACCGAAAT CAATACCGGA
CATGCCGGGC AATTGCTGCG CTTATTGAGC GTTGATGCCC TGATGCGTAA GCTGCGTTTT
GATTTCAGAG ACACTTTTGG CGAAGGGTTC TATTTTGACT CCATTCGCAG CACCGCGTGG
ATTAAAGACG GCGTTATGCA CACCGACGAC ACGCTGGTGG ATGGCCTGGA GGCGGATATC
GCCATGAAAG GGTCGGTAAA TCTGGTACGT CGCGACCTGA ATATGGAAGC GGTTGTCGCA
CCAGAGATTT CTGCGACGGT GGGCGTGGCT GCGGCTTTTG CGGTTAACCC CATTGTTGGC
GCGGCAGTAT TTGCAGCCAG TAAAGTGCTG GGGCCGCTGT GGAGCAAAGT CTCCATTTTG
CGCTATCACA TTTCGGGTCC GCTGGACGAT CCGCAAATCA ACGAAGTGTT GCGCCAACCG
CGTAAAGAAA AAGCGCAATG A
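
The gene sequence is laid out in blocks of ten bases separated by spaces, so it has to be cleaned before any computation. The following is a minimal sketch of how the blocks could be joined and a GC percentage computed. The helper names are hypothetical, and only the first line of the sequence is used here as sample input, so the printed percentage describes that line and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied as displayed (six blocks of ten).
first_line = "GTGAGGCGAT TGCCGGGGAT TTTACTGCTT ACTGGAGCCG CGCTCGTTGT GATCGCAGCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full 3801 bp sequence through the same two functions gives a value that can be compared with the GC content field.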
 
Protein sequence
MRRLPGILLL TGAALVVIAA LLVSGLRIAL PHLDAWRPEI LNKIESATGM PVEASQLSAS 
WQNFGPTLEA HDIRAELKDG GEFSVKRVTL ALDVWQSLLH MRWQFRDLTF WQLRFRTNTP
ITSGGGNDSL EASHISDLFL RQFDHFDLRD SEVSFLTPSG QRAELAIPQL TWLNDPRRHR
AEGLVSLSSL TGQHGVMQVR MDLRDDEGLL SNGRVWLQAD DIDLKPWLGK WMQDNIALET
AQFSLEGWMT IDKGDVTGGD VWLKQGGASW LGEKQTHTLS VDNLTAHITR ENPGWQFSIP
DTRITMDGKP WPSGALTLAW IPEQDVGGKD NKRSDELRIR ASNLELAGLE GIRPLAAKLS
PALGDVWRST QPSGKINTLA LDIPLQAADK TRFQASWSDL AWKQWKLLPG AEHFSGTLSG
SVENGLLTAS MKQAKMPYET VFRAPLEIAD GQATISWLNN DKGFQLDGRN IDVKAKAVHA
RGGFRYLQPA NDEPWLGILA GISTDDGSQA WRYFPENLMG KDLVDYLSGA IQGGEADNAT
LVYGGNPQLF PYKHNEGQFE VLVPLRNAKF AFQPDWPALT NLDIELDFIN DGLWMKTDDV
NLGGVRASNL TAVIPDYSKE KLLIDADIKG PGKAVGPYFD ETPLKDSLGA TLQELQLDGD
VNARLHLDIP LNGELVTAKG EVTLRNNSLF IKPLDSTLKN LSGKFSFING DLQSEPLTAS
WFNQPLNVDF STKEGAKAYQ VAVNLNGNWL PAKTGVLPAA VNEALSGSVA WDGKVGIDLP
YHAGATYNVE LNGDLNNVSS HLPSPLAKPA GEPLPVNVKV DGNLSSFELT GQAGADNHFN
SRWLLGQKLT LDRAIWAADS KTLPPLPEQS GVELNMPPMN GAEWLALFQK GAAESVGGAA
SFPQHITLRT PMLSLGNQQW NNLSIVSQPT ANGTLVEAQG REINATLAMR NNAPWLANIK
YLYYNPSVAK TRGDSTPSSP FPTTERINFR GWSDAQIRCT ECWFWGQKFG RIDSDLTISG
DTLTLTNGLI DTGFSRLTAD GEWVNNPGNE RTSLKGKLRG QKIDAAAEFF GVTTPIRQSS
FNVDYDLHWR KAPWQPDEAT LNGIIHTQLG KGEITEINTG HAGQLLRLLS VDALMRKLRF
DFRDTFGEGF YFDSIRSTAW IKDGVMHTDD TLVDGLEADI AMKGSVNLVR RDLNMEAVVA
PEISATVGVA AAFAVNPIVG AAVFAASKVL GPLWSKVSIL RYHISGPLDD PQINEVLRQP
RKEKAQ
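
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is an illustration, not the pipeline that produced this record. It builds the codon table from the usual TCAG ordering, whose amino acid assignments are shared by the standard code and translation table 11, and it reads the first codon as methionine when it is one of the common bacterial initiators ATG, GTG or TTG. Table 11 allows a few more initiators, which are left out here as a simplifying assumption. The function name is hypothetical.

```python
from itertools import product

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons in TTT, TTC, TTA, TTG, TCT, ... order paired with their residues.
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplification: only the most common bacterial start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(text: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        aa = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last three blocks of the gene sequence above.
print(translate_cds("GTGAGGCGAT TGCCGGGGAT TTTACTGCTT"))  # MRRLPGILLL
print(translate_cds("CGTAAAGAAA AAGCGCAATG A"))           # RKEKAQ
```

The two fragments reproduce the first ten and the last six residues of the protein sequence, including the GTG start codon being read as M.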