Gene SbBS512_E2859 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E2859 
Symbol 
ID6270365 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp2656720 
End bp2658435 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content55% 
IMG OID641726804 
Producthydrogenase-4, G subunit 
Protein accessionYP_001881277 
Protein GI187733753 
COG category[C] Energy production and conversion 
COG ID[COG0852] NADH:ubiquinone oxidoreductase 27 kD subunit
[COG3261] Ni,Fe-hydrogenase III large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACGTTA ATTCATCGTC AAATCGTGGC GAAGCGATTC TCGCCGCCCT GAAAACGCAG 
TTCCCCGGCG CGGTGCTGGA TGAAGAGCGA CAAACGCCTG AACAGGTCAC CATTACGGTG
AAAATCAATC TGCTGCCTGA CGTTGTACAT TATCTTTATT ATCAACATGA TGGCTGGCTT
CCAGTCCTGT TTGGCAACGA CGAGCGGACA CTTAACGGTC ATTACGCGGT TTATTATGCC
CTTTCAATGG AAGGGGCCGA AAAATGCTGG ATCGTGGTGA AGGCACTGGT CGATGCCGAC
AGTCGGGAGT TTCCGTCAGT CACACCGCGC GTCCCTGCCG CGGTCTGGGG CGAGCGAGAA
ATTCGCGATA TGTACGGGCT GATTCCGGTT GGCCTGCCGG ATCAGCGTCG CCTGGTGTTG
CCCGATGACT GGCCGGAAGA TATGCATCCG CTGCGCAAAG ATGCGATGGA TTATCGACTG
CGCCCTGAAC CGACGACAGA TTCCGAAACG TATGCGTTTA TCAATGAGGG CAACAGCGAT
GCGCGGGTGA TCCCTGTCGG CCCGCTGCAT ATCACCTCCG ATGAACCGGG TCACTTCCGC
TTGTTTGTGG ATGGCGAGCA AATTGTCGAT GCTGATTACC GCCTGTTTTA TGTCCATCGC
GGCATGGAGA AACTGGCAGA AACGCGGATG GGCTACAACG AAGTGACCTT CTTATCGGAC
CGCGTGTGTG GGATTTGCGG TTTTGCCCAC AGTGTGGCCT ATACCAACTC GGTTGAAAAT
GCACTGGGGA TTGAGGTGCC GCAACGAGCG CATACCATTC GCTCGATTCT GCTGGAAGTC
GAACGGCTAC ACAGTCATTT GCTCAACCTT GGCCTCTCCT GCCATTTTGT TGGTTTTGAT
ACCGGCTTTA TGCAATTTTT CCGCGTGCGG GAAAAGTCGA TGACGATGGC GGAATTGCTG
ACCGGGTCGC GTAAAACCTA CGGTCTGAAT CTGATTGGTG GTGTTCGCCG CGATATTCTC
AAAGAGCAAC GTCTGCAAAC GCTGAAACTG GTGCGCGAGA TGCGCGCCGA CGTGTCGGAG
CTGGTAGAGA TGCTGCTTGC TACGCCGAAT ATGGAACAAC GCACTCAGGA CATTGGCATT
CTCGACCGAC AAATCGCCCG TGATTATAGC CCTGTAGGGC CGCTGATCCG CGGCAGTGGT
TTTGCCCGTG ATTTGCGCTT TGATCACCCC TACGCCGACT ACGGCAATAT TCCAAAAACA
CTGTTTACCT TCACCGGCGG CGATGTTTTC TCCCGCGTGA TGGTCCGTGT CAAAGAGACG
TTTGATTCGC TGGCAATGCT GGAATTTGCC CTCGACAACA TGCTGGATAC CCCACTGCTG
ACCGAAGGCT TTAGCTATAA ACCTCACGCA TTCGCGCTGG GCTTTGTTGA AGCGCCACGC
GGTGAAGACG TGCACTGGAG CATGCTCGGT GATAACCAAA AATTGTTCCG CTGGCGCTGC
CGTGCCGCCA CCTACGCCAA CTGGCCGGTG TTGCGTTACA TGCTGCGCGG CAATACCGTT
TCTGACGCAC CGCTGATTAT CGGTAGCCTT GATCCCTGCT ACTCCTGTAC CGACCGTGTG
ACGCTGGTAG ATGTGCGCAA GCGCCAGTCA AAAACCGTGC CGTATAAAGA GATCGAACGC
TACGGCATTG ATCGTAACCG TTCGCCGCTG AAGTAA
 
Protein sequence
MNVNSSSNRG EAILAALKTQ FPGAVLDEER QTPEQVTITV KINLLPDVVH YLYYQHDGWL 
PVLFGNDERT LNGHYAVYYA LSMEGAEKCW IVVKALVDAD SREFPSVTPR VPAAVWGERE
IRDMYGLIPV GLPDQRRLVL PDDWPEDMHP LRKDAMDYRL RPEPTTDSET YAFINEGNSD
ARVIPVGPLH ITSDEPGHFR LFVDGEQIVD ADYRLFYVHR GMEKLAETRM GYNEVTFLSD
RVCGICGFAH SVAYTNSVEN ALGIEVPQRA HTIRSILLEV ERLHSHLLNL GLSCHFVGFD
TGFMQFFRVR EKSMTMAELL TGSRKTYGLN LIGGVRRDIL KEQRLQTLKL VREMRADVSE
LVEMLLATPN MEQRTQDIGI LDRQIARDYS PVGPLIRGSG FARDLRFDHP YADYGNIPKT
LFTFTGGDVF SRVMVRVKET FDSLAMLEFA LDNMLDTPLL TEGFSYKPHA FALGFVEAPR
GEDVHWSMLG DNQKLFRWRC RAATYANWPV LRYMLRGNTV SDAPLIIGSL DPCYSCTDRV
TLVDVRKRQS KTVPYKEIER YGIDRNRSPL K