Gene SbBS512_E4761 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4761 
Symbol 
ID6271458 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4443183 
End bp4444598 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content55% 
IMG OID641728516 
Productaminotransferase, classes I and II superfamily 
Protein accessionYP_001882911 
Protein GI187732129 
COG category[E] Amino acid transport and metabolism
[K] Transcription 
COG ID[COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones55 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATAACAG ATGTCTGGAA ATATAGGGGC AAATCCACCG AGCGGATTGA GCAAGGGCTG 
TATCGTCACG GGGATAAATT GCCGTCGGTG CGCAGCTTAA GTCAGGAGCA CGGCGTCAGC
ATCAGCACCG TGCAGCAGGC GTACCAGACG CTGGAGACGA TGAAGCTCAT CACTCCGCAG
CCGCGTTCGG GTTATTTTGT CGCACAACGT AAAGCCCAGC CGCCTGTACC GCCGATGACG
CGTCCGGTGC AGCGCCCGGT GGAAATTACC CAGTGGGATC AGGTGCTGGA TATGCTGGAG
GCGCATAGCG ACAATTCCAT TGTTCCGTTA AGCAAAAGCA CGCCGGATGT CGAAACGCCC
AGCCTGAAAC CACTCTGGCG TGAGCTAAGC CGGGTGGTGC AGCATAATCT ACAAACCGTG
CTCGGTTATG ACTTGTTAGC CGGTCAGCGA GTGTTGCGGG AGCAGATTGC CCGCCAGATG
CTCGACAGCG GCTCGGTGGT CACCGCCGAT GACATCATCA TCACCAGCGG CTGCCATAAC
TCGATGTCGC TGGCGTTAAT GGCGGTGTGT AAACCGGGCG ATATTGTCGC GGTCGAATCT
CCCTGTTATT ACGGTTCGAT GCAGATGCTG CGAGGCATGG GCGTGAAAGT GATTGAAATC
CCAACCGATC CAGAAACTGG CATCAGCGTT GAAGCACTGG AACTGGCGCT GGAACAGTGG
CCGATTAAAG GCATCATTCT GGTGCCAAAC TGTAATAATC CGCTGGGATT TATTATGCCG
GACGCACGCA AACGGGCCGT TCTCTCTCTC GCTCAGCGTC ATGATATTGT GATTTTTGAA
GATGATGTCT ACGGCGAACT GGCAACGGAG TATCCGCGCC CGCGGACCAT CCATTCATGG
GATATCGACG GGCGAGTGCT GTTGTGCAGC TCGTTCAGTA AAAGTATTGC TCCAGGCCTG
CGCGTGGGTT GGGTCGCACC GGGGCGTTAT CACGATAAAC TGATGCATAT GAAATACGCC
ATCAGCAGCT TTAATGTGCC GTCCACGCAA ATGGCGGCGG CAACGTTTGT GTTGGAAGGC
CACTATCATC GCCATATCCG GCGGATGCGG CAGACTTATC AGCGCAATCT GGCGCTTTAT
ACCTGCTGGA TACGGGAATA TTTTCCCTAC GAAATCTGTA TTACGCGCCC GAAAGGCGGA
TTTTTACTGT GGATCGAATT GCCTGAACAG GTCGATATGG TCTGCGTCGC GCGGCAGCTG
TACCGCATGA AAATCCAGGT GGCGGCAGGC TCGATTTTCT CGGCTTCCGG CAAATACCGT
AATTGTCTGC GCATCAACTG CGCTTTGCCG CTCAGCGAAA CCTATCGCGA AGCACTAAAG
CAAATTGGCG ATGCCGTGTA TCGGGCAATG GAATAA
 
Protein sequence
MITDVWKYRG KSTERIEQGL YRHGDKLPSV RSLSQEHGVS ISTVQQAYQT LETMKLITPQ 
PRSGYFVAQR KAQPPVPPMT RPVQRPVEIT QWDQVLDMLE AHSDNSIVPL SKSTPDVETP
SLKPLWRELS RVVQHNLQTV LGYDLLAGQR VLREQIARQM LDSGSVVTAD DIIITSGCHN
SMSLALMAVC KPGDIVAVES PCYYGSMQML RGMGVKVIEI PTDPETGISV EALELALEQW
PIKGIILVPN CNNPLGFIMP DARKRAVLSL AQRHDIVIFE DDVYGELATE YPRPRTIHSW
DIDGRVLLCS SFSKSIAPGL RVGWVAPGRY HDKLMHMKYA ISSFNVPSTQ MAAATFVLEG
HYHRHIRRMR QTYQRNLALY TCWIREYFPY EICITRPKGG FLLWIELPEQ VDMVCVARQL
YRMKIQVAAG SIFSASGKYR NCLRINCALP LSETYREALK QIGDAVYRAM E