Gene SbBS512_E4344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4344 
SymbolglnG 
ID6269202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp4058250 
End bp4059659 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content57% 
IMG OID641728153 
Productnitrogen regulation protein NR(I) 
Protein accessionYP_001882566 
Protein GI187733434 
COG category[T] Signal transduction mechanisms 
COG ID[COG2204] Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains 
TIGRFAM ID[TIGR01818] nitrogen regulation protein NR(I) 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGAG GGATAGTCTG GGTAGTCGAT GACGATAGTT CCATCCGTTG GGTGCTTGAA 
CGTGCGCTCG CTGGAGCGGG TTTAACCTGT ACGACATTTG AGAACGGCGC GGAAGTACTG
GAGGCGCTGG CGAGCAAAAC GCCGGATGTG CTGCTTTCAG ATATCCGTAT GCCGGGAATG
GACGGGCTGG CGCTGCTCAA GCAGATTAAA CAGCGCCATC CAATGCTTCC GGTCATCATT
ATGACCGCAC ATTCCGATCT GGATGCTGCC GTCAGCGCCT ATCAACAAGG GGCGTTTGAT
TATCTGCCCA AACCGTTTGA TATCGACGAA GCCGTGGCGC TGGTTGAGCG CGCTATCAGT
CATTACCAGG AACAGCAGCA GCCGCGTAAT ATTCAGCTTA ACGGCCCAAC GACCGATATC
ATCGGCGAAG CGCCAGCCAT GCAGGACGTG TTCCGTATTA TCGGTCGGCT TTCGCGTTCT
TCTATTAGCG TGCTGATTAA CGGCGAATCC GGCACCGGTA AAGAACTGGT CGCTCATGCC
CTGCATCGCC ACAGTCCGCG CGCCAAAGCG CCGTTTATCG CGCTGAATAT GGCAGCTATC
CCAAAAGATT TGATCGAATC AGAACTGTTT GGCCACGAGA AAGGCGCGTT TACTGGCGCG
AATACCATTC GTCAGGGGCG TTTTGAACAG GCCGATGGCG GTACATTATT CCTCGACGAA
ATTGGTGATA TGCCGCTGGA TGTGCAGACG CGTTTGCTGC GCGTGCTGGC AGACGGTCAG
TTTTACCGCG TTGGCGGCTA TGCGCCGGTG AAAGTGGATG TGCGGATTAT CGCTGCCACT
CACCAGAATC TCGAACAGCG AGTGCAGGAA GGTAAGTTCC GTGAGGATCT GTTCCACCGC
CTAAACGTTA TCCGCGTTCA TCTGCCGCCG CTGCGCGAAC GTCGGGAAGA TATTCCCCGT
CTGGCGCGCC ATTTTTTACA GGTTGCCGCG CGCGAACTGG GCGTAGAAGC GAAGTTACTG
CATCCGGAAA CCGAAACTGC TCTGACGCGT CTGGCGTGGC CAGGCAACGT GCGCCAGCTG
GAAAACACCT GCCGCTGGCT AACGGTGATG GCCGCCGGGC AGGAAGTGTT GATTCAGGAT
TTGCCCGGCG AACTGTTTGA ATCAACGGTT GCGGAGAGTA CTTCGCAAAT GCAACCGGAC
AGCTGGGCGA CGCTTCTTGC GCAGTGGGCA GACAGAGCGC TGCGTTCCGG TCATCAAAAT
CTGCTTTCCG AAGCGCAGCC AGAGCTGGAG CGGACGTTAC TGACGACCGC GTTGCGACAT
ACGCAGGGGC ATAAACAGGA AGCGGCGCGG CTACTCGGCT GGGGCCGCAA CACCCTGACG
CGTAAGTTAA AAGAGCTGGG GATGGAGTGA
 
Protein sequence
MQRGIVWVVD DDSSIRWVLE RALAGAGLTC TTFENGAEVL EALASKTPDV LLSDIRMPGM 
DGLALLKQIK QRHPMLPVII MTAHSDLDAA VSAYQQGAFD YLPKPFDIDE AVALVERAIS
HYQEQQQPRN IQLNGPTTDI IGEAPAMQDV FRIIGRLSRS SISVLINGES GTGKELVAHA
LHRHSPRAKA PFIALNMAAI PKDLIESELF GHEKGAFTGA NTIRQGRFEQ ADGGTLFLDE
IGDMPLDVQT RLLRVLADGQ FYRVGGYAPV KVDVRIIAAT HQNLEQRVQE GKFREDLFHR
LNVIRVHLPP LRERREDIPR LARHFLQVAA RELGVEAKLL HPETETALTR LAWPGNVRQL
ENTCRWLTVM AAGQEVLIQD LPGELFESTV AESTSQMQPD SWATLLAQWA DRALRSGHQN
LLSEAQPELE RTLLTTALRH TQGHKQEAAR LLGWGRNTLT RKLKELGME