Gene EcSMS35_2831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2831 
SymbolnorR 
ID6144446 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2906002 
End bp2907516 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content57% 
IMG OID641617701 
Productanaerobic nitric oxide reductase transcription regulator 
Protein accessionYP_001744856 
Protein GI170679621 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.00139306 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTTTTT CCGTTGATGT GCTGGCGAAT ATCGCCATCG AATTGCAGCG TGGGATTGGT 
CATCAGGATC GTTTTCAGCG CCTGATCACC ACGCTGCGTC AGGTGCTGGA GTGTGATGCC
TCTGCGTTGC TACGTTATGA CTCGCGGCAG TTTATTCCGC TTGCCATCGA CGGGCTGGCG
AAGGATGTAC TCGGTAGACG CTTTGCGCTG GAAGGGCATC CACGGCTGGA AGCGATTGCC
CGCGCCGGGG ACGTGGTGCG CTTTCCCGCA GACAGCGAAT TGCCCGATCC CTATGACGGT
TTGATTCCCG GGCAGGAGAG TCTGAAGGTT CACGCCTGCG TTGGTCTGCC ATTGTTTGCC
GGGCAAAACC TGATCGGCGC ATTGACGCTC GACGGGATGC AGCCCGATCA GTTCGATGTT
TTCAGCGACG AAGAGTTACG CCTGATTGCC GCGCTGGCGG CGGGAGCGTT AAGCAATGCG
TTGCTGATTG AGCAACTGGA AAGCCAGAAT ATGCTGCCGG GCGATGCCGC GCCGTTTGAA
GCGGTGAAAC AGACGCAGAT GATCGGTCTG TCGCCAGGCA TGACGCAATT GAAAAAAGAG
ATTGAGATTG TGGCGGCGTC CGATCTCAAC GTCTTAATCA GCGGTGAGAC GGGAACCGGT
AAGGAGCTGG TGGCGAAAGC GATTCATGAA GCCTCGCCAA GGGCGGTGAA TCCGCTGGTC
TATCTCAACT GTGCCGCACT GCCGGAAAGT GTGGCGGAAA GTGAGTTGTT CGGGCACGTG
AAAGGGGCGT TTACTGGCGC TATCAGTAAC CGCAGCGGGA AGTTTGAAAT GGCGGATAAC
GGCACGCTGT TTCTGGATGA GATCGGCGAG TTGTCGTTGG CATTGCAGGC CAAGCTGCTG
CGGGTGTTGC AGTATGGCGA TATTCAGCGC GTTGGCGATG ACCGCAGTTT GCGGGTCGAT
GTGCGCGTGC TGGCGGCGAC TAACCGTGAC TTACGCGAAG AGGTACTGGC AGGGCGATTT
CGCGCTGACT TGTTCCACCG CCTGAGCGTG TTTCCACTTT CAGTGCCGCC GCTGCGTGAG
CGGGGCGATG ATGTCATTCT GCTGGCGGGG TATTTCTGCG AACAGTGCCG CCTGCGGTTG
GGGCTCTCCC GCGTGGTATT AAGTGCCAGA GCGCGAAATT TACTGCAACA CTACAATTTT
CCGGGGAACG TTCGTGAACT GGAACATGCT ATTCATCGGG CGGTAGTGCT GGCGAGAGCC
ACTCGCAGCG GCGATGAAGT GATTCTTGAA GCGCAACATT TTGCTTTTCC TGAGGTGACG
TTGCCGCCGC CAGAAGCGGC GGCGGTGCCT ATCGTTAAGC AAAACCTGCG TGAAGCGACA
GAAGCGTTCC AGCGTGAAAC CATTCGTCAG GCACTGGCAC AAAATCATCA CAACTGGGCT
GCCAGCGCAC GGATGCTGGA AACCGACGTC GCCAACTTGC ATCGGTTGGC GAAACGTCTG
GGGCTGAAGG ATTAA
 
Protein sequence
MSFSVDVLAN IAIELQRGIG HQDRFQRLIT TLRQVLECDA SALLRYDSRQ FIPLAIDGLA 
KDVLGRRFAL EGHPRLEAIA RAGDVVRFPA DSELPDPYDG LIPGQESLKV HACVGLPLFA
GQNLIGALTL DGMQPDQFDV FSDEELRLIA ALAAGALSNA LLIEQLESQN MLPGDAAPFE
AVKQTQMIGL SPGMTQLKKE IEIVAASDLN VLISGETGTG KELVAKAIHE ASPRAVNPLV
YLNCAALPES VAESELFGHV KGAFTGAISN RSGKFEMADN GTLFLDEIGE LSLALQAKLL
RVLQYGDIQR VGDDRSLRVD VRVLAATNRD LREEVLAGRF RADLFHRLSV FPLSVPPLRE
RGDDVILLAG YFCEQCRLRL GLSRVVLSAR ARNLLQHYNF PGNVRELEHA IHRAVVLARA
TRSGDEVILE AQHFAFPEVT LPPPEAAAVP IVKQNLREAT EAFQRETIRQ ALAQNHHNWA
ASARMLETDV ANLHRLAKRL GLKD