Gene EcolC_1003 details


Gene Information

Locus tag: EcolC_1003
Symbol:
ID: 6067653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1091773
End bp: 1093287
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 57%
IMG OID: 641600411
Product: anaerobic nitric oxide reductase transcription regulator
Protein accession: YP_001723999
Protein GI: 170019045
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains
TIGRFAM ID:
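
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 1091773
end_bp = 1093287

# Inclusive span in base pairs.
gene_length = end_bp - start_bp + 1

# Number of codons, including the terminal stop codon.
codons = gene_length // 3

# Assumption: exactly one stop codon, which is not part of the protein.
protein_length = codons - 1

print(gene_length, protein_length)  # 1515 504
```

Both values agree with the Gene Length and Protein Length fields of the record.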


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00158351
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000580065
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTTTTT CCGTTGATGT GCTGGCGAAT ATCGCCATCG AATTGCAGCG TGGGATTGGT 
CATCAGGATC GTTTTCAGCG CCTGATCACC ACGCTACGTC AGGTGCTGGA GTGCGATGCG
TCTGCGTTGC TACGTTACGA TTCGCGGCAG TTTATTCCGC TTGCCATCGA CGGTCTGGCA
AAGGATGTAC TCGGTAGACG CTTTGCGCTG GAAGGGCATC CACGGCTGGA AGCGATTGCC
CGCGCCGGGG ATGTGGTGCG CTTTCCCGCA GACAGCGAAT TGCCCGATCC CTATGACGGT
TTGATTCCTG GGCAGGAGAG TCTGAAGGTT CACGCCTGCG TTGGTCTGCC ATTGTTTGCC
GGGCAAAACC TGATCGGCGC ACTGACGCTC GACGGGATGC AGCCCGATCA GTTCGATGTT
TTCAGCGACG AAGAGCTACG GCTGATTGCT GCGCTGGCGG CGGGAGCGTT AAGCAATGCG
TTGCTGATTG AACAACTGGA AAGCCAGAAT ATGATGCCAG GCGATGCCAC GCCGTTTGAA
GCGGTGAAAC AGACGCAGAT GATTGGCTTG TCCCCTGGCA TGACGCAACT GAAAAAAGAG
ATTGAGATTG TGGCGGCGTC CGATCTCAAC GTCCTGATCA GCGGTGAGAC GGGAACCGGT
AAGGAGCTGG TGGCGAAAGC GATTCATGAA GCCTCGCCAC GGGCGGTGAA TCCGCTGGTC
TATCTCAACT GTGCTGCACT GCCGGAAAGT GTGGCGGAAA GTGAGTTGTT CGGGCATGTG
AAAGGAGCGT TTACTGGCGC TATCAGTAAC CGCAGCGGGA AGTTTGAAAT GGCGGATAAC
GGCACTCTGT TTCTGGATGA GATCGGCGAG TTGTCGTTGG CATTGCAGGC CAAGCTGCTG
AGGGTGTTGC AGTATGGCGA TATTCAGCGC GTTGGCGATG ACCGTAGTTT GCGGGTCGAT
GTGCGCGTGC TGGCGGCGAC TAACCGCGAC TTACGCGAAG AGGTGCTGGC AGGGCGATTT
CGCGCTGACT TGTTTCATCG CCTGAGCGTG TTTCCACTTT CGGTGCCGCC GCTGCGTGAG
CGGGGCGATG ATGTCATTCT GCTGGCGGGG TATTTCTGCG AGCAGTGTCG TTTGCGGCTG
GGGCTCTCCC GCGTGGTATT AAGTGCCGGA GCGCGAAATT TACTGCAACA CTATCGTTTT
CCGGGGAACG TGCGCGAACT GGAACATGCT ATTCATCGGG CGGTAGTGCT GGCGAGAGCC
ACCCGCAACG GCGATGAAGT GATTCTTGAG GCGCAACATT TTGCTTTTCC TGAGGTGACG
TTGCCGCCGC CAGAAGCGGC GGCGGTGCCC GTTGTTAAGC AAAACCTGCG TGAAGCGACA
GAAGCGTTCC AGCGTGAAAC TATTCGCCAG GCACTGGCAC AAAATCATCA TAACTGGGCT
GCCTGCGCGC GGATGCTGGA AACCGACGTC GCCAACCTGC ATCGGCTGGC GAAACGTCTG
GGAATGAAGG ATTAA
 
Protein sequence
MSFSVDVLAN IAIELQRGIG HQDRFQRLIT TLRQVLECDA SALLRYDSRQ FIPLAIDGLA 
KDVLGRRFAL EGHPRLEAIA RAGDVVRFPA DSELPDPYDG LIPGQESLKV HACVGLPLFA
GQNLIGALTL DGMQPDQFDV FSDEELRLIA ALAAGALSNA LLIEQLESQN MMPGDATPFE
AVKQTQMIGL SPGMTQLKKE IEIVAASDLN VLISGETGTG KELVAKAIHE ASPRAVNPLV
YLNCAALPES VAESELFGHV KGAFTGAISN RSGKFEMADN GTLFLDEIGE LSLALQAKLL
RVLQYGDIQR VGDDRSLRVD VRVLAATNRD LREEVLAGRF RADLFHRLSV FPLSVPPLRE
RGDDVILLAG YFCEQCRLRL GLSRVVLSAG ARNLLQHYRF PGNVRELEHA IHRAVVLARA
TRNGDEVILE AQHFAFPEVT LPPPEAAAVP VVKQNLREAT EAFQRETIRQ ALAQNHHNWA
ACARMLETDV ANLHRLAKRL GMKD
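
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following Python sketch shows one way to strip that grouping, compute GC content, and translate the nucleotide sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments translation table 11 shares (table 11 differs in the start codons it permits). Only the first and last lines of the gene sequence are used here; applying the functions to the full sequence is left to the reader.

```python
from itertools import product

# Standard codon table in TCAG order; "*" marks a stop codon.
# Translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in reading frame 1; stop codons appear as '*'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAGTTTTT CCGTTGATGT GCTGGCGAAT ATCGCCATCG AATTGCAGCG TGGGATTGGT"
last_line = "GGAATGAAGG ATTAA"

print(translate(first_line))  # MSFSVDVLANIAIELQRGIG
print(translate(last_line))   # GMKD*
print(round(gc_percent(first_line), 1))
```

The two translations match the start (MSFSVDVLAN IAIELQRGIG) and end (GMKD) of the protein sequence shown above, with the trailing `*` corresponding to the TAA stop codon.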