Gene EcolC_1185 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1185 
Symbol 
ID6066794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1298836 
End bp1300848 
Gene Length2013 bp 
Protein Length670 aa 
Translation table11 
GC content53% 
IMG OID641600601 
ProductNifA subfamily transcriptional regulator 
Protein accessionYP_001724179 
Protein GI170019225 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.132433 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATGT CAGACGAGGC GATGTTTGCC CCGCCGCAAG GAATAACAAT TGAAGCGGTA 
AACGGAATGC TCGCGGAGCG GTTAGCGCAG AAACACGGTA AGGCGTCTTT ATTACGCGCC
TTCATCCCGC TGCCGCCGCC GTTCAGCCCG GTACAACTTA TTGAACTGCA TGTTCTCAAA
AGCAACTTCT ATTACCGCTA CCATGATGAT GGCAGCGATG TGACGGCAAC AACAGAGTAT
CAGGGCGAGA TGGTCGATTA TTCGCGTCAC GCCGTCCTTC TCGGCAGTAG TGGAATGGCG
GAGCTACGCT TTATTCGCAC CCACGGCAGT CGTTTTACTC CCCAGGATTG CACACTGTTT
AACTGGCTGG CGCGGATAAT CACCCCGGTT CTACAATCAT GGCTCAATGA TGAAGAACAG
CAGGTGGCGC TGCGTTTGCT GGAGAAAGAT CGCGATCATC ATCGGGTACT GGTTGATATT
ACTAATGCAG TGCTGTCACA TCTTGATCTC GACGATCTGA TCGCTGACGT CGCTCGTGAG
ATCCATCATT TTTTCGGTCT GGCTTCAGTC AGTATGGTAC TGGGCGATCA TCGAAAGAAC
GAGAAGTTTA GCCTGTGGTG CAGCGATCTT TCTGCCTCAC ATTGTGCGTG TCTGCCACGC
AATATGCCTG GCGACAGTGT ATTGCTGACA CAAACGCTAC AAACCCGACA ACCGACCTTG
ACGCACCGTG CAGACGATCT GTTTCTCTGG CAACGCGACC CGTTATTACT CTTACTTGCA
TCTAACGGCT GCGAATCTGC GCTCCTTATA CCGCTTACCT TTGGCAACCA TACACCGGGT
GCATTGTTGC TGGCGCATAC CTCTTCCACT CTCTTTAGTG AGGAAAACTG CCAGCTACTA
CAACACATAG CCGATCGCAT CGCTATTGCC GTTGGCAATG CCGATGCCTG GCGTAGCATG
ACCGATTTGC AGGAAAGTTT GCAGCAAGAA AACCACCAGC TTAGCGAGCA GCTCCTTTCG
AATCTGGGCA TCGGTGACAT TATCTATCAA AGCCAGGCAA TGGAAGACCT ACTCCAGCAG
GTAGATATTG TGGCGAAGAG CGACAGTACG GTGTTGATTT GCGGTGAAAC CGGAACCGGC
AAAGAGGTGA TCGCCAGAGC GATCCATCAA CTTAGCCCGC GACGCGACAA GCCGCTGGTC
AAAATCAACT GCGCTGCCAT CCCCGCCAGT CTTCTGGAAA GTGAGTTATT CGGTCATGAC
AAAGGGGCGT TTACTGGTGC GATTAATACC CATCGTGGTC GTTTTGAAAT TGCCGATGGC
GGCACGTTGT TTCTCGATGA AATTGGCGAT CTGCCGTTAG AACTTCAGCC TAAACTGCTG
CGCGTATTGC AGGAACGGGA GATTGAGCGT CTCGGCGGGA GTAGAACGAT CCCGGTAAAT
GTCAGAGTCA TTGCCGCCAC CAACCGTGAT TTGTGGCAAA TGGTTGAAGA TCGCCAGTTT
CGCAGCGATC TCTTTTATCA CCTGAATGTC TTCCCACTGG AATTGCCGCC GCTGCGCGAC
CGTCCGGAAG ATATCCCTCT TTTAGCAAAG CATTTCACGC AAAAAATGGC GCGCCATATG
AATCGCGCAA TTGACGCCAT CCCGACCGAG GCACTACGCC AGTTGATGTC GTGGGATTGG
CCGGGCAACG TGCGCGAGCT GGAAAACGTG ATTGAGCGGG CGGTACTGTT GACTCGTGGT
AACAGTCTGA ATTTACATCT AAATGTCCGA CAAAGCCGTT TACTGCCGAC GCTAAATGAA
GATTCAGCGC TTCGCAGTTC AATGGCGCAG TTGCTGCACC CGACGACGCC AGAGAATGAC
GAAGAAGAAC GTCAGCGCAT TGTTCAGGTA TTGCGAGAAA CCAATGGCAT TGTTGCCGGG
CCCCGTGGCG CGGCGACACG ATTAGGGATG AAGCGCACCA CGCTGCTGTC ACGAATGCAG
CGTCTGGGGA TCTCGGTTCG CGAGGTGTTG TAA
 
Protein sequence
MAMSDEAMFA PPQGITIEAV NGMLAERLAQ KHGKASLLRA FIPLPPPFSP VQLIELHVLK 
SNFYYRYHDD GSDVTATTEY QGEMVDYSRH AVLLGSSGMA ELRFIRTHGS RFTPQDCTLF
NWLARIITPV LQSWLNDEEQ QVALRLLEKD RDHHRVLVDI TNAVLSHLDL DDLIADVARE
IHHFFGLASV SMVLGDHRKN EKFSLWCSDL SASHCACLPR NMPGDSVLLT QTLQTRQPTL
THRADDLFLW QRDPLLLLLA SNGCESALLI PLTFGNHTPG ALLLAHTSST LFSEENCQLL
QHIADRIAIA VGNADAWRSM TDLQESLQQE NHQLSEQLLS NLGIGDIIYQ SQAMEDLLQQ
VDIVAKSDST VLICGETGTG KEVIARAIHQ LSPRRDKPLV KINCAAIPAS LLESELFGHD
KGAFTGAINT HRGRFEIADG GTLFLDEIGD LPLELQPKLL RVLQEREIER LGGSRTIPVN
VRVIAATNRD LWQMVEDRQF RSDLFYHLNV FPLELPPLRD RPEDIPLLAK HFTQKMARHM
NRAIDAIPTE ALRQLMSWDW PGNVRELENV IERAVLLTRG NSLNLHLNVR QSRLLPTLNE
DSALRSSMAQ LLHPTTPEND EEERQRIVQV LRETNGIVAG PRGAATRLGM KRTTLLSRMQ
RLGISVREVL