Gene EcolC_0981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0981 
Symbol 
ID6067810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1067646 
End bp1069724 
Gene Length2079 bp 
Protein Length692 aa 
Translation table11 
GC content53% 
IMG OID641600389 
ProductNifA subfamily transcriptional regulator 
Protein accessionYP_001723977 
Protein GI170019023 
COG category[K] Transcription
[T] Signal transduction mechanisms 
COG ID[COG3604] Transcriptional regulator containing GAF, AAA-type ATPase, and DNA binding domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.427916 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATATA CACCGATGAG TGATCTCGGA CAACAAGGGT TGTTCGACAT CACTCGGACA 
CTATTGCAGC AGCCCGATCT GGCCTCGCTG TGTGAGGCTC TTTCGCAACT GGTAAAGCGT
TCTGCGCTCG CCGACAACGC GGCTATTGTG TTGTGGCAAG CGCAGACTCA ACGTGCGTCT
TATTACGCGT CGCGTGAAAA AGACACCCCC ATTAAATATG AAGACGAAAC TGTTCTGGCA
CACGGTCCGG TACGCAGCAT TTTGTCGCGC CCTGATACGC TGCATTGCAG TTACGAAGAA
TTTTGTGAAA CCTGGCCGCA GCTGGACGCA GGTGGGCTAT ACCCAAAATT TGGTCACTAT
TGCCTGATGC CACTGGCGGC GGAAGGGCAT ATTTTTGGTG GCTGTGAATT TATTCGTTAT
GACGATCGCC CCTGGAGCGA AAAAGAGTTC AATCGTCTGC AAACATTTAC GCAGATCGTT
TCTGTCGTCA CCGAACAAAT CCAGAGCCGC GTCGTTAACA ATGTCGACTA TGAGTTGTTA
TGCCGGGAAC GCGATAACTT CCGCATCCTG GTCGCCATCA CCAACGCGGT GCTTTCCCGC
CTGGATATGG ACGAACTGGT CAGCGAAGTC GCCAAAGAAA TCCATTACTA TTTCGACATT
GACGATATCA GTATCGTCTT ACGCAGCCAC CGTAAAAACA AACTCAACAT CTACTCCACT
CACTATCTTG ATAAACAGCA TCCCGCCCAC GAACAGAGCG AAGTCGATGA AGCCGGAACC
CTCACCGAAC GCGTGTTCAA AAGTAAAGAG ATGCTGCTGA TCAATCTCCA CGAGCGGGAC
GATTTAGCCC CCTATGAACG CATGTTGTTC GACACCTGGG GCAACCAGAT TCAAACCTTG
TGCCTGTTAC CGCTGATGTC TGGCGACACC ATGCTGGGCG TGCTGAAACT GGCGCAATGC
GAAGAGAAAG TGTTTACCAC TACCAATCTG AATTTACTGC GCCAGATTGC CGAACGTGTG
GCAATCGCTG TCGATAACGC CCTCGCCTAT CAGGAAATCC ATCGTCTGAA AGAACGGCTG
GTTGATGAAA ACCTCGCCCT GACCGAGCAG CTCAACAATG TTGATAGTGA ATTTGGCGAG
ATTATTGGCC GCAGCGAAGC CATGTACAGC GTGCTTAAAC AAGTTGAAAT GGTGGCGCAA
AGTGACAGTA CCGTGCTGAT CCTCGGTGAA ACTGGCACGG GTAAAGAGCT GATTGCCCGT
GCGATCCATA ATCTCAGTGG GCGTAATAAT CGCCGCATGG TCAAAATGAA CTGCGCGGCG
ATGCCTGCCG GATTGCTGGA AAGCGATCTG TTTGGTCATG AGCGTGGGGC TTTTACCGGT
GCCAGCGCCC AGCGTATCGG TCGTTTTGAA CTGGCGGATA AAAGCTCCCT GTTCCTCGAC
GAAGTGGGCG ATATGCCACT GGAGTTACAG CCGAAGTTGC TGCGTGTATT GCAGGAACAG
GAGTTTGAAC GTCTCGGCAG CAACAAAATC ATTCAGACGG ACGTGCGTCT AATCGCCGCG
ACTAACCGCG ATCTGAAAAA AATGGTCGCC GACCGTGAGT TCCGTAGCGA TCTCTATTAC
CGCCTGAACG TATTCCCGAT TCACCTGCCG CCACTACGCG AGCGTCCGGA AGATATTCCG
CTGCTGGCGA AAGCCTTTAC CTTCAAAATT GCCCGTCGTC TGGGGCGCAA TATCGACAGC
ATTCCTGCCG AGACGCTGCG CACCTTGAGC AACATGGAGT GGCCGGGTAA CGTACGCGAA
CTGGAAAACG TCATTGAGCG CGCGGTATTG CTAACACGCG GTAACGTGCT GCAGCTGTCA
TTGCCAGATA TTGTTTTACC GGAACCTGAA ACGCCGCCTG CCGCAACGGT TGTCGCCCTG
GAGGGCGAAG ATGAATATCA GTTGATTGTG CGCGTGCTGA AAGAAACCAA CGGCGTGGTT
GCCGGGCCTA AAGGCGCTGC GCAACGTCTG GGGCTGAAAC GCACGACCCT GCTGTCACGG
ATGAAGCGGC TGGGAATTGA TAAATCGGCA TTGATTTAA
 
Protein sequence
MSYTPMSDLG QQGLFDITRT LLQQPDLASL CEALSQLVKR SALADNAAIV LWQAQTQRAS 
YYASREKDTP IKYEDETVLA HGPVRSILSR PDTLHCSYEE FCETWPQLDA GGLYPKFGHY
CLMPLAAEGH IFGGCEFIRY DDRPWSEKEF NRLQTFTQIV SVVTEQIQSR VVNNVDYELL
CRERDNFRIL VAITNAVLSR LDMDELVSEV AKEIHYYFDI DDISIVLRSH RKNKLNIYST
HYLDKQHPAH EQSEVDEAGT LTERVFKSKE MLLINLHERD DLAPYERMLF DTWGNQIQTL
CLLPLMSGDT MLGVLKLAQC EEKVFTTTNL NLLRQIAERV AIAVDNALAY QEIHRLKERL
VDENLALTEQ LNNVDSEFGE IIGRSEAMYS VLKQVEMVAQ SDSTVLILGE TGTGKELIAR
AIHNLSGRNN RRMVKMNCAA MPAGLLESDL FGHERGAFTG ASAQRIGRFE LADKSSLFLD
EVGDMPLELQ PKLLRVLQEQ EFERLGSNKI IQTDVRLIAA TNRDLKKMVA DREFRSDLYY
RLNVFPIHLP PLRERPEDIP LLAKAFTFKI ARRLGRNIDS IPAETLRTLS NMEWPGNVRE
LENVIERAVL LTRGNVLQLS LPDIVLPEPE TPPAATVVAL EGEDEYQLIV RVLKETNGVV
AGPKGAAQRL GLKRTTLLSR MKRLGIDKSA LI