Gene EcolC_1431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1431 
Symbol 
ID6067643 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1574651 
End bp1576477 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content49% 
IMG OID641600850 
Productsensory histidine kinase AtoS 
Protein accessionYP_001724421 
Protein GI170019467 
COG category[T] Signal transduction mechanisms 
COG ID[COG3852] Signal transduction histidine kinase, nitrogen specific 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.250093 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00150562 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATTATA TGAAGTGGAT TTATCCACGC CGCTTACGCA ATCAAATGAT CCTGATGGCA 
ATCCTGATGG TCATTGTCCC AACGCTTACT ATTGGTTATA TCGTAGAAAC GGAAGGATGT
TCAGCAGTCT TATCTGAAAA AGAGAAAAAA CTTTCTGCCG TGGTCAACCT GCTTAATCAG
GCGCTAGGCA ATCGCTACGA TCTCTACATC GACTTACCGC GTGAGGAGCG TATCCGCGCA
TTAAATGCAG AACTTGCCCC CATTACCGAA AATATCACTC ACGCCTTCCC TGGCATCGGC
GCTGGTTATT ACAACAAAAC GCTGGATGCG ATAATCACCT ACGCGCCTTC AGCGCTATAT
CAGAATAATG TCGGCGTTAC CATTGCCGCA GATCACCCTG GTCGCGAAGT CATGCGTACA
AATACCCCTT TGGTTTATTC AGGCAGGCAG GTGCGCGGCG ATATTTTGAA TTCAATGATC
CCCATTGAGC GTAATGGTGA AATCCTCGGC TATATCTGGG CCAATGAATT AACCGAAGAT
ATTCGCCGCC AGGCCTGGAA AATGGATGTG AGGATTATCA TTGTGCTCAC CGCCGGTTTG
CTGATAAGCC TGCTGTTGAT TGTCCTTTTC TCCCGTCGCC TGAGCGCCAA TATTGATATC
ATCACCGATG GCCTCTCGAC TCTGGCACAA AATATTCCCA CTCGATTACC ACAATTGCCC
GGTGAAATGG GGCAAATCAG TCAGAGTGTT AATAACCTCG CCCAGGCACT GCGTGAAACG
CGGACACTTA ACGATCTGAT TATTGAAAAC GCTGCCGATG GCGTCATTGC CATTGACCGC
CAGGGTGATG TAACCACCAT GAACCCAGCA GCAGAAGTTA TCACTGGCTA TCAACGCCAT
GAACTGGTAG GGCAGCCTTA CTCCATGTTG TTCGACAATA CTCAGTTCTA CAGTCCAGTA
CTGGATACGC TGGAACATGG CACCGAACAT GTGGCGCTGG AGATCAGTTT TCCAGGTCGT
GACCGCACCA TTGAACTCAG TGTCACTACC AGTCGTATTC ATAACACGCA CGGTGAAATG
ATAGGTGCTT TGGTGATTTT CTCTGATTTA ACTGCCCGCA AAGAAACCCA GCGCCGCATG
GCGCAAGCAG AACGCCTCGC CACACTGGGT GAGCTGATGG CTGGCGTCGC GCATGAAGTA
CGTAATCCGT TAACGGCTAT TCGTGGTTAT GTACAGATCT TGCGCCAACA AACCAGTGAC
CCAATACATC AGGAATATCT GTCCGTAGTA CTCAAAGAAA TCGATTCAAT TAACAAAGTT
ATTCAGCAAT TGCTCGAATT TTCACGTCCA CGCCACAGTC AATGGCAACA AGTCAGCCTC
AATGCATTGG TTGAAGAAAC TCTGGTACTG GTACAAACCG CCGGCGTACA AGCGCGGGTC
GACTTCATAA GCGAACTGGA TAATGAATTA AGCCCGATTA ACGCCGATCG TGAACTGCTC
AAACAGGTAC TACTGAATAT CCTGATCAAT GCCGTCCAGG CTATCAGCGC ACGAGGGAAA
ATTCGCATTC AAACCTGGCA ATACAGCGAC TCACAACAGG CCATTTCGAT AGAGGACAAC
GGCTGTGGCA TTGATCTCTC GCTGCAAAAA AAGATCTTCG ATCCCTTTTT CACCACCAAA
GCCTCAGGAA CCGGGCTTGG TCTGGCGTTA AGTCAACGCA TCATTAATGC CCATCAGGGT
GATATTCGCG TCGCCAGTTT GCCGGGCTAC GGCGCAACCT TCACGCTTAT TTTACCGATC
AACCCGCAGG GAAATCAGAC TGTATGA
 
Protein sequence
MHYMKWIYPR RLRNQMILMA ILMVIVPTLT IGYIVETEGC SAVLSEKEKK LSAVVNLLNQ 
ALGNRYDLYI DLPREERIRA LNAELAPITE NITHAFPGIG AGYYNKTLDA IITYAPSALY
QNNVGVTIAA DHPGREVMRT NTPLVYSGRQ VRGDILNSMI PIERNGEILG YIWANELTED
IRRQAWKMDV RIIIVLTAGL LISLLLIVLF SRRLSANIDI ITDGLSTLAQ NIPTRLPQLP
GEMGQISQSV NNLAQALRET RTLNDLIIEN AADGVIAIDR QGDVTTMNPA AEVITGYQRH
ELVGQPYSML FDNTQFYSPV LDTLEHGTEH VALEISFPGR DRTIELSVTT SRIHNTHGEM
IGALVIFSDL TARKETQRRM AQAERLATLG ELMAGVAHEV RNPLTAIRGY VQILRQQTSD
PIHQEYLSVV LKEIDSINKV IQQLLEFSRP RHSQWQQVSL NALVEETLVL VQTAGVQARV
DFISELDNEL SPINADRELL KQVLLNILIN AVQAISARGK IRIQTWQYSD SQQAISIEDN
GCGIDLSLQK KIFDPFFTTK ASGTGLGLAL SQRIINAHQG DIRVASLPGY GATFTLILPI
NPQGNQTV