Gene EcolC_0628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_0628 
Symbol 
ID6065695 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp676833 
End bp678353 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID641600034 
Productmethyl-accepting chemotaxis sensory transducer with Pas/Pac sensor 
Protein accessionYP_001723631 
Protein GI170018677 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein
[COG2202] FOG: PAS/PAC domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCTC ATCCGTATGT CACCCAGCAA AATACCCCGC TGGCGGACGA TACCACTCTG 
ATGTCCACTA CCGATCTGCA AAGCTATATC ACTCATGCTA ATGACACTTT TGTGCAGGTG
AGCGGCTTTA CCTTGCAAGA GTTACAAGGG CAGCCGCACA ACATGGTGCG TCACCCGGAT
ATGCCAAAAG CGGCGTTTGC GGATATGTGG TTCACCCTGA AAAAAGGGGA GCCCTGGAGC
GGCATCGTGA AAAATCGCCG CAAAAATGGC GACCATTATT GGGTGCGGGC CAATGCGGTA
CCGATGGTGC GCGAGGGAAA AATCAGTGGC TATATGTCGA TTCGTACCCG GGCGACGGAT
GAAGAGATCG CGGCGGTGGA GCCGCTGTAC AAAGCGTTGA ACTCCGGACG TACCAGTAAG
CGTATTCATA AAGGCCTGGT GGTGCGTAAA GGCTGGCTGG GTAAACTGCC TTCATTACCG
CTTCGCTGGC GGGCGCGTGG CGTGATGACC CTGATGTTTA TCTTGCTGGC GGCCATGCTT
TGGTTTGTTG CTGCCCCGGT GGTGACGTAT TTCCTCTGTG TGTTAGTGGT ATTGTTGGCA
AGCGCTTGTT TTGAATGGCA GATTGTGCGC CCGATAGAAA ATGTCGCCCG TCAGGCACTG
AAGGTGGCGA CCGGAGAGCG TAATAGTGTT GAGCACCTGA ATCGCAGCGA TGAGCTGGGG
CTGACATTAC GCGCGGTAGG GCAGCTTGGC CTGATGTGCC GTTGGCTAAT TAACGATGTC
TCAAGCCAGG TGTCCAGTGT CAGAAATGGC AGTGAGACGC TGGCGAAAGG CACCGATGAA
CTGAACGAAC ATACCCAGCA GACAGTTGAT AACGTTCAGC AAACGGTGGC GACCATGAAC
CAAATGGCGG CGTCGGTGAA ACAGAACTCT GCCACGGCGT CGGCTGCCGA TAAACTTTCT
ATCACCGCCA GTAATGCGGC AGTGCAGGGT GGGGAGGCGA TGACCACGGT GATCAAGACA
ATGGACGATA TCGCCGACAG TACCCAGCGC ATTGGCACCA TTACTTCGCT GATTAACGAT
ATTGCGTTTC AGACCAATAT TCTGGCCCTG AATGCGGCGG TGGAAGCGGC GCGTGCCGGC
GAACAGGGCA AAGGTTTTGC AGTGGTGGCA GGGGAAGTGC GTCATTTAGC CAGCCGCAGC
GCTAATGCTG CCAACGATAT TCGCAAGCTG ATTGATGCCA GTGCTGATAA GGTGCAATCC
GGTTCGCAGC AGGTACACGC CGCCGGACGG ACGATGGAAG ATATTGTGGC ACAGGTGAAA
AACGTCACCC AGTTGATCGC CCAGATTAGC CATTCAACGC TGGAACAGGC CGATGGGCTT
TCCAGCCTGA CCCGTGCAGT GGATGAGCTT AACCTGATCA CCCAGAAAAA TGCCGAGCTG
GTGGAAGAGA GTGCGCAGGT GTCGGCGATG GTGAAACACC GCGCCAGCCG ACTGGAAGAC
GCGGTGACGG TGCTGCATTA A
 
Protein sequence
MSSHPYVTQQ NTPLADDTTL MSTTDLQSYI THANDTFVQV SGFTLQELQG QPHNMVRHPD 
MPKAAFADMW FTLKKGEPWS GIVKNRRKNG DHYWVRANAV PMVREGKISG YMSIRTRATD
EEIAAVEPLY KALNSGRTSK RIHKGLVVRK GWLGKLPSLP LRWRARGVMT LMFILLAAML
WFVAAPVVTY FLCVLVVLLA SACFEWQIVR PIENVARQAL KVATGERNSV EHLNRSDELG
LTLRAVGQLG LMCRWLINDV SSQVSSVRNG SETLAKGTDE LNEHTQQTVD NVQQTVATMN
QMAASVKQNS ATASAADKLS ITASNAAVQG GEAMTTVIKT MDDIADSTQR IGTITSLIND
IAFQTNILAL NAAVEAARAG EQGKGFAVVA GEVRHLASRS ANAANDIRKL IDASADKVQS
GSQQVHAAGR TMEDIVAQVK NVTQLIAQIS HSTLEQADGL SSLTRAVDEL NLITQKNAEL
VEESAQVSAM VKHRASRLED AVTVLH