Gene Acel_2051 details


Gene Information

Locus tag: Acel_2051
Symbol:
ID: 4484730
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidothermus cellulolyticus 11B
Kingdom: Bacteria
Replicon accession: NC_008578
Strand:
Start bp: 2324743
End bp: 2326218
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 66%
IMG OID: 639730847
Product: methyl-accepting chemotaxis sensory transducer
Protein accession: YP_873809
Protein GI: 117929258
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
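
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2324743
end_bp = 2326218

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A coding sequence is read in codons of three bases.
codons, leftover = divmod(gene_length, 3)

# Assumption: the final codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, "bp")      # 1476 bp
print(protein_length, "aa")   # 491 aa
```

Both results match the Gene Length and Protein Length fields of the record.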


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGGAAG CTGCGCAGAC CCGAGTCCGC GATTGGCTGG CGGCCGTGGT CTCGCCGCGG 
AGCCGGCGGG ACGCGCGCGC TCTCCAGGGT TACCTGCGAC GAATCGCCTC CGGTGACCTC
GTATCGTCGC CGCCCGTAAT GCGTACCCGG TGGGCGCACG AGATCTTGAC CGGTATCGAC
GAACTTCGTG GCCAGTGGTC TGCGCACCTC AACACGGACC GGCGGATCGT CCGCGGGCTG
GCCACCGAAT GGCGGCGGAT GAACGACGTG GCCTGGGAGA TGATGTCGGC GTCCGAAAAT
ACCGTCCGTG ACGTGGCGGA CGCCGCAGCG TCAGCAACCG AGCTCTCCGA ACGGATTACG
GTCATCGCAC ACGGAGTGGA AGAAACCGCG GCAACAATTC GGAGCATCGC CGACCACGCA
TCGCAAACGT CGACGGTCTC CTCGCACGGC GCCAAGAAGG CGGAAATCGC CCGAGAATGC
TTTGAAACCC TGCGTGCCGC CACGCAACGG GTGGAAGACA TTGTCAAACT GATCGTCCGC
ATCGCGTCCC AGACGCAATT GCTCGCCCTC AACGCGTCGA TTGAAGCGGC GCGAGCCGGT
GAGCTGGGGC GCGGCTTCGC GGTCGTCGCC GGCGAAGTGA AGCAGTTGGC CGAGGCAACC
GGGCGGGCCA CCGACGATGT GATCGCTACC ATGCGGGACA TCGAAACCGG CTCGGCAGCC
GCCGCGGACG CGGTCCGAGA CGTCACCGTC ACGATTGATC AGGTGGACGC CGGACAGGCG
GCCATCGCCG CGGCGGTCGT CCAGCAGACG GCGACAACAA GTTCGATCGG GGACAGCGCG
TCGGCCGCAG CCGAACGCGC GACCGTTCTC GCCGAGCACG TCAAGGCGCT CACGCACGCG
GTCCGGCTCT CCGCTTACGC CGGTGCTCAG GCCCGCACGG TCGCCGCGGA AGTGGCGCAC
GCCGAGCAGG CACTGAACGA CGCATTGGCG CGGTGGACCT TCGCCGAGAT GCCCGACGAT
GACGTCGGGG AACCCGCGGC CGGCTTGGAC CCGGATTCCG GTGTCACGAA AGAGAACGGC
GTCATCACCA TCCAGAATTA TGCGATCGGC GAAGGCTTGC ATCGCTTTTC ATACCGCGGG
CGATGGGGAC ACGCGACAGC GAATATCGAG GCGGAAGGCA CGAATTCCCA TTCGAGCATG
CCCGGCGACA CGGCGACCCT GCGCTTTTCG GGACGGCAAA TCCGGTTCTA CGGCGTCCTC
GCACCGAACC ACGGTCTGGC CAGCGTCCGC GTCGACGACC AGCCGGAGAC GATTATCGAT
CAGTACGCCG AGCAACGGGT GCACGGGGCC TTGCAGTGGG AGAGTCCGCT GCTGCCGCCG
GGTGAGCACA CGTTCACGCT CACGGTTCTC GGTGAGGCGA ATCCCAAGTC CCGCTACGTC
TGGGTCAACA TCGACCGGGT CGAGGTCGTC GAATAA
 
Protein sequence
MAEAAQTRVR DWLAAVVSPR SRRDARALQG YLRRIASGDL VSSPPVMRTR WAHEILTGID 
ELRGQWSAHL NTDRRIVRGL ATEWRRMNDV AWEMMSASEN TVRDVADAAA SATELSERIT
VIAHGVEETA ATIRSIADHA SQTSTVSSHG AKKAEIAREC FETLRAATQR VEDIVKLIVR
IASQTQLLAL NASIEAARAG ELGRGFAVVA GEVKQLAEAT GRATDDVIAT MRDIETGSAA
AADAVRDVTV TIDQVDAGQA AIAAAVVQQT ATTSSIGDSA SAAAERATVL AEHVKALTHA
VRLSAYAGAQ ARTVAAEVAH AEQALNDALA RWTFAEMPDD DVGEPAAGLD PDSGVTKENG
VITIQNYAIG EGLHRFSYRG RWGHATANIE AEGTNSHSSM PGDTATLRFS GRQIRFYGVL
APNHGLASVR VDDQPETIID QYAEQRVHGA LQWESPLLPP GEHTFTLTVL GEANPKSRYV
WVNIDRVEVV E
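
The protein sequence is the translation of the gene sequence under translation table 11. As a minimal sketch of that relationship, the code below translates a coding sequence without external libraries. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Amino acids for all 64 codons, with bases ordered T, C, A, G at each position.
# These assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # The record prints the sequence in blocks of 10, so strip whitespace first.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
first_block = "ATGGCGGAAG CTGCGCAGAC CCGAGTCCGC"
print(translate(first_block))  # MAEAAQTRVR
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence to the same function should give the full protein, ending before the terminal TAA stop codon.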