Gene B21_02891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02891 
Symbolaer 
ID8116634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3082842 
End bp3084362 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content55% 
IMG OID644849079 
Producthypothetical protein 
Protein accessionYP_003000652 
Protein GI251786348 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTTCTC ATCCGTATGT CACCCAGCAA AATACCCCGC TGGCGGACGA TACCACTCTG 
ATGTCCACTA CCGATCTGCA AAGCTATATC ACTCATGCTA ATGACACTTT TGTGCAGGTG
AGCGGCTATA CCTTGCAAGA GTTACAAGGG CAGCCGCACA ACATGGTGCG TCACCCGGAT
ATGCCAAAAG CGGCGTTTGC GGATATGTGG TTCACCCTGA AAAAAGGGGA GCCCTGGAGC
GGCATCGTGA AAAATCGCCG CAAAAATGGT GACCATTATT GGGTGCGGGC CAATGCGGTA
CCGATGGTGC GCGAGGGAAA AATCAGTGGC TATATGTCGA TTCGTACCCG GGCGACGGAT
GAAGAGATCG CGGCGGTGGA GCCGCTGTAC AAAGCGTTGA ACGCCGGACG TACCAGTAAG
CGTATTCATA AAGGCCTGGT GGTGCGTAAA GGCTGGCTGG GTAAACTGCC TTCATTACCG
CTTCGCTGGC GGGCGCGTGG AGTGATGACC CTGATGTTTA TCTTGCTGGC GGCCATGCTT
TGGTTTGTTG CTGCCCCGGT GGTGACGTAT ATCCTCTGTG CGTTAGTGGT ATTGTTGGCA
AGCGCCTGTT TTGAATGGCA GATTGTGCGC CCGATAGAAA ATGTTGCCCA TCAGGCACTG
AAGGTGGCGA CCGGAGAACG TAATAGTGTT GAGCATCTGA ATCGCAGCGA TGAGCTGGGG
CTGACATTAC GTGCGGTAGG GCAACTTGGC CTGATGTGCC GTTGGCTAAT TAACGATGTC
TCAAGCCAGG TGTCCAGTGT CAGAAATGGC AGTGAGACGC TGGCGAAAGG CACCGATGAA
CTGAACGAAC ATACCCAGCA GACAGTTGAT AACGTTCAGC AAACGGTGGC GACCATGAAC
CAAATGGCGG CGTCGGTGAA ACAGAACTCT GCCACGGCGT CGGCTGCCGA TAAACTGTCA
ATCACTGCCA GTAATGCGGC AGTGCAGGGT GGGGAGGCGA TGACCACGGT GATCAAGACA
ATGGACGATA TCGCCGACAG TACCCAGCGC ATTGGCACCA TTACTTCGCT GATTAACGAT
ATTGCGTTTC AGACCAATAT TCTGGCCCTG AATGCGGCGG TGGAAGCGGC GCGTGCCGGC
GAACAGGGCA AAGGTTTTGC AGTGGTGGCA GGGGAAGTGC GTCATTTAGC CAGCCGCAGC
GCTAATGCTG CCAACGATAT TCGCAAGCTG ATTGATGCCA GTGCTGATAA GGTGCAATCC
GGTTCGCAGC AGGTACACGC CGCCGGACGG ACGATGGAAG ATATTGTGGC ACAGGTGAAA
AACGTCACCC AGTTGATCGC CCAGATTAGC CATTCAACGC TGGAACAGGC CGATGGGCTT
TCCAGCCTGA CCCGTGCAGT GGATGAGCTT AACCTGATCA CCCAGAAAAA TGCCGAGCTG
GTGGAAGAGA GTGCGCAGGT GTCGGCGATG GTGAAACACC GCGCCAGCCG ACTGGAAGAC
GCGGTGACGG TACTGCATTA A
 
Protein sequence
MSSHPYVTQQ NTPLADDTTL MSTTDLQSYI THANDTFVQV SGYTLQELQG QPHNMVRHPD 
MPKAAFADMW FTLKKGEPWS GIVKNRRKNG DHYWVRANAV PMVREGKISG YMSIRTRATD
EEIAAVEPLY KALNAGRTSK RIHKGLVVRK GWLGKLPSLP LRWRARGVMT LMFILLAAML
WFVAAPVVTY ILCALVVLLA SACFEWQIVR PIENVAHQAL KVATGERNSV EHLNRSDELG
LTLRAVGQLG LMCRWLINDV SSQVSSVRNG SETLAKGTDE LNEHTQQTVD NVQQTVATMN
QMAASVKQNS ATASAADKLS ITASNAAVQG GEAMTTVIKT MDDIADSTQR IGTITSLIND
IAFQTNILAL NAAVEAARAG EQGKGFAVVA GEVRHLASRS ANAANDIRKL IDASADKVQS
GSQQVHAAGR TMEDIVAQVK NVTQLIAQIS HSTLEQADGL SSLTRAVDEL NLITQKNAEL
VEESAQVSAM VKHRASRLED AVTVLH