Gene B21_03833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03833 
Symbolybl202 
ID8116698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4108060 
End bp4109325 
Gene Length1266 bp 
Protein Length422 aa 
Translation table11 
GC content55% 
IMG OID644849992 
Producthypothetical protein 
Protein accessionYP_003001565 
Protein GI251787261 
COG category[T] Signal transduction mechanisms 
COG ID[COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0772561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTTA TGCAACGTTC TAAAGACTCC TTAGCTAAAT GGTTAAGCGC GATCCTCCCC 
GTGGTCATTG TTGGGCTGGT GGGGCTGTTT GCGGTGACGG TGATTCGCGA TTATGGGCGC
GAGACTGCCG CCGCCAGACA AACGCTGCTG GAAAAAGGCA GTGTACTTAT TCGCGCTCTT
GAATCCGGCT CGCGCGTCGG CATGGGGATG CGCATGCATC ATGCGCAGCA GCAGGCCTTA
CTGGAAGAAA TGGCCGGGCA GCCTGGAGTA CGTTGGTTTG CGGTCACGGA TGAACAAGGA
ACAATCGTGA TGCATAGCAA CTCCGGCATG GTGGGAAAAC AGCTTTATTC CCCGCAGGAA
ATGCAGCAGT TACATCCGGG AGATGAAGAA GCGTGGCGGC GGATCGATAG CGCAGACGGC
GAGCCTGTTC TGGAAATTTA TCGCCAGTTT CAACCGATGT TTGCTGCTGG AATGCACCGG
ATGCGCCATA TGCAGCAGTA TGCCGCGACA CCACAAGCAA TTTTCATCGC TTTCGACGCC
AGTAATATTG TGAGTGCCGA AGATCGTGAG CAGAGAAACA CCCTGATTAT CCTCTTCGCC
CTGGCGACGG TCTTGCTGGC AAGCGTGTTG TCATTCTTCT GGTATCGCCG CTATCTGCGC
TCGCGCCAGC TGTTGCAGGA TGAAATGAAG CGCAAAGAGA AGCTGGTGGC ACTGGGGCAT
CTTGCAGCAG GCGTTGCCCA CGAAATCCGT AATCCACTTT CCTCAATTAA AGGGCTGGCG
AAATACTTTG CCGAACGCGC GCCAGCAGGG GGAGAAGCGC ATCAATTGGC GCAGGTGATG
GCGAAAGAAG CCGACCGTTT AAACCGTGTG GTAAGCGAGT TGCTGGAACT GGTTAGGCCA
ACGCATCTGG CTTTGCAGGC GGTGGATCTC AACACGCTGA TTAACCACTC ATTACAGCTG
GTAAGCCAGG ATGCAAACAG CCGGGAGATC CAGTTACGCT TTACCGCCAA CGACACATTA
CCGGTAATTC AGGCCGACCC AGACAGGCTG ACTCAGGTCC TGTTGAATCT CTATCTCAAT
GCTATTCAGG CGATTGGTCA GCATGGTGTG ATTAGCGTGA CGGCCAGCGA AAGCGGCGCG
GGCGTGAAAA TCAGCGTTAC CGACAGCGGT AAGGGAATTG CGGCAGGTCA GCTTGAAGCC
ATCTTCACTC CGTACTTCAC CACCAAAGCC GAAGGCACCG GATTGGGGCT GGCGGTCGTG
CATAAT
 
Protein sequence
MRFMQRSKDS LAKWLSAILP VVIVGLVGLF AVTVIRDYGR ETAAARQTLL EKGSVLIRAL 
ESGSRVGMGM RMHHAQQQAL LEEMAGQPGV RWFAVTDEQG TIVMHSNSGM VGKQLYSPQE
MQQLHPGDEE AWRRIDSADG EPVLEIYRQF QPMFAAGMHR MRHMQQYAAT PQAIFIAFDA
SNIVSAEDRE QRNTLIILFA LATVLLASVL SFFWYRRYLR SRQLLQDEMK RKEKLVALGH
LAAGVAHEIR NPLSSIKGLA KYFAERAPAG GEAHQLAQVM AKEADRLNRV VSELLELVRP
THLALQAVDL NTLINHSLQL VSQDANSREI QLRFTANDTL PVIQADPDRL TQVLLNLYLN
AIQAIGQHGV ISVTASESGA GVKISVTDSG KGIAAGQLEA IFTPYFTTKA EGTGLGLAVV
HN