Gene B21_04061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_04061 
SymbolytfR 
ID8116683 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4360764 
End bp4362266 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content55% 
IMG OID644850209 
Producthypothetical protein 
Protein accessionYP_003001782 
Protein GI251787478 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACTA ACCAACACCA GGAGATCCTC CGCACCGAAG GATTAAGTAA ATTTTTCCCC 
GGCGTCAAAG CGTTAGACAA CGTTGATTTC AGCCTGCGCC GCGGCGAAAT CATGGCGCTG
CTCGGTGAAA ACGGGGCAGG AAAATCAACG CTAATCAAAG CATTAACTGG TGTATACCAC
GCCGATCGCG GCACCATCTG GCTGGAAGGC CAGGCTATCT CACCGAAAAA TACCGCCCAT
GCACAACAAC TCGGTATCGG CACCGTCTAT CAGGAAGTAA ACCTGCTACC CAATATGTCG
GTCGCTGATA ATCTATTTAT AGGCCGCGAA CCCAAACGCT TCGGCCTTCT ACGCCGCAAA
GAGATGGAAA AGCGCGCCAC CGAACTGATG GCATCTTACG GTTTCTCCCT CGACGTGCGC
GAACCGCTCA ACCGCTTTTC AGTCGCGATG CAGCAAATCG TCGCTATTTG CCGGGCTATC
GATCTCTCTG CCAAAGTGCT GATCCTCGAT GAACCCACCG CCAGTCTCGA CACCCAGGAA
GTGGAGTTAC TGTTTGACCT GATGCGTCAG TTGCGCGATC GCGGCGTCAG CCTGATTTTT
GTCACTCACT TTCTCGATCA GGTCTATCAG GTCAGCGATC GGATCACCGT CTTACGCAAC
GGCAGTTTCG TAGGCTGTCG GGAAACGTGC GAGCTACCGC AGATCGAACT GGTAAAAATG
ATGCTGGGGC GCGAGCTGGA TACCCACGCG CTACAGCGTG CCGGGCGAAC ATTGTTGAGC
GACAAACCCG TTGCCGCGTT CAAAAATTAC GGCAAAAAAG GAACGATCGC ACCGTTTGAT
CTCGAAGTAC GCCCCGGCGA GATCGTCGGT CTGGCTGGAT TGCTGGGATC AGGACGTACC
GAAACCGCCG AAGTGATCTT CGGTATCAAA CCTGCTGACA GCGGCACGGC GTTGATCAAA
GGCAAACCGC AAAACCTGCG ATCGCCACAT CAGGCTTCGG TACAGGGCAT TGGCTTCTGC
CCGGAAGACA GGAAAACCGA TGGCATCATC GCTGCCGCCT CGGTGCGGGA AAATATCATC
CTCGCTCTCC AGGCCCAGCG CGGCTGGCTA CGTCCCATTT CCCGCAAAGA ACAGCAAGAG
ATTGCCGAAC GCTTTATCCG CCAGCTTGGC ATTCGCACAC CTTCAACTGA ACAACCGATT
GAATTTCTCT CTGGCGGCAA TCAGCAAAAA GTGTTGCTTT CACGTTGGCT ACTGACCCGA
CCGCAATTTC TGATCCTTGA TGAGCCAACT CGCGGCATTG ATGTTGGTGC CCACGCCGAG
ATCATCCGCC TGATTGAAAC GCTATGCGCC GATGGTCTGG CGCTGCTGGT GATCTCCTCC
GAACTGGAAG AACTGGTGGG CTATGCCGAC CGGGTGATCA TCATGCGCGA TCGCAAACAG
GTGGCGGAGA TCCCGCTGGC AGAGCTTTCC GTTCCGGCGA TCATGAACGC CATTGCGGCG
TAA
 
Protein sequence
MTTNQHQEIL RTEGLSKFFP GVKALDNVDF SLRRGEIMAL LGENGAGKST LIKALTGVYH 
ADRGTIWLEG QAISPKNTAH AQQLGIGTVY QEVNLLPNMS VADNLFIGRE PKRFGLLRRK
EMEKRATELM ASYGFSLDVR EPLNRFSVAM QQIVAICRAI DLSAKVLILD EPTASLDTQE
VELLFDLMRQ LRDRGVSLIF VTHFLDQVYQ VSDRITVLRN GSFVGCRETC ELPQIELVKM
MLGRELDTHA LQRAGRTLLS DKPVAAFKNY GKKGTIAPFD LEVRPGEIVG LAGLLGSGRT
ETAEVIFGIK PADSGTALIK GKPQNLRSPH QASVQGIGFC PEDRKTDGII AAASVRENII
LALQAQRGWL RPISRKEQQE IAERFIRQLG IRTPSTEQPI EFLSGGNQQK VLLSRWLLTR
PQFLILDEPT RGIDVGAHAE IIRLIETLCA DGLALLVISS ELEELVGYAD RVIIMRDRKQ
VAEIPLAELS VPAIMNAIAA