Gene B21_03899 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03899 
SymbolactP 
ID8115752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp4188754 
End bp4190403 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content55% 
IMG OID644850052 
Producthypothetical protein 
Protein accessionYP_003001625 
Protein GI251787321 
COG category[R] General function prediction only 
COG ID[COG4147] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02711] cation/acetate symporter ActP 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.790578 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAG TTCTGACGGC GCTTGCCGCC ACACTCCCTT TCGCAGCTAA CGCCGCGGAT 
GCTATTAGCG GGGCCGTAGA GCGCCAGCCA ACGAACTGGC AGGCGATTAT TATGTTCCTG
ATTTTCGTCG TGTTTACGCT CGGCATTACT TACTGGGCAT CAAAACGCGT ACGTTCTCGT
AGCGACTACT ACACGGCAGG CGGCAATATC ACAGGCTTCC AGAACGGGCT GGCGATTGCC
GGTGACTATA TGTCCGCCGC CTCATTTTTG GGGATCTCCG CACTGGTGTT TACCTCCGGC
TATGACGGCT TAATTTACTC GCTGGGCTTC CTGGTGGGCT GGCCGATCAT TTTGTTCCTG
ATTGCCGAAC GTCTGCGTAA CCTGGGACGC TACACCTTTG CCGATGTAGC CTCTTACCGC
CTGAAACAAG GGCCGATTCG TATTCTTTCG GCCTGTGGTT CTCTGGTGGT GGTGGCGCTT
TACCTTATCG CTCAGATGGT AGGCGCAGGT AAACTGATCG AGCTGCTGTT TGGCCTTAAC
TATCACATTG CGGTGGTGCT GGTCGGCGTG CTGATGATGA TGTATGTCCT GTTCGGCGGC
ATGCTGGCGA CCACCTGGGT GCAAATTATT AAAGCCGTGC TGTTACTGTT CGGTGCCAGC
TTTATGGCCT TTATGGTGAT GAAACACGTT GGCTTTAGCT TCAACAATCT GTTCAGCGAA
GCGATGGCGG TACACCCGAA AGGGGTCGAC ATCATGAAGC CGGGCGGGCT GGTGAAAGAT
CCGATCTCCG CGCTCTCTCT GGGTCTGGGA CTGATGTTTG GTACGGCGGG CTTGCCGCAC
ATTCTGATGC GCTTCTTTAC AGTCAGCGAT GCCCGCGAAG CACGTAAGAG CGTGTTCTAC
GCCACCGGGT TTATGGGCTA CTTCTATATT CTGACCTTTA TTATCGGCTT CGGCGCGATC
ATGCTGGTTG GTGCGAATCC GGAATATAAA GACGCGGCGG GCCATCTGAT TGGTGGTAAC
AACATGGCGG CCGTTCACCT GGCGAATGCA GTGGGCGGCA ACCTGTTCCT CGGTTTTATT
TCAGCGGTTG CTTTCGCCAC TATCCTCGCG GTGGTTGCGG GTCTGACGCT GGCGGGCGCA
TCCGCGGTTT CGCATGACTT GTACGCTAAC GTCTTCAAAA AAGGCGCGAC CGAACGTGAA
GAGCTGCGGG TATCAAAAAT CACCGTACTG ATCCTCGGCG TGATTGCGAT TATCCTCGGC
GTGCTGTTTG AGAATCATAA CATCGCCTTT ATGGTGGGGC TGGCGTTTGC CATCGCGGCG
AGCTGTAACT TCCCGATCAT TCTGCTTTCT ATGTACTGGT CGAAACTGAC CACGCGTGGC
GCGATGATGG GTGGCTGGCT GGGGCTGATT ACCGCAGTAG TACTGATGAT CCTCGGCCCG
ACGATTTGGG TACAGATCCT TGGTCACGAA AAAGCCATCT TCCCGTATGA ATACCCGGCG
CTGTTCTCTA TCACCGTGGC ATTCCTCGGC ATCTGGTTCT TCTCGGCAAC CGATAACTCA
GCGGAAGGGG CGCGCGAGCG TGAACTGTTC CGCGCGCAGT TTATCCGCTC CCAGACCGGC
TTTGGCGTTG AGCAAGGCCG CGCACATTAA
 
Protein sequence
MKRVLTALAA TLPFAANAAD AISGAVERQP TNWQAIIMFL IFVVFTLGIT YWASKRVRSR 
SDYYTAGGNI TGFQNGLAIA GDYMSAASFL GISALVFTSG YDGLIYSLGF LVGWPIILFL
IAERLRNLGR YTFADVASYR LKQGPIRILS ACGSLVVVAL YLIAQMVGAG KLIELLFGLN
YHIAVVLVGV LMMMYVLFGG MLATTWVQII KAVLLLFGAS FMAFMVMKHV GFSFNNLFSE
AMAVHPKGVD IMKPGGLVKD PISALSLGLG LMFGTAGLPH ILMRFFTVSD AREARKSVFY
ATGFMGYFYI LTFIIGFGAI MLVGANPEYK DAAGHLIGGN NMAAVHLANA VGGNLFLGFI
SAVAFATILA VVAGLTLAGA SAVSHDLYAN VFKKGATERE ELRVSKITVL ILGVIAIILG
VLFENHNIAF MVGLAFAIAA SCNFPIILLS MYWSKLTTRG AMMGGWLGLI TAVVLMILGP
TIWVQILGHE KAIFPYEYPA LFSITVAFLG IWFFSATDNS AEGARERELF RAQFIRSQTG
FGVEQGRAH