Gene B21_01397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01397 
SymboltehA 
ID8114861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1454044 
End bp1455036 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content53% 
IMG OID644847640 
Producthypothetical protein 
Protein accessionYP_002999213 
Protein GI251784909 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1275] Tellurite resistance protein and related permeases 
TIGRFAM ID[TIGR00816] C4-dicarboxylate transporter/malic acid transport protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGAGCG ATAAAGTGCT CAATTTGCCG GCAGGCTACT TTGGTATTGT GTTGGGGACG 
ATAGGGATGG GATTTGCCTG GCGCTATGCC AGCCAGGTTT GGCAGGTCAG CCACTGGTTA
GGGGATGGGC TGGTGATTCT GGCGATGATC ATCTGGGGAT TATTGACTAG CGCATTTATT
GCCCGACTCA TACGCTTTCC GCATAGCGTG CTGGCGGAAG TTCGCCATCC AGTGCTGAGC
AGTTTTGTGA GTTTGTTTCC GGCAACGACG ATGCTGGTGG CGATTGGTTT TGTTCCGTGG
TTTCGCCCAC TGGCGGTGTG CCTGTTCAGC TTTGGTGTCG TGGTTCAGTT GGCTTATGCC
GCCTGGCAAA CTGCGGGATT ATGGCGCGGA TCTCACCCTG AAGAAGCTAC CACGCCTGGA
CTGTATCTGC CGACAGTTGC CAACAACTTT ATCAGCGCAA TGGCCTGTGG TGCGTTGGGC
TACACCGACG CCGGTCTGGT GTTTTTAGGC GCAGGCGTTT TCTCATGGCT AAGCCTTGAA
CCGGTGATCT TGCAGCGTCT GCGTAGTTCG GGAGAATTAC CCACGGCACT GCGGACATCA
CTCGGCATTC AGCTCGCTCC TGCGCTGGTG GCCTGTAGTG CCTGGCTGAG CGTCAACGGC
GGCGAGGGTG ACACGCTGGC GAAAATGCTT TTCGGTTATG GACTGCTGCA ACTGCTGTTT
ATGCTACGTC TGATGCCATG GTATCTCTCC CAGCCATTTA ATGCTTCATT CTGGAGTTTC
TCGTTCGGCG TATCTGCACT GGCAACCACC GGTTTGCATC TGGGGAGTGG CAGCGATAAT
GGATTTTTCC ATACGCTGGC GGTGCCGCTG TTTATCTTTA CCAATTTTAT TATTGCAATA
CTGCTCATCC GTACTTTTGC GCTTCTGATG CAGGGAAAAT TGTTAGTCAG AACCGAGCGC
GCCGTTTTAA TGAAAGCAGA GGACAAAGAA TGA
 
Protein sequence
MQSDKVLNLP AGYFGIVLGT IGMGFAWRYA SQVWQVSHWL GDGLVILAMI IWGLLTSAFI 
ARLIRFPHSV LAEVRHPVLS SFVSLFPATT MLVAIGFVPW FRPLAVCLFS FGVVVQLAYA
AWQTAGLWRG SHPEEATTPG LYLPTVANNF ISAMACGALG YTDAGLVFLG AGVFSWLSLE
PVILQRLRSS GELPTALRTS LGIQLAPALV ACSAWLSVNG GEGDTLAKML FGYGLLQLLF
MLRLMPWYLS QPFNASFWSF SFGVSALATT GLHLGSGSDN GFFHTLAVPL FIFTNFIIAI
LLIRTFALLM QGKLLVRTER AVLMKAEDKE