Gene B21_04139 details

Gene Information

Locus tag: B21_04139
Symbol: yjhT
ID: 8114977
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 4444252
End bp: 4445358
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 44%
IMG OID: 644850283
Product: hypothetical protein
Protein accession: YP_003001856
Protein GI: 251787552
COG category: [S] Function unknown
COG ID: [COG3055] Uncharacterized protein conserved in bacteria
TIGRFAM ID: [TIGR03547] mutarotase, YjhT family
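
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked against one another. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS length includes the stop codon (the gene sequence in this record ends in TAA, which is consistent with that assumption). The function names are hypothetical and not part of any database API.

```python
# Coordinates copied from the record.
START_BP = 4444252
END_BP = 4445358


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of gene_len bases.

    Assumes an unspliced CDS; the stop codon is not translated.
    """
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    return codons - 1 if includes_stop else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Both values agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields.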


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAAA CAATAACGGC GCTTGCTATC ATGATGGCTT CATTTGCCGC AAACGCGTCT 
GTATTACCGG AAACTCCTGT GCCATTTAAA AGTGGTACCG GAGCAATTGA TAACGACACT
GTCTACATTG GTTTAGGTAG CGCAGGTACG GCATGGTACA AGCTGGATAC ACAGGCCAAA
GATAAAAAAT GGACAGCGTT AGCTGCATTC CCTGGCGGAC CAAGAGATCA AGCAACCTCT
GCATTTATTG ATGGCAATCT GTATGTGTTT GGCGGCATTG GCAAAAACAG CGAGGGCTTG
ACTCAGGTAT TTAATGACGT ACACAAATAC AACCCCAAAA CCAATAGTTG GGTTAAATTG
ATGTCGCACG CGCCGATGGG CATGGCGGGC CATGTGACTT TTGTACACAA CGGCAAGGCT
TATGTTACTG GTGGTGTTAA CCAGAATATC TTCAATGGCT ATTTTGAAGA TCTCAACGAG
GCTGGAAAAG ATTCAACCGC TATAGATAAA ATCAATGCTC ACTATTTTGA CAAAAAAGCA
GAAGATTATT TCTTCAATAA GTTTCTGTTG TCTTTTGATC CCTCAACACA GCAATGGAGT
TACGCTGGCG AATCGCCCTG GTACGGAACG GCTGGTGCGG CGGTTGTGAA TAAAGGTGAT
AAAACCTGGC TTATTAATGG CGAAGCCAAA CCAGGATTGC GAACGGATGC CGTATTTGAA
CTTGATTTCA CCGGTAATAA TTTAAAATGG AATAAGCTTG CTCCCGTCTC ATCACCAGAT
GGCGTAGCTG GCGGTTTTGC GGGGATAAGC AATGATTCTC TTATATTTGC CGGAGGGGCC
GGATTCAAAG GTTCACGAGA AAATTACCAG AACGGTAAGA ACTATGCGCA TGAAGGCCTG
AAAAAATCAT ATAGCACTGA TATTCATCTT TGGCATAACG GGAAATGGGA TAAATCGGGT
GAATTATCGC AAGGTCGGGC CTACGGAGTA TCATTGCCCT GGAATAATAG TCTATTGATT
ATTGGCGGTG AAACTGCAGG CGGCAAAGCG GTGACGGATT CAGTTTTGAT CACTGTGAAG
GATAATAAAG TCACAGTACA AAACTAA
 
Protein sequence
MNKTITALAI MMASFAANAS VLPETPVPFK SGTGAIDNDT VYIGLGSAGT AWYKLDTQAK 
DKKWTALAAF PGGPRDQATS AFIDGNLYVF GGIGKNSEGL TQVFNDVHKY NPKTNSWVKL
MSHAPMGMAG HVTFVHNGKA YVTGGVNQNI FNGYFEDLNE AGKDSTAIDK INAHYFDKKA
EDYFFNKFLL SFDPSTQQWS YAGESPWYGT AGAAVVNKGD KTWLINGEAK PGLRTDAVFE
LDFTGNNLKW NKLAPVSSPD GVAGGFAGIS NDSLIFAGGA GFKGSRENYQ NGKNYAHEGL
KKSYSTDIHL WHNGKWDKSG ELSQGRAYGV SLPWNNSLLI IGGETAGGKA VTDSVLITVK
DNKVTVQN
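
The sequences above are printed in blocks of ten residues separated by spaces. The sketch below shows one way to strip that layout and compute a GC fraction, which could then be compared with the GC content field of the record. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input; the result for the full sequence is not asserted here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, copied from the record.
example = "ATGAATAAAA CAATAACGGC"
cleaned = clean_sequence(example)
print(len(cleaned), f"{gc_fraction(cleaned):.0%}")
```

To check the whole gene, paste the full Gene sequence block into `clean_sequence`; the cleaned length should equal the Gene Length field.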