Gene B21_02077 details

Gene Information

Locus tag: B21_02077
Symbol: yejM
ID: 8114709
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21
Kingdom: Bacteria
Replicon accession: NC_012892
Strand:
Start bp: 2178535
End bp: 2180295
Gene Length: 1761 bp
Protein Length: 586 aa
Translation table: 11
GC content: 52%
IMG OID: 644848285
Product: hypothetical protein
Protein accession: YP_002999858
Protein GI: 251785554
COG category: [R] General function prediction only
COG ID: [COG3083] Predicted hydrolase of alkaline phosphatase superfamily
TIGRFAM ID:
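
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both are assumptions that happen to agree with the listed values, not statements taken from the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2178535
END_BP = 2180295
GENE_LENGTH_BP = 1761
PROTEIN_LENGTH_AA = 586


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span covers 1761 bp, which is 587 codons, one of which is the stop codon.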


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0973017
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAACTC ATCGTCAGCG CTACCGTGAA AAAGTCTCCC AGATGGTCAG TTGGGGGCAC 
TGGTTTGCAC TGTTCAATAT TCTGCTTTCG CTCGTCATTG GCAGCCGTTA CCTGTTTATC
GCCGACTGGC CGACAACGCT TGCTGGTCGC ATTTATTCCT ACGTAAGCAT TATCGGCCAT
TTCAGCTTCC TGGTGTTCGC CACCTACTTG CTGATCCTCT TCCCGCTGAC CTTTATCGTC
GGCTCCCAGA GGCTGATGAG GTTTTTGTCC GTCATTCTGG CAACGGCGGG AATGACGCTA
TTACTGATCG ATAGCGAAGT CTTTACTCGT TTCCATCTCC ATCTTAATCC CATCGTCTGG
CAACTGGTTA TCAACCCAGA CGAAAATGAG ATGGCGCGCG ACTGGCAGCT GATGTTCATC
AGCGTGCCGG TTATTTTATT GCTTGAACTG GTGTTTGCGA CGTGGAGCTG GCAAAAGCTG
CGCAGCCTGA CGCGTCGTCG ACGCTTCGCG CGCCCGCTGG CCGCATTCTT ATTTATCGCC
TTTATCGCCT CGCATGTGGT GTATATCTGG GCCGATGCCA ACTTCTATCG CCCGATCACC
ATGCAGCGCG CTAACCTGCC GCTTTCGTAC CCGATGACGG CGCGACGTTT TCTTGAGAAG
CATGGTCTGC TTGATGCGCA GGAGTATCAA CGCCGTCTTA TTGAGCAAGG TAATCCAGAC
GCCGTTTCCG TTCAGTATCC GTTAAGCGAA CTGCGCTATC GCGATATGGG CACCGGGCAG
AATGTGCTGT TGATTACTGT CGATGGCCTG AACTACTCAC GCTTCGAGAA GCAGATGCCT
GCGCTGGCAG GTTTTGCTGA GCAAAATATT TCGTTCACGC GCCATATGAG CTCCGGCAAC
ACTACAGACA ACGGCATCTT TGGCCTGTTC TATGGCATCT CGCCGAGCTA TATGGACGGC
ATTCTGTCGA CCCGTACGCC TGCGGCATTA ATTACTGCGC TTAATCAGCA AGGCTATCAG
CTGGGGTTAT TCTCATCAGA TGGCTTTACC AGCCCGCTGT ATCGCCAGGC ATTGTTGTCA
GATTTCTCGA TGCCGAGCGT ACGCACCCAA TCCGACGAGC AGACCGCCAC GCAGTGGATC
AACTGGCTGG GACGCTACGC ACAAGAAGAT AACCGCTGGT TCTCGTGGGT TTCTTTCAAT
GGTACTAACA TTGACGACAG CAATCAGCAG GCATTTGCAC GGAAATATAG CCGGGCGGCA
GGCAATGTCG ATGACCAGAT CAACCGCGTG CTCAATGCAC TGCGTGATTC TGGCAAACTG
GACAATACGG TAGTGATTAT CACTGCCGGT CGGGGTATTC CACTGAGCGA AGAGGAAGAA
ACCTTTGACT GGTCCCACGG TCATCTGCAG GTACCATTAG TGATTCACTG GCCAGGCACG
CCGGCGCAGC GTATTAATGC TCTGACGGAT CATACCGATC TGATGACGAC ACTGATGCAA
CGCCTGCTAC ATGTCAGCAC ACCTGCCAGC GAATATTCGC AAGGTCAGGA TTTGTTCAAC
CCTCAACGCC GTCATTACTG GGTTACCGCA GCGGATAACG ATACGCTGGC AATTACCACC
CCGAAAAAGA CGCTGGTGCT GAACAATAAC GGTAAATACC GCACTTACAA CTTACGTGGT
GAAAGAGTGA AAGATGAAAA ACCACAGTTA AGTTTGTTAT TGCAAGTGCT GACAGACGAG
AAGCGTTTTA TCGCTAACTG A
 
Protein sequence
MVTHRQRYRE KVSQMVSWGH WFALFNILLS LVIGSRYLFI ADWPTTLAGR IYSYVSIIGH 
FSFLVFATYL LILFPLTFIV GSQRLMRFLS VILATAGMTL LLIDSEVFTR FHLHLNPIVW
QLVINPDENE MARDWQLMFI SVPVILLLEL VFATWSWQKL RSLTRRRRFA RPLAAFLFIA
FIASHVVYIW ADANFYRPIT MQRANLPLSY PMTARRFLEK HGLLDAQEYQ RRLIEQGNPD
AVSVQYPLSE LRYRDMGTGQ NVLLITVDGL NYSRFEKQMP ALAGFAEQNI SFTRHMSSGN
TTDNGIFGLF YGISPSYMDG ILSTRTPAAL ITALNQQGYQ LGLFSSDGFT SPLYRQALLS
DFSMPSVRTQ SDEQTATQWI NWLGRYAQED NRWFSWVSFN GTNIDDSNQQ AFARKYSRAA
GNVDDQINRV LNALRDSGKL DNTVVIITAG RGIPLSEEEE TFDWSHGHLQ VPLVIHWPGT
PAQRINALTD HTDLMTTLMQ RLLHVSTPAS EYSQGQDLFN PQRRHYWVTA ADNDTLAITT
PKKTLVLNNN GKYRTYNLRG ERVKDEKPQL SLLLQVLTDE KRFIAN
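
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It builds the codon table from the conventional compact encoding (bases in the order T, C, A, G). Translation table 11 uses the same amino-acid assignments as the standard code for these codons and differs mainly in the start codons it permits. The helper name `translate` is hypothetical, and only the first and last rows of the gene sequence are included to keep the example short.

```python
from itertools import product

# Compact codon table: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last rows of the gene sequence, copied from the record above.
FIRST_ROW = "ATGGTAACTC ATCGTCAGCG CTACCGTGAA AAAGTCTCCC AGATGGTCAG TTGGGGGCAC"
LAST_ROW = "AAGCGTTTTA TCGCTAACTG A"

if __name__ == "__main__":
    print(translate(FIRST_ROW))  # MVTHRQRYREKVSQMVSWGH
    print(translate(LAST_ROW))   # KRFIAN
```

Because every full row holds 60 bases (20 codons), each row starts in frame, so the first and last rows can be translated independently and compared with the start and end of the protein sequence.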