Gene B21_02954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02954 
SymbolagaS 
ID8115350 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3149681 
End bp3150835 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content57% 
IMG OID644849139 
Producthypothetical protein 
Protein accessionYP_003000712 
Protein GI251786408 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2222] Predicted phosphosugar isomerases 
TIGRFAM ID[TIGR02815] putative sugar isomerase, AgaS family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAGAAA ATTACACCCC TGCTGCCGCC GCAACCGGTA CATGGACTGA AGAAGAGATC 
CGCCATCAGC CTCGCGCATG GATCCGTTCA CTCACCAACA TCGACGCGCT ACGTTCCGCG
CTCAATAACT TCCTTGAACC GTTACTGCGC AAAGAGAATC TGCGGATCAT CCTGACCGGA
GCCGGAACGT CGGCATTTAT CGGTGACATC ATCGCGCCGT GGCTCGCCAG CCATACCGGT
AAAAACTTCA GCGCCGTACC GACCACCGAT CTGGTCACCA ATCCGATGGA CTACCTGAAC
CCAGCCCATC CGCTGCTGTT GATCTCCTTC GGTCGATCCG GCAACAGCCC GGAAAGCGTC
GCTGCCGTGG AACTGGCAAA TCAATTTGTA CCAGAATGCT ATCACCTGCC GATCACCTGC
AACGAAGCGG GCGCTCTTTA CCAAAACGCG ATCAACAGCG ATAACGCGTT TGCCCTGCTG
ATGCCCGCAG AAACGCACGA TCGCGGCTTT GCGATGACCA GCAGCATTAC CACCATGATG
GCCAGCTGCC TCGCGGTTTT CGCACCTGAG ACGATCAACA GCCAAACCTT CCGCGACGTG
GCGGATCGTT GCCAGGCGAT CCTGACCTCA CTGGGCGATT TCAGCGAAGG TGTGTTTGGT
TACGCACCGT GGAAACGGAT CGTTTATCTC GGCAGCGGTG GCTTACAGGG CGCAGCACGC
GAGTCGGCGC TGAAAGTGCT GGAACTGACT GCGGGTAAAC TGGCGGCCTT CTATGATTCC
CCGACCGGAT TCCGTCATGG CCCGAAATCA CTGGTCGATA ACGAAACGCT GGTGGTGGTA
TTTGTCTCAA GCCACCCTTA CACCCGTCAG TATGATCTTG ATCTGCTGGC TGAACTCCGC
CGTGACAACC AGGCAATGCG CGTAATCGCC ATCGCCGCGG AAAGCACCGA CATCGTCGCT
GCCGGTCCAC ATATTATCCT GCCGCCGTCA CGTCACTTTA TCGACGTTGA GCAGGCATTT
TGCTTCCTGA TGTACGCCCA GACGTTTGCA CTGATGCAGT CGCTGCACAT GGGCAATACG
CCGGATACCC CATCAGCCAG CGGCACCGTT AACCGCGTGG TGCAAGGCGT AATCATTCAT
CCGTGGCAGG CATAA
 
Protein sequence
MPENYTPAAA ATGTWTEEEI RHQPRAWIRS LTNIDALRSA LNNFLEPLLR KENLRIILTG 
AGTSAFIGDI IAPWLASHTG KNFSAVPTTD LVTNPMDYLN PAHPLLLISF GRSGNSPESV
AAVELANQFV PECYHLPITC NEAGALYQNA INSDNAFALL MPAETHDRGF AMTSSITTMM
ASCLAVFAPE TINSQTFRDV ADRCQAILTS LGDFSEGVFG YAPWKRIVYL GSGGLQGAAR
ESALKVLELT AGKLAAFYDS PTGFRHGPKS LVDNETLVVV FVSSHPYTRQ YDLDLLAELR
RDNQAMRVIA IAAESTDIVA AGPHIILPPS RHFIDVEQAF CFLMYAQTFA LMQSLHMGNT
PDTPSASGTV NRVVQGVIIH PWQA