Gene B21_00471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_00471 
SymbolallC 
ID8115035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp513001 
End bp514236 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content51% 
IMG OID644846753 
Producthypothetical protein 
Protein accessionYP_002998326 
Protein GI251784022 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID[TIGR01879] amidase, hydantoinase/carbamoylase family
[TIGR03176] allantoate amidohydrolase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTACAC ATTTTCGTCA AGCCATAGAA GAAACGCTGC CCTGGCTTTC CTCTTTTGGC 
GCTGACCCAA CGGGTGGTAT GACCCGTTTA CTTTATTCGC CGGAATGGCT GGAAACTCAG
CAGCAATTTA AAAAAAGAAT GGCAGCAAGC GGGCTGGAAA CACGTTTCGA TGAAGTGGGG
AATTTATATG GTCGCCTGAG TGGCACCGAA TATCCACAAG AAGTGGTTCT GAGCGGTTCG
CATATCGATA CCGTGGTTAA CGGCGGTAAC CTCGACGGGC AATTCGGCGC GCTGGCGGCG
TGGCTGGCAA TTGACTGGCT GAAAACGCAA TACGGCGCGC CGCTACGTAC GGTCGAAGTG
GTGGCGATGG CAGAAGAAGA AGGCAGCCGC TTCCCGTATG TCTTCTGGGG CAGTAAAAAT
ATCTTTGGGC TGGCGAATCC TGACGACGTG CGGAATATCT GTGATGCCAA AGGAAATAGT
TTTGTCGATG CGATGAAGGC TTGCGGATTT ACTCTTCCGA ACGCCCCACT AACTCCGCGT
CAGGATATTA AAGCCTTTGT TGAACTGCAT ATTGAACAGG GCTGTGTGCT GGAAAGTAAT
GGGCAATCAA TTGGCGTGGT GAATGCAATT GTCGGGCAGC GTCGTTATAC GGTAACGCTG
AACGGCGAAT CAAACCATGC AGGCACCACG CCGATGGGTT ATCGTCGTGA TACAGTTTAC
GCTTTCAGTC GCATTTGCCA TCAGTCGGTC GAAAAAGCGA AAAGGATGGG CGATCCGCTG
GTTCTGACCT TTGGCAAAGT AGAGCCGCGC CCGAATACGG TAAATGTGGT GCCGGGTAAA
ACCACGTTCA CCATTGATTG TCGTCATACC GACGCTGCCG TGCTGCGCGA TTTCACCCAA
CAGTTAGAAA ACGACATGCG GGCGATTTGC GATGAAATGG ACATTGGTAT TGATATCGAT
TTATGGATGG ACGAAGAACC CGTGCCGATG AATAAGGAGC TGGTCGCCAC CCTGACAGAA
TTGTGTGAAA GAGAAAAACT GAATTACCGG GTGATGCACA GTGGTGCCGG GCACGACGCG
CAAATTTTCG CGCCTCGCGT ACCAACCTGC ATGATTTTCA TTCCCAGCAT CAACGGGATC
AGCCATAACC CGGCGGAACG CACCAATATT ACCGACCTTG CCGAAGGGGT CAAAACGTTG
GCACTCATGC TTTATCAACT TGCCTGGCAG AAATAA
 
Protein sequence
MITHFRQAIE ETLPWLSSFG ADPTGGMTRL LYSPEWLETQ QQFKKRMAAS GLETRFDEVG 
NLYGRLSGTE YPQEVVLSGS HIDTVVNGGN LDGQFGALAA WLAIDWLKTQ YGAPLRTVEV
VAMAEEEGSR FPYVFWGSKN IFGLANPDDV RNICDAKGNS FVDAMKACGF TLPNAPLTPR
QDIKAFVELH IEQGCVLESN GQSIGVVNAI VGQRRYTVTL NGESNHAGTT PMGYRRDTVY
AFSRICHQSV EKAKRMGDPL VLTFGKVEPR PNTVNVVPGK TTFTIDCRHT DAAVLRDFTQ
QLENDMRAIC DEMDIGIDID LWMDEEPVPM NKELVATLTE LCEREKLNYR VMHSGAGHDA
QIFAPRVPTC MIFIPSINGI SHNPAERTNI TDLAEGVKTL ALMLYQLAWQ K