Gene B21_03347 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_03347 
SymboleptB 
ID8114549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp3569014 
End bp3570705 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content50% 
IMG OID644849521 
Producthypothetical protein 
Protein accessionYP_003001094 
Protein GI251786790 
COG category[R] General function prediction only 
COG ID[COG2194] Predicted membrane-associated, metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATACA TCAAATCGAT TACACAGCAG AAGCTGAGCT TTTTGCTTGC AATCTATATT 
GGCCTTTTTA TGAATGGCGC GGTTTTTTAC CGCCGCTTCG GCAGCTATGC GCACGATTTT
ACCGTCTGGA AAGGCATTTC TGCTGTTGTT GAACTGGCCG CCACCGTACT GGTGACCTTC
TTTTTACTAC GTCTTCTTTC GCTGTTTGGC CGCCGCAGCT GGCGTATTCT GGCATCGCTG
GTGGTGCTCT TTTCCGCAGG TGCCAGCTAT TACATGACCT TCCTTAATGT GGTCATTGGT
TATGGCATCA TCGCTTCCGT CATGACCACC GATATCGACC TGTCAAAAGA AGTTGTTGGT
CTGAACTTTA TTCTCTGGTT AATCGCCGTT AGTGCATTGC CTCTTATCCT TATCTGGAAT
AACCGCTGTC GCTACACCTT GCTCCGACAA CTGCGAACCC CAGGGCAGCG TATTCGCAGC
CTGGCGGTCG TCGTACTGGC GGGTATTATG GTTTGGGCAC CGATTCGTTT GCTGGATATC
CAGCAGAAGA AAGTGGAGAG GGCGACCGGC GTTGATTTGC CGAGTTATGG CGGTGTCGTA
GCGAACTCTT ATCTGCCATC AAACTGGCTT TCTGCGTTGG GGCTGTATGC CTGGGCGCGG
GTCGATGAAT CTTCCGATAA TAATTCATTG CTTAATCCGG CGAAGAAATT CACTTATCAG
GCACCGCAAA ACGTTGATGA CACTTATGTC GTGTTTATCA TCGGTGAAAC CACGCGTTGG
GACCATATGG GTATTTTCGG CTATGAGCGT AATACCACGC CGAAACTGGC CCAGGAGAAA
AATCTGGCGG CGTTCCGTGG TTACTCCTGT GATACCGCAA CCAAACTCTC ACTGCGTTGC
ATGTTTGTAC GTCAGGGGGG CGCGGAAGAT AATCCGCAGC GCACATTAAA AGAACAGAAC
ATTTTCGCGG TTCTGAAGCA GTTAGGATTC AGTTCTGACC TCTACGCTAT GCAGAGCGAA
ATGTGGTTCT ACAGCAACAC GATGGCGGAC AACATTGCTT ATCGTGAGCA GATTGGTGCG
GAGCCACGTA ATCGTGGCAA GCCGGTAGAT GATATGTTGC TGGTAGACGA AATGCAGCAA
TCGCTAGGGC GCAACCCGGA TGGTAAGCAT CTGATCATTC TGCATACCAA AGGTTCGCAT
TTTAACTACA CCCAGCGTTA TCCGCGTAGC TTCGCGCAGT GGAAGCCGGA ATGTATTGGT
GTTGATAGCG GCTGTACCAA AGCGCAGATG ATCAACTCCT ATGACAACTC GGTGACCTAT
GTGGATCACT TTATCTCCAG CGTAATTGAT CAGGTTCGCG ATAAGAAAGC GATTGTGTTC
TACGCAGCTG ACCACGGTGA GTCAATTAAT GAACGCGAGC ACCTGCACGG CACGCCGCGT
GAACTGGCAC CGCCGGAGCA GTTCCGCGTA CCGATGATGG TCTGGATGTC AGATAAATAT
CTGGAAAATC CGGCCAATGC GCAGGCGTTT GCGCAGCTGA AAAAAGAAGC CGACATGAAA
GTGCCACGCC GTCACGTAGA GCTGTACGAT ACCATCATGG GTTGTCTTGG CTATACTTCA
CCGGATGGTG GAATTAACGA AAACAACAAC TGGTGTCACA TCCCGCAGGC AAAAGAGGCA
GCGGCTAACT AA
 
Protein sequence
MRYIKSITQQ KLSFLLAIYI GLFMNGAVFY RRFGSYAHDF TVWKGISAVV ELAATVLVTF 
FLLRLLSLFG RRSWRILASL VVLFSAGASY YMTFLNVVIG YGIIASVMTT DIDLSKEVVG
LNFILWLIAV SALPLILIWN NRCRYTLLRQ LRTPGQRIRS LAVVVLAGIM VWAPIRLLDI
QQKKVERATG VDLPSYGGVV ANSYLPSNWL SALGLYAWAR VDESSDNNSL LNPAKKFTYQ
APQNVDDTYV VFIIGETTRW DHMGIFGYER NTTPKLAQEK NLAAFRGYSC DTATKLSLRC
MFVRQGGAED NPQRTLKEQN IFAVLKQLGF SSDLYAMQSE MWFYSNTMAD NIAYREQIGA
EPRNRGKPVD DMLLVDEMQQ SLGRNPDGKH LIILHTKGSH FNYTQRYPRS FAQWKPECIG
VDSGCTKAQM INSYDNSVTY VDHFISSVID QVRDKKAIVF YAADHGESIN EREHLHGTPR
ELAPPEQFRV PMMVWMSDKY LENPANAQAF AQLKKEADMK VPRRHVELYD TIMGCLGYTS
PDGGINENNN WCHIPQAKEA AAN